Research interests
- Efficient multimodal learning (vision + language), compute- and memory-aware fusion
- Adversarial evaluation and robustness of vision-language models under distribution shifts
Current research
Student Researcher, SafeTrip Lab, IIT Roorkee (Jun 2025 – Present)
- Reduced parameters by 52% using anchor-token pruning, Perceiver cross-attention, and gated view pooling
- Results on multi-view VQA: EM 51%, F1 70%, BLEU-4 47.1, ROUGE-L 69.1, CIDEr 2.99
Selected projects
GeoAnchors — Compute-efficient multimodal fusion
Adversarial Evaluation of Vision-Language Models
Aug 2025 – Nov 2025
- Training-free adversarial evaluation on IMP-v1-3B and Qwen2.5-VL-7B
- Answer flip rates: 75.1% (IMP-v1-3B), 38.4% (Qwen2.5-VL-7B)
- Counting deviation (mean absolute shift): 6.74, 1.24
DiffGraph+ — Heterogeneous Graph Diffusion
Feb 2025 – Apr 2025
- Extended DiffGraph (WSDM’25) for heterogeneous graphs with automatic view discovery via Graph Transformer
- DBLP author classification: Micro-F1 78%, Macro-F1 77%, AUC 93%
Flood Forecasting System
Aug 2023 – Nov 2023
- Pixel-level inundation mapping for the Narmada basin; historical F1: 0.86 (2001–2012)
Cminusminus — Programming Language & Compiler
Jan 2024 – Apr 2024
- Compiler toolchain using Lark, WASM backend, unit and integration testing
Waste Segregation System
Feb 2022 – May 2022
- ResNet-50 fine-tuning; 94% validation accuracy; OpenCV + Arduino prototype
Open-source
- PR: huggingface/transformers#42685
- Issue: huggingface/transformers#42629
- Contributed fixes for BLT training and CI stability (KV-cache generation mismatches, initialization, shared training tests).
Experience
Scientific Computing Contractor, Mercor (Jul 2025 – Nov 2025)
Software Engineer Intern, IntentSignal Systems (May 2024 – Sep 2024)
Cloud Operations Intern, Patible AI (May 2023 – Aug 2023)