I study efficient and reliable reasoning in foundation models.

I am currently a Visiting Researcher at MBZUAI, where I work under the supervision of Prof. Junpei Komiyama. Previously, at UC San Diego, I worked on latent reasoning in vision-language models, and at Purdue University, I focused on inference-time alignment of language models. I also co-founded Insituate, where we shipped agentic systems to banks and the judiciary. I earned my bachelor's degree in Computer Science and Biosciences from IIIT Delhi in 2024.

I like to understand how foundation models reason and how we can make that reasoning more efficient and trustworthy. A lot of what I do spans language and vision, with a focus on keeping these models safe, reliable and aligned with human values.

Interests
  • Multimodal Reasoning
  • Trustworthy AI
  • AI Alignment and Safety
  • Reinforcement Learning
Education
  • B.Tech in Computer Science and Biosciences, 2020-2024

    IIIT - Delhi

Experience

Visiting Researcher

Mohamed bin Zayed University of Artificial Intelligence

Feb 2026 - Present Abu Dhabi, UAE
Exploring self-consistency and uncertainty in language model reasoning.

Research Intern

University of Virginia

Oct 2025 - Feb 2026 Charlottesville, Virginia, United States
Developed a drift-resilient memory framework for code-execution agents using KL-constrained adapter updates, mitigating embedding distribution shifts during online learning to reduce unsafe code generation without compromising task success rates.

Research Intern

University of California - San Diego

Jul 2025 - Oct 2025 San Diego, California, United States
Worked on multimodal reasoning: interleaving text and visuals within the chain of thought so that models can “think” with sketches, diagrams, and images. This work bridges language and vision to solve complex problems with richer, more interpretable reasoning.

Research Intern

Purdue University

Feb 2025 - Jul 2025 West Lafayette, Indiana, United States
Proposed an inference-time alignment method that outperforms Best-of-N decoding by over 30% while reducing reward model calls by 20%. Applied it to align LLMs for harmlessness, improved reasoning, and positive-sentiment generation.

Co-Founder

Insituate

Sep 2023 - Feb 2025 New Delhi, India
Built agentic software for the Supreme Court of India, Mizuho Bank, PNC Bank and Indian High Courts.

Publications


Contact