Research

I study efficient reasoning and training methods for foundation models.

I am currently a Visiting Researcher in the Department of Machine Learning at MBZUAI, hosted by Dr. Junpei Komiyama, where I work on reinforcement learning and efficient training of language models. Previously, I worked on latent reasoning in vision-language models with Dr. Biwei Huang at UC San Diego and on inference-time alignment of language models with Dr. Berkay Celik at Purdue University.

I like to understand how foundation models reason, how their reasoning abilities emerge through training, and how we can make that reasoning more efficient and trustworthy. Much of my work spans language and vision, with a focus on training models to be safe and aligned with human values.

News

May 2026 🎉 STARS was accepted at the SPIGM Workshop at ICML 2026.
Feb 2026 🚀 Joined MBZUAI as a Visiting Researcher to work on reasoning in language models.
Oct 2025 📝 Served on the Program Committee for NeurIPS 2025 Workshops.
Oct 2025 🎉 Two papers accepted at NeurIPS workshops: Efficient Reasoning and FM4LS.
Aug 2025 📝 Served on the Program Committee for the AAAI 2026 conference.

Publications

* indicates equal contribution.

  1. Adaptive Blockwise Search: Inference-Time Alignment for Large Language Models
    M. Atif Quamar*, M. Areeb*, N. Sharma, A. Shreekumar, J. Rosenthal, M. Kuznetsov, M. Ozgur Ozmen, and Z. Berkay Celik
    Under Review
  2. STARS: Synchronous Token Alignment for Robust Supervision in Large Language Models
    M. Atif Quamar*, M. Areeb*, M. Kuznetsov, M. Ozgur Ozmen, and Z. Berkay Celik
    ICML 2026 - Structured Probabilistic Inference & Generative Modeling Workshop
  3. Logit–Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning
    M. Atif Quamar and M. Areeb
    NeurIPS 2025 - Efficient Reasoning Workshop
  4. Reliable Chain-of-Thought via Prefix Consistency
    N. Iwase, Y. Ichihara, M. Atif Quamar, and J. Komiyama
    Under Review
  5. Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings
    Y. Shao, K. Zhou, Z. Xu, M. Atif Quamar, S. Hao, Z. Wang, Z. Hu, and B. Huang
    Under Review
  6. Decoding Histone Modification Signatures of Non-Coding RNAs via Foundation Models
    N. Sharma, M. Atif Quamar, and P. Xie
    NeurIPS 2025 - FM4LS Workshop