Recent Updates

Presenting at CAIS on May 26: turning my frustrations with working with too many agents into a research topic. Markdown Mayhem.
May '26
Our position paper highlighting recurring pitfalls in LLM-based planning research was accepted at ICML. Make Planning Research Rigorous Again!
May '26
Three papers accepted at ICAPS 2026. 1. Simplifying Planning Tasks with Fact-Level Relevance Analysis, led by Cameron Allen and Anita de Mello Koch, 2. Automating Thought of Search, led by Daniel Cao and Michael Katz, and 3. Planning in the LLM Era: Building for Reliability and Efficiency, a position paper led by Shirin Sohrabi. Grateful to all my wonderful collaborators!
Mar '26
Thrilled to share that our paper ACPBench-Hard has been accepted to ICLR 2026! 🎉 It's incredibly rewarding to see this work already gaining traction in the research community, with 10,000+ downloads on Hugging Face.
Jan '26
Will be attending NeurIPS 2025 in San Diego. Presenting a tutorial on Planning in the Era of Language Models and showcasing Query Gym at the IBM booth.
Dec '25
The work on Black-Box Uncertainty Quantification for Large Language Models via Ensemble-of-Ensembles, led by Wang Ma in the summer of 2025, has been accepted at the Assessing and Improving Reliability of Foundation Models in the Real World (AIR-FM) Workshop at AAAI 2026.
Nov '25
Gave an invited talk at the NxtAI Conference in San Francisco on LLMs for AI Planning.
Nov '25