Recent Updates

Thrilled to share that our paper ACPBench-Hard has been accepted to ICLR 2026! πŸŽ‰ It’s incredibly rewarding to see this work is already gaining traction in the research community β€” with 10,000+ downloads on Hugging Face.
Jan '26
Will be attending NeurIPS 2025 in San Diego. Presenting a Tutorial on Planning in the Era of Language Models and showcasing Query Gym at IBM Booth.
Dec '25
The work on Black-Box Uncertainty Quantification for Large Language Models via Ensemble-of-Ensembles lead by Wang Ma in the Summer of 2025 is now accepted at Assessing and Improving Reliability of Foundation Models in the Real World (AIR-FM) Workshop at AAAI 2026.
Nov '25
Gave an invited talk at NxtAI Conference in San Francisco on LLMs for AI Planning.
Nov '25
Will be presenting LLMs as MoldMakers; not 3D Printers at BayLearn - Machine Learning Symposium (2025) at Santa Clara University, CA.
Oct '25
Gave an Invited talk at the AI Institute at UofSC on LLMs for AI Planning
slide | video
Sep '25
Excited to share that PLAN-FM Bridge Program is accepted to AAAI 2026.
Sep '25
...see older