About Me

Hi, I am a Postdoctoral Associate at MIT’s Laboratory for Information and Decision Systems (LIDS), mentored by Prof. Cathy Wu. Previously, I was a Research Fellow at Nanyang Technological University (NTU), working with Prof. Jie Zhang and Prof. Zhiguang Cao. I obtained my Ph.D. in Industrial Systems Engineering at National University of Singapore (NUS) in 2024, where I was honored to be advised by Prof. Yeow Meng Chee. I received my B.E. from South China University of Technology (SCUT) in 2019, advised by Prof. Yuejiao Gong.

I am working at the intersection of Machine Learning (ML) and Optimization. I have published 20+ papers in top-tier conferences such as ICML, NeurIPS, ICLR, etc, and top-tier journals such as TPAMI, TNNLS, SMCA, TITS, etc. I am an Area Chair (AC) for NeurIPS and actively review for top AI conferences and journals, recognized with multiple best reviewer awards.

Welcome to see my publications, academic services, experience, and honors & awards. Drop me an email if you’d like to collaborate or discuss with me! You may approach me at:

  • Office: 45-611, 51 Vassar St, Cambridge, MA 02139
  • E-mail: yiningma [at] mit [dot] edu

💡 Research Interests

My research has focused on the emerging field of “Learning to Optimize (L2Opt)”, where we develop state-of-the-art AI-driven frameworks/approaches for addressing challenging optimization problems, e.g., combinatorial optimization problems (COPs). My research drives innovative solutions in broad application domains with real-world complexities such as large-scale vehicle routing in logistics, multi-robot coordination in automated warehouse, complex resource scheduling, path planning, protein docking, hyperparameter tuning, etc.

Research Highlights 🔥 - Complete list available here and on Google Scholar

  • Neural Combinatorial Optimization (NCO) - Learning end-to-end GPU-based solvers, e.g., DACT/NeuOpt (neural k-opt search), N2S/NCS (neural ruin-and-repair search), NeuOpt/PIP (constraint handling), AMDKD (generalization), FER (dynamic embedding), NHDE (multi-objective), MVMoE/SHIELD (multi-task)
  • Learning-guided Optimization - Accelerating and elevating classical solvers with AI to harness the best of both worlds, e.g., R-DAS (dynamic operator selection), GLEET/ConfigX (dynamic algorithm configuration), L-RHO (long-horizon COP), L2Seg (large-scale VRP)
  • Superhuman Knowledge Discovery & Interpretability - Discovering and understanding superhuman optimization knowledge and optimizer rules, e.g., Symbol (rule generation), LLaMoCo (code generation with LLM) – more projects coming soon!
  • Survey, Benchmark & Open-source Library - IET Review (survey of NCO through 2023), MetaBox (benchmark of learning-guided MetaBBO), RL4CO (library of RL for COP)

Research Keywords

  • Machine Learning: Reinforcement Learning, Deep Learning, Large Language Model (LLM), LLM agent, Foundation Model, Multi-Agent Systems, Interpretability
  • Optimization: Combinatorial Optimization, Discrete Optimization, Mixed Integer Linear Programming (MILP), Black-Box Optimization (BBO)
  • Application: Routing, Planning, Scheduling, Robotics, Transportation

🎉 News

  • I'm on the job market! Please contact me if you know of any suitable positions!
  • [05/2025] Our RL4CO benchmark library got accepted by KDD (Oral) 2025!
  • [05/2025] One paper on L2Opt got accepted by ICML’25, where we introduce SHIELD, a novel NCO solver for Multi-Task Multi-Distribution Vehicle Routing Problem (MTMDVRP).
  • [04/2025] I will be serving as an Area Chair for NeurIPS 2025!
  • [04/2025] Our L-RHO is featured on the Front Page of MIT.edu (see MIT News 🎊)!
  • [12/2024] One paper on L2Opt got accepted by ICLR’25, where we introduce L-RHO, leveraging ML to accelerate RHO by up to 54% for Long-Horizon COPs such as FJSP.
  • [12/2024] One paper on L2Opt got accepted by AAAI’25 (Oral), where we propose ConfigX, a unified configuration agent that learns to boost diverse evolutionary algorithms.
  • [12/2024] We hosted an unofficial L2Opt workshop at NeurIPS 2024 (see photos)!
  • [12/2024] ✈️ I will be attending NeurIPS’24 in person. See you in Vancouver, Canada! 🇨🇦
  • [11/2024] One paper on L2Opt got accepted by KDD’25, where we propose a neural approach for optimizing diverse, high-quality solutions to Multi-Solution TSP (MSTSP).
  • [10/2024] I am selected as NeurIPS’24 Top Reviewer (10%) for both main and DB track!
  • [09/2024] One paper on L2Opt got accepted by NeurIPS’24, where we propose Proactive Infeasibility Prevention (PIP) to elevate neural solvers for complex VRP constraints.
  • [08/2024] One paper on L2Opt got accepted by TPAMI, where we promoted a neural collaborative framework integrating learning-to-search and learning-to-construct solvers.
  • [08/2024] I’m so excited to join Prof. Cathy Wu’s group in MIT!
  • [07/2024] I am selected as ICML’24 Best Reviewer (< 3%)!
  • [06/2024] Check out RL4CO - our latest extensive benchmark to unify frameworks and facilitate research in RL-based CO algorithms 🚀.
Click to View More!
  • [05/2024] One paper on L2Opt got accepted by KDD'24, where we propose a hierarchical neural solver for realistic TSPs under real-world customer distributions.
  • [05/2024] One paper on L2Opt got accepted by ICML'24, where we introduce MVMoE, a step towards multi-task domain foundation models for VRPs based on mixture of experts.
  • [05/2024] Promoted to the Research Fellow in NTU.
  • [03/2024] One paper on MARL got accepted by TNNLS, where we introduce DOMAC for opponent modelling in multi-agent systems using only local information.
  • [03/2024] One paper on L2Opt got accepted by SMCA, where we introduce RL-DAS for dynamic algorithm selection based on deep reinforcement learning.
  • [02/2024] Gave a talk at MIT, hosted by Prof. Cathy Wu.
  • [02/2024] Successfully defended my PhD thesis at NUS 🎓!



Flag Counter