About Me

Hi, I am a Postdoctoral Associate at MIT’s Laboratory for Information and Decision Systems (LIDS), mentored by Prof. Cathy Wu. Previously, I was a Research Fellow at Nanyang Technological University (NTU), working with Prof. Jie Zhang and Prof. Zhiguang Cao. I obtained my Ph.D. in Industrial Systems Engineering at National University of Singapore (NUS) in 2024, where I was honored to be advised by Prof. Yeow Meng Chee. I received my B.E. from South China University of Technology (SCUT) in 2019, advised by Prof. Yuejiao Gong.

I am working at the intersection of Machine Learning (ML) and Optimization. I have published 20+ papers in top-tier conferences such as ICML, NeurIPS, ICLR, etc, and top-tier journals such as TPAMI, TNNLS, SMCA, TITS, etc. I am an Area Chair (AC) for NeurIPS and actively review for top AI conferences and journals, recognized with multiple best reviewer awards.

Welcome to see my publications, fundings, services, experience, and honors & awards. Drop me an email if you’d like to collaborate or discuss with me! You may approach me at:

Office: 45-611, 51 Vassar St, Cambridge, MA 02139
E-mail: yiningma [at] mit [dot] edu

💡 Research Interests

My research has focused on the emerging field of “Learning to Optimize (L2Opt)”, where we develop state-of-the-art AI-driven frameworks/approaches for addressing challenging optimization problems, e.g., combinatorial optimization problems (COPs). My research drives innovative solutions in broad application domains with real-world complexities such as large-scale vehicle routing in logistics, multi-robot coordination in automated warehouse, complex resource scheduling, path planning, protein docking, hyperparameter tuning, etc.

Research Highlights 🔥 - Complete list available here and on Google Scholar

Neural Combinatorial Optimization (NCO) - Learning end-to-end GPU-based solvers, e.g., DACT/NeuOpt (neural k-opt search), N2S/NCS (neural ruin-and-repair search), NeuOpt/PIP (constraint handling), AMDKD (generalization), FER (dynamic embedding), NHDE (multi-objective), MVMoE/SHIELD (multi-task)
Learning-guided Optimization - Accelerating and elevating classical solvers with AI to harness the best of both worlds, e.g., R-DAS (dynamic operator selection), GLEET/ConfigX (dynamic algorithm configuration), L-RHO (long-horizon COP), L2Seg (large-scale VRP)
Superhuman Knowledge Discovery & Interpretability - Discovering and understanding superhuman optimization knowledge and optimizer rules, e.g., Symbol (rule generation), LLaMoCo (code generation with LLM) – more projects coming soon!
Survey, Benchmark & Open-source Library - IET Review (survey of NCO through 2023), MetaBox-v2 (benchmark of learning-guided MetaBBO), RL4CO (library of RL for COP)

Research Keywords

Machine Learning: Reinforcement Learning, Deep Learning, Large Language Model (LLM), LLM agent, Foundation Model, Multi-Agent Systems, Interpretability
Optimization: Combinatorial Optimization, Discrete Optimization, Mixed Integer Linear Programming (MILP), Black-Box Optimization (BBO)
Application: Routing, Planning, Scheduling, Robotics, Transportation

🎉 News

I'm on the job market! Please contact me if you know of any suitable positions!
[09/2025] Three papers on L2Opt got accepted by NeurIPS’25, including 1) the very first paper studying the internal mechanism and intepretability of NCO models (Spotlight); 2) DesignX, a bi-agent learning system tailored for automated algorithm design in black-box optimization; and 3) MetaBox-v2, an up-to-date benchmark platform for MetaBBO!
[09/2025] We have organized the 1st workshop on Learning-assisted Evolutionary Algorithm Design (LEAD 2025). Watch recordings here; see you all next year!
[06/2025] Checkout MetaBox-v2🚀! A major update offering broader optimization scenarios (single-objective, multi-objective, multi-modal, multi-task, etc), extensive benchmarks, even faster parallel training/inference, and more support for MetaBBO.
[05/2025] Our benchmark library got accepted by KDD (Oral) 2025!
[05/2025] One paper on L2Opt got accepted by ICML’25, where we introduce SHIELD, a novel NCO solver for Multi-Task Multi-Distribution Vehicle Routing Problem (MTMDVRP).
[04/2025] I will be serving as an Area Chair for NeurIPS 2025!
[04/2025] Our L-RHO is featured on the Front Page of MIT.edu (see MIT News 🎊)!
[12/2024] One paper on L2Opt got accepted by ICLR’25, where we introduce L-RHO, leveraging ML to accelerate RHO by up to 54% for Long-Horizon COPs such as FJSP.
[12/2024] One paper on L2Opt got accepted by AAAI’25 (Oral), where we propose ConfigX, a unified configuration agent that learns to boost diverse evolutionary algorithms.
[12/2024] We hosted an unofficial L2Opt workshop at NeurIPS 2024 (see photos)!
[12/2024] ✈️ I will be attending NeurIPS’24 in person. See you in Vancouver, Canada! 🇨🇦
[11/2024] One paper on L2Opt got accepted by KDD’25, where we propose a neural approach for optimizing diverse, high-quality solutions to Multi-Solution TSP (MSTSP).
[10/2024] I am selected as NeurIPS’24 Top Reviewer (10%) for both main and DB track!
[09/2024] One paper on L2Opt got accepted by NeurIPS’24, where we propose Proactive Infeasibility Prevention (PIP) to elevate neural solvers for complex VRP constraints.
[08/2024] One paper on L2Opt got accepted by TPAMI, where we promoted a neural collaborative framework integrating learning-to-search and learning-to-construct solvers.
[08/2024] I’m so excited to join Prof. Cathy Wu’s group in MIT!
[07/2024] I am selected as ICML’24 Best Reviewer (< 3%)!
[06/2024] Check out - our latest extensive benchmark to unify frameworks and facilitate research in RL-based CO algorithms 🚀.

Click to View More!

[05/2024] One paper on L2Opt got accepted by KDD'24, where we propose a hierarchical neural solver for realistic TSPs under real-world customer distributions.
[05/2024] One paper on L2Opt got accepted by ICML'24, where we introduce MVMoE, a step towards multi-task domain foundation models for VRPs based on mixture of experts.
[05/2024] Promoted to the Research Fellow in NTU.
[03/2024] One paper on MARL got accepted by TNNLS, where we introduce DOMAC for opponent modelling in multi-agent systems using only local information.
[03/2024] One paper on L2Opt got accepted by SMCA, where we introduce RL-DAS for dynamic algorithm selection based on deep reinforcement learning.
[02/2024] Gave a talk at MIT, hosted by Prof. Cathy Wu.
[02/2024] Successfully defended my PhD thesis at NUS 🎓!

Yining MA (马一宁)

💡 Research Interests

🎉 News