I am a staff research scientist at Field AI working with Dr. Shayegan Omidshafiei and Dr. Ali Agha to develop field foundation models. Before joining Field AI, I was a research scientist at LG AI Research, where I developed large language model agents to navigate web environments with Prof. Honglak Lee.

I received my M.S./Ph.D. from MIT, studying multiagent reinforcement learning (doctoral thesis committee: Prof. Jonathan P. How, Prof. Jakob N. Foerster, and Prof. Pulkit Agrawal). Prior to my graduate study, I received a B.S. (summa cum laude) from Cornell and worked on computer vision research (advisor: Prof. Tsuhan Chen).

I am a recipient of the Kwanjeong Educational Foundation Fellowship, and have spent time at LG AI Research (advisor: Prof. Honglak Lee), CMU-RI (advisor: Prof. Sebastian Scherer) and TTIC (advisor: Prof. Matthew R. Walter).

🤗 My up-to-date CV is available upon request.

Publication

Large Language Models

  • Yao Fu*, Dong-Ki Kim*, Jaekyeom Kim, Sungryull Sohn, Lajanugen Logeswaran, Kyunghoon Bae, Honglak Lee. AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents. In Neural Information Processing Systems (NeurIPS), 2024. [Paper]
  • Jaekyeom Kim, Dong-Ki Kim, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee. Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Agents. In Empirical Methods in Natural Language Processing (EMNLP), 2024.
  • Lajanugen Logeswaran, Sungryull Sohn, Yiwei Lyu, Anthony Z. Liu, Dong-Ki Kim, Dongsub Shim, Moontae Lee, Honglak Lee. Reasoning about Action Preconditions with Programs. In North American Chapter of the Association for Computational Linguistics (NAACL), 2024. [Paper]
  • Anthony Liu, Jongwook Choi, Sungryull Sohn, Yao Fu, Jaekyeom Kim, Dong-Ki Kim, Xinhe Wang, Jaewon Yoo, Honglak Lee. SkillAct: Using Skill Abstractions Improves LLM Agents. In International Conference on Machine Learning (ICML) Workshop, 2024. [Paper]
  • Dong-Ki Kim, Sungryull Sohn, Lajanugen Logeswaran, Dongsub Shim, Honglak Lee. MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning. In Neural Information Processing Systems (NeurIPS) Workshop with spotlight 🎉, 2023. [Paper]
  • Sungryull Sohn, Yiwei Lyu, Anthony Z. Liu, Lajanugen Logeswaran, Dong-Ki Kim, Dongsub Shim, Honglak Lee. TOD-Flow: Modeling the Structure of Task-Oriented Dialogues. In Empirical Methods in Natural Language Processing (EMNLP), 2023.

Reinforcement Learning

  • Anthony Z. Liu, Dong-Ki Kim, Sungryull Sohn, Honglak Lee. Learning Higher Order Skills that Efficiently Compose. In International Conference on Machine Learning (ICML) Workshop, 2023. [Paper]
  • Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Gerald Tesauro, Jonathan P. How. Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria. In Conference on Robot Learning (CoRL) Workshop with oral presentation 🎉, 2022. [Paper]
  • Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How. Influencing Long-Term Behavior in Multiagent Reinforcement Learning. In Neural Information Processing Systems (NeurIPS), 2022. [Paper] [Code] [Video] [MIT News]
  • Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How. ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation. In International Conference on Robotics and Automation (ICRA), 2022. [Paper]
  • Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How. Context-Specific Representation Abstraction for Deep Option Learning. In Association for the Advancement of Artificial Intelligence (AAAI), 2022. [Paper] [Code] [Video]
  • Dong-Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. In International Conference on Machine Learning (ICML), 2021. [Paper] [Code] [Video]
  • Chuangchuang Sun, Dong-Ki Kim, Jonathan P. How. FISAR: Forward Invariant Safe Reinforcement Learning with a Deep Neural Network-Based Optimizer. In International Conference on Robotics and Automation (ICRA), 2021. [Paper]
  • Dong-Ki Kim, Miao Liu, Shayegan Omidshafiei, Sebastian Lopez-Cot, Matthew Riemer, Golnaz Habibi, Gerald Tesauro, Sami Mourad, Murray Campbell, Jonathan P. How. Learning Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning. In International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2020. [Paper] [WIRED News]
  • Samir Wadhwania, Dong-Ki Kim, Shayegan Omidshafiei, Jonathan P. How. Policy Distillation and Value Matching in Multiagent Reinforcement Learning. In International Conference on Intelligent Robots and Systems (IROS), 2019. [Paper] [Video]
  • Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How. Learning to Teach in Cooperative Multiagent Reinforcement Learning. In Association for the Advancement of Artificial Intelligence (AAAI), 2019. [Paper] [MIT News]
  • Shayegan Omidshafiei*, Dong-Ki Kim*, Jason Pazis, Jonathan P. How. Crossmodal Attentive Skill Learner. In International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2018 and Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS), 2020. [Conference Paper] [Journal Paper] [Code] [Video]

Computer Vision & Robotics

  • Lena M. Downes, Dong-Ki Kim, Ted J. Steiner, Jonathan P. How. City-wide Street-to-Satellite Image Geolocalization of a Mobile Ground Agent. In International Conference on Intelligent Robots and Systems (IROS), 2022. [Paper] [Video]
  • Andrea Tagliabue, Dong-Ki Kim, Michael Everett, Jonathan P. How. Demonstration-Efficient Guided Policy Search via Imitation of Robust Tube MPC. In International Conference on Robotics and Automation (ICRA), 2022. [Paper] [Video]
  • Dong-Ki Kim, Matthew R. Walter. Satellite Image-based Localization via Learned Embeddings. In International Conference on Robotics and Automation (ICRA), 2017. [Paper] [Video] [NVIDIA News]
  • Dong-Ki Kim, Daniel Maturana, Masashi Uenoyama, Sebastian Scherer. Season-Invariant Semantic Segmentation with A Deep Multimodal Network. In Field and Service Robotics (FSR), 2017. [Paper]
  • Hang Chu, Dong-Ki Kim, Tsuhan Chen. You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans. In International Conference on Computer Vision (ICCV), 2015. [Paper] [Video]
  • Dong-Ki Kim, Tsuhan Chen. Deep Neural Network for Real-Time Autonomous Indoor Navigation. Technical Report, 2015. [Paper] [Video]