2026

  • PROMISE: Proof Automation as Structural Imitation of Human Reasoning
    Youngjoo Ahn, Sangyeop Yeo, Gijung Im, Jongmin Lee, Jinyoung Yeo, Jieung Kim
    Preprint
  • [C29] ACPO: Agent-Chained Policy Optimization for Multi-Agent Reinforcement Learning
    Daiki E. Matsunaga, Junho Na, Tri Wahyu Guntara, Scott Sanner, Pascal Poupart, Jongmin Lee^, Kee-Eung Kim^
  • [C28] Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
    Junwoo Chang, Minwoo Park, Joohwan Seo, Roberto Horowitz, Jongmin Lee^, Jongeun Choi^

2025

  • Group-Invariant Unsupervised Skill Discovery: Symmetry-aware Skill Representations for Generalizable Behavior
    Junwoo Chang, Joseph Park, Roberto Horowitz, Jongmin Lee^, Jongeun Choi^
    Preprint
  • Semi-gradient DICE for Offline Constrained Reinforcement Learning
    Woosung Kim*, JunHo Seo*, Jongmin Lee^, Byung-Jun Lee^
    Preprint
  • [C27] FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning
    Woosung Kim*, Jinho Lee*, Jongmin Lee^, Byung-Jun Lee^
  • [C26] SEMDICE: Off-policy State Entropy Maximization via Stationary Distribution Correction Estimation
    Jongmin Lee*, Meiqi Sun*, Pieter Abbeel

2024

  • [C23] Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution Correction
    Seokin Seo, Byung-Jun Lee, Jongmin Lee, HyeongJoo Hwang, Hongseok Yang, Kee-Eung Kim
  • [C22] ROIDICE: Offline Return on Investment Maximization for Efficient Decision Making
    Woosung Kim*, Hayeong Lee*, Jongmin Lee^, Byung-Jun Lee^
  • [C25] Body Transformer: Leveraging Robot Embodiment for Policy Learning
    Carmelo Sferrazza, Dun-Ming Huang, Fangchen Liu, Jongmin Lee, Pieter Abbeel
  • [C24] Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
    Haanvid Lee, Tri Wahyu Guntara, Jongmin Lee, Yung-Kyun Noh, Kee-Eung Kim
    Spotlight

2023

  • [C21] AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
    Daiki E. Matsunaga*, Jongmin Lee*, Jaeseok Yoon, Stefanos Leonardos, Pieter Abbeel, Kee-Eung Kim
  • [C20] SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations
    Youngsoo Jang, Geon-Hyeong Kim, Jongmin Lee, Sungryull Sohn, Byoungjip Kim, Honglak Lee, Moontae Lee
  • [C19] Tempo Adaptation in Non-stationary Reinforcement Learning
    Hyunin Lee, Yuhao Ding, Jongmin Lee, Ming Jin, Javad Lavaei, Somayeh Sojoudi

2022

  • [C15] LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation
    Geon-Hyeong Kim*, Jongmin Lee*, Youngsoo Jang, Hongseok Yang, Kee-Eung Kim
  • [C14] Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
    Haanvid Lee, Jongmin Lee, Yunseon Choi, Wonseok Jeon, Byung-Jun Lee, Yung-Kyun Noh, Kee-Eung Kim
  • [C18] COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
    Jongmin Lee, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez
    Spotlight
  • [C17] DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations
    Geon-Hyeong Kim, Seokin Seo, Jongmin Lee, Wonseok Jeon, HyeongJoo Hwang, Hongseok Yang, Kee-Eung Kim
  • [C16] GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
    Youngsoo Jang, Jongmin Lee, Kee-Eung Kim

2021

2020

2019

  • [C5] Trust Region Sequential Variational Inference
    Geon-Hyeong Kim, Youngsoo Jang, Jongmin Lee, Wonseok Jeon, Hongseok Yang, Kee-Eung Kim
  • [C6] PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
    Youngsoo Jang*, Jongmin Lee*, Jaeyoung Park*, Kyeng-Hun Lee, Pierre Lison, Kee-Eung Kim

2018

2017

2016