Hyungjoo Chae

Ph.D. Student at Georgia Institute of Technology

hchae36@gatech.edu

Hyungjoo Chae
About
Hello! I am a Ph.D. Student at Georgia Institute of Technology in Computer Science, advi by Alan Ritter. I obtained my M.S. in Computer Science at Yonsei University, where I was fortunate to be advised by Jinyoung Yeo. Prior to that, I was a research intern at LG AI Research and Tutoring, Market Designers Inc., and did my B.S. in CS at Yonsei University.

My primary research focus is centered around developing GUI Agents and Digital Agents that can interact with complex environments, particularly web navigation and code generation tasks. I am also interested in Reinforcement Learning for Long-horizon Tasks and Code LLMs.

I am passionate about building agents that can understand and interact with the digital world in meaningful ways. My research spans from developing better evaluation frameworks for web agents to creating more robust code generation systems that can learn from feedback.

I am always open to collaboration and discussion about research projects. If you are interested in GUI agents, web navigation, or code generation, please feel free to reach out!
News
September 2025     Web-Shepherd accepted to NeurIPS 2025 Spotlight.
August 2025     ToolHaystack accepted to EMNLP 2025 Findings.
August 2025     Started Ph.D. at Georgia Institute of Technology!
May 2025     Evaluating Robustness of Reward Models accepted to ACL 2025. Can You Share Your Story? accepted to ACL 2025 Findings.
April 2025     The BiGGen Bench selected for the Best Paper Award at NAACL 2025.
January 2025     Web Agents with World Models accepted to ICLR 2025. Towards Lifelong Dialogue Agents accepted to NAACL 2025. TRAIT accepted to NAACL 2025 Findings.
September 2024     Our Language Models as Compilers and COFFEE-GYM papers got accepted to EMNLP 2024!
May 2024     Our Evidence-Focused Fact Summarization and VerifiNER papers got accepted to EMNLP 2024 and ACL 2024!
December 2023     Our Dialogue Chain-of-Thought Distillation paper got accepted to EMNLP 2023!
February 2023     Our CoTEVer and TUTORING papers got accepted to EACL 2023 and AAAI 2023!
October 2022     Our Mind the Gap! paper got accepted to COLING 2022!

Education

Georgia Institute of Technology2025 - Present

Ph.D. in Computer Science (Advisor: Alan Ritter)

Yonsei University2023 - 2025

M.S. in Computer Science (Advisor: Jinyoung Yeo)

Yonsei University2019 - 2022

B.S. in Computer Science

Publications

2025

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Hyungjoo Chae*, Sunghwan Kim*, Junhee Cho*, Seungone Kim, Seungjun Moon, Gyeom Hwangbo, Dongha Lim, Minjin Kim, Yeonjun Hwang, Minju Gwak, Dongwook Choi, Minseok Kang, Gwanhoon Im, ByeongUng Cho, Hyojun Kim, Jun Hee Han, Taeyoon Kwon, Minju Kim, Beong-woo Kwak, Dongjin Kang, Jinyoung Yeo

NeurIPS 2025 (Spotlight)

ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions

Beong-woo Kwak, Minju Kim, Dongha Lim, Hyungjoo Chae, Dongjin Kang, Sunghwan Kim, Dongil Yang, Jinyoung Yeo

EMNLP 2025 Findings

Evaluating Robustness of Reward Models for Mathematical Reasoning

Sunghwan Kim*, Dongjin Kang*, Taeyoon Kwon, Hyungjoo Chae, Jungsoo Won, Dongha Lee, Jinyoung Yeo

ACL 2025

Can You Share Your Story? Modeling Clients' Metacognition and Openness for LLM Therapist Evaluation

Minju Kim, Dongje Yoo, Yeonjun Hwang, Minseok Kang, Namyoung Kim, Minju Gwak, Beong-woo Kwak, Hyungjoo Chae, Harim Kim, Yunjoong Lee, Min Hee Kim, Dayi Jung, Kyong-Mee Chung, Jinyoung Yeo

ACL 2025 Findings

Towards Lifelong Dialogue Agents via Relation-aware Memory Construction and Timeline-augmented Response Generation

Kai Tzu-iunn Ong, Namyoung Kim, Minju Gwak, Hyungjoo Chae, Taeyoon Kwon, Yohan Jo, Seung-won Hwang, Dongha Lee, Jinyoung Yeo

NAACL 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, ... Hyungjoo Chae ..., Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo

NAACL 2025, Best Paper Award

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

Seungbeen Lee*, Seungwon Lim*, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-woo Kwak, Yeonsoo Lee, Dongha Lee, Jinyoung Yeo, Youngjae Yu

NAACL 2025 Findings

Web Agents With World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Hyungjoo Chae, Namyoung Kim, Kai Tzu-iunn Ong, Minju Gwak, Gwanwoo Song, Jihoon Kim, Sunghwan Kim, Dongha Lee, Jinyoung Yeo

ICLR 2025 / NeurIPS 2024 Sys2Reasonig at Scale Workshop

2024

COFFEE-GYM: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Hyungjoo Chae*, Taeyoon Kwon*, Seungjun Moon*, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Seung-won Hwang, Jinyoung Yeo

EMNLP 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Seonghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo

EMNLP 2024 / ACL 2024 NLRSE Workshop

Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering

Sungho Ko*, Hyunjin Cho*, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee

EMNLP 2024

VERIFINER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models

Seoyeon Kim*, Kwangwook Seo*, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee

ACL 2024

2023

Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents

Hyungjoo Chae*, Yongho Song*, Kai Tzu-iunn Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo

EMNLP 2023

CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification

Seungone Kim, Sejune Joo, Yul Jang, Hyungjoo Chae, Jinyoung Yeo

EACL 2023 System Demonstrations

TUTORING: Instruction-grounded Conversational Agent for Language Learners

Hyungjoo Chae, Minjin Kim, Chaehyeong Kim, Won Seok Jeong, Hye Soong Kim, June Myung Lee, Jinyoung Yeo

AAAI 2023: System Demonstrations

2022

Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization

Seungone Kim*, SeJune Joo*, Hyungjoo Chae*, Chaehyeong Kim*, Seung-won Hwang, Jinyoung Yeo

COLING 2022

( * indicates equal contribution )

Vitæ

Full CV in PDF.

  • Georgia Institute of Technology 2025 - Present
    Ph.D. in Computer Science
    Advisor: Alan Ritter • Research Areas: GUI Agents
  • LG AI Research Oct. 2024 - Apr. 2025
    Research Intern
    Advisor: Kyungjae Lee and Moontae Lee • Inference-time scaling with MCTS and reflective reasoning
  • Yonsei University 2023 - 2025
    M.S. in Computer Science
    Advisor: Jinyoung Yeo • Research Areas: Digital Agents, RL for Long-horizon Tasks, Code LLMs
  • Tutoring, Market Designers Inc. Jul. 2022 - Nov. 2023
    Research Intern
    Advisor: Jinyoung Yeo
  • Yonsei University 2019 - 2022
    B.S. in Computer Science
    Graduation Project: Commonsense-augmented Dialogue Summarization • Advisor: Jinyoung Yeo
684 Pageviews
Oct. 01st - Oct. 31st
Jekyll theme adapted from Seungone Kim's website,Junmo Kang's website, and Joel Jang's website.