Hyungjoo Chae

Language & AGI Lab, Yonsei University | LG AI Research

About Me

Hyungjoo Chae is a Research Scientist at LG AI Research and a Master student at Yonsei University. He works on enhancing how AI agents interact with computers through code generation and within GUI environments. His notable projects include developing world models for web navigation and creating COFFEE-GYM, a platform for improving AI feedback on code. Currently pursuing his M.S. under Professor Jinyoung Yeo, Chae contributes to major conferences like EMNLP and ACL while also working at LG AI Research on inference-time scaling projects. His research aims to create more capable digital agents that can handle complex tasks autonomously.

Download CV

Interests

Digital Agents
Code LLMs
RL for Long-Horizon Tasks

Education

MS in Computer Science
Yonsei University
BS in Computer Science
Yonsei University

Featured Publications

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

In this paper, we present Think-and-Execute, a novel framework that improves algorithmic reasoning in language models by first discovering task-level logic expressed in pseudocode, then simulating its execution for specific instances. Our approach outperforms existing methods by leveraging reusable task-level patterns rather than instance-specific reasoning.

Apr 3, 2024

VERIFINER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models

In this paper, we introduce VerifiNER, a post-hoc verification framework that identifies errors from existing NER methods using knowledge and revises them into more faithful predictions.

Jan 13, 2024

See all publications

Recent Publications

Hyungjoo Chae, Namyoung Kim, Kai Tzu-Iunn Ong, Minju Gwak, Gwanwoo Song, Jihoon Kim, Sunghwan Kim, Dongha Lee, Jinyoung Yeo (2024). Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation. ICLR 2025.

PDF

Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-Iunn Ong, Beong-Woo Kwak, Seonghyeon Bae, Seung-Won Hwang, Jinyoung Yeo (2024). Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code. EMNLP 2024.

PDF Code

Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-Iunn Ong, Beong-Woo Kwak, Moohyeon Kim, Seonghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo (2024). Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models . EMNLP 2024.

PDF Code

Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee (2024). Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering. EMNLP 2024.

PDF

Seoyeon Kim, Kwangwook Seo, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee (2024). VERIFINER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models. ACL 2024.

PDF

Hyungjoo Chae, Yongho Song, Kai Tzu-Iunn Ong, Taeyoon Kwon, Minjin Kim, Youngjae Yu, Dongha Lee, Dongyeop Kang, Jinyoung Yeo (2023). Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents. EMNLP 2023.

PDF Code

Hyungjoo Chae, Minjin Kim, Chaehyeong Kim, Wonseok Jeong, Hyejoong Kim, Junmyung Lee, Jinyoung Yeo (2023). TUTORING: Instruction-grounded Conversational Agent for Language Learners. AAAI 2023 Demo.

PDF Video

Seungone Kim, Se June Joo, Yul Jang, Hyungjoo Chae, Jinyoung Yeo (2023). CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification. EACL 2023 Demo.

PDF Code

Seungone Kim, Se June Joo, Hyungjoo Chae, Chaehyeong Kim, Seung-Won Hwang, Jinyoung Yeo (2022). Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization. COLING 2022.

PDF Code

Preprints

Sunghwan Kim, Dongjin Kang, Taeyoon Kwon, Hyungjoo Chae, Jungsoo Won, Dongha Lee, Jinyoung Yeo (2024). Evaluating Robustness of Reward Models for Mathematical Reasoning. arXiv preprint / Under review at ICLR 2025.

PDF Code

Seungbeen Lee, Seungwon Lim, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-Woo Kwak, Yeonsoo Lee, Dongha Lee, Jinyoung Yeo, Youngjae Yu (2024). Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics. arXiv preprint / Under review at NAACL 2025.

PDF

Kai Tzu-Iunn Ong, Namyoung Kim, Minju Gwak, Hyungjoo Chae, Taeyoon Kwon, Yohan Jo, Seung-Won Hwang, Dongha Lee, Jinyoung Yeo (2024). Towards Lifelong Dialogue Agents via Relation-aware Memory Construction and Timeline-augmented Response Generation. arXiv preprint / Under review at NAACL 2025.

PDF

Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, ...., Hyungjoo Chae, ..., Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo (2024). The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models. arXiv preprint / Under review at NAACL 2025.

PDF Code

About Me

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

VERIFINER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models

🎉 1 paper accepted at ICLR 2025 and 3 papers accepted at NAACL 2025

⭐️ 1 paper accepted at NeurIPS 2024 Workshop

🎯 Started Internship at LG AI Research

🎉 3 papers accepted at EMNLP 2024

🤠 1 paper accepted at ACL 2024