SPARTA is a fully automated SQL-centric pipeline that constructs a tree-structured multi-hop QA benchmark by unifying structured and unstructured evidence from tables and text into a single relational representation.
@article{park2024sparta,title={SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables},author={Park, Sungho and Kim, Juen and Han, Wook-Shin},year={2025},journal={Submitted},}
EMNLP
SAFE: Schema-Driven Approximate Distance Join for Efficient Knowledge Graph Querying
SAFE introduces a schema-driven approximate distance join algorithm that refines noisy LLM-generated query graphs using schema-level constraints and efficiently aligns them with large knowledge graphs, enabling robust and scalable knowledge graph querying.
@article{lee2025safe,title={SAFE: Schema-Driven Approximate Distance Join for Efficient Knowledge Graph Querying},author={Lee, Sangoh and Park, Sungho and Han, Wook-Shin},booktitle={Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing},year={2025},publisher={Association for Computational Linguistics},}
ACL
HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval
Sungho Park, Joohyung Yun, Jongwuk Lee, and 1 more author
In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
HELIOS formulates retrieval as finding a query-relevant subgraph within a bipartite data graph built via early fusion of table segments and passages, and introduces a three-stage pipeline integrating early fusion, late fusion, and LLM reasoning.
@inproceedings{park2025helios,title={HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval},author={Park, Sungho and Yun, Joohyung and Lee, Jongwuk and Han, Wook-Shin},booktitle={Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},pages={32424--32444},year={2025},}
2024
KDD Cup
KDD Cup Meta CRAG 2024 Technical Report: Three-step Question-Answering Framework
Sungho Park, Jeongeum Seok, Jooyoung Lee, and 2 more authors
In 2024 KDD Cup Workshop for Retrieval Augmented Generation, 2024
First place in comparison questions (Tasks 1, 2, 3) and post-processing question (Task 1)
A three-step RAG framework that minimizes unnecessary retrievals by leveraging LLMs’ inherent knowledge and introduces a verification stage to prevent error propagation, improving both accuracy and efficiency in question answering.
@inproceedings{park2024kdd,title={{KDD} Cup Meta {CRAG} 2024 Technical Report: Three-step Question-Answering Framework},author={Park, Sungho and Seok, Jeongeum and Lee, Jooyoung and Yun, Joohyung and Lee, Wonseok},booktitle={2024 KDD Cup Workshop for Retrieval Augmented Generation},year={2024},url={https://openreview.net/forum?id=G4Ei2QlKnv},}