About

I’m a Ph.D. student in the Data Intelligence and Learning Lab (DIAL Lab) at Sungkyunkwan University (SKKU), Korea. I received my M.S. degree in Artificial Intelligence from SKKU in 2023. Before that, I received a B.A. degree in Economics and a B.S. degree in Computer Science and Engineering from SKKU in 2021. My primary interest is Information Retrieval, with a focus on sparse retrieval and generative document retrieval. More broadly, my research covers data mining and natural language processing for real-world applications.

Publications

International Conference

GLEN: Generative Retrieval via Lexical Index Learning [link] [code] [blog (Korean)]
Sunkyung Lee*, Minjin Choi*, Jongwuk Lee (* : equal contribution)
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Singapore, December 6-10, 2023 (Acceptance Rate: 23.3%, 901/3868)

ConQueR: Contextualized Query Reduction using Search Logs [link] [code] [blog (Korean)]
Hye-young Kim*, Minjin Choi*, Sunkyung Lee, Eunseong Choi, Young-In Song and Jongwuk Lee (* : equal contribution)
The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR, short paper)
Taipei, Taiwan, July 23-27, 2023 (Acceptance Rate: 25.12%, 154/613)

SpaDE: Improving Sparse Representations using a Dual Document Encoder for First-stage Retrieval [link] [code]
Eunseong Choi*, Sunkyung Lee*, Minjin Choi, Hyeseon Ko, Young-In Song and Jongwuk Lee (* : equal contribution)
The 31st ACM International Conference on Information and Knowledge Management (CIKM)
Atlanta, Georgia, USA, October 17-21, 2022 (Acceptance Rate: 23.3%, 274/1175)

MelBERT: Metaphor Detection via Contextualized Late Interaction using Metaphorical Identification Theories [link] [code] [slide] [video]
Minjin Choi, Sunkyung Lee, Eunseong Choi, Heesoo Park, Junhyuk Lee, Dongwon Lee, Jongwuk Lee
2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)
Mexico City, Mexico (Virtual Event), June 6–11, 2021 (Acceptance Rate: 26.5%, 477/1797)

Domestic Conference and Journal

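A Math Word Problem Solving Model Using Pseudo Sentence Representations (Best Presentation Paper Award) [link]
Jiwoo Kim (김지우), Sunkyung Lee, Eunseong Choi, Jongwuk Lee
Proceedings of the Korean Institute of Information Scientists and Engineers (KIISE) Conference Vol.2022 No.06 [2022]: 446-448, Jun 2022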

Data Augmentation Methods for Improving the Performance of Machine Reading Comprehension [link]
Sunkyung Lee, Eunseong Choi, Seonho Jeong (정선호), Jongwuk Lee
Journal of KIISE Vol.48 No.12 [2021]: 1298-1304, Nov 2021

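Data Augmentation Methods for Improving the Performance of Machine Reading Comprehension (Best Paper Award) [link]
Sunkyung Lee, Seonho Jeong (정선호), Jongwuk Lee
Proceedings of the Korean Institute of Information Scientists and Engineers (KIISE) Conference Vol.2020 No.12 [2020]: 400-402, Dec 2020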

Education

Sungkyunkwan University, Republic of Korea
Ph.D., Department of Artificial Intelligence
Mar 2023 – present
Advisor: Prof. Jongwuk Lee

Sungkyunkwan University, Republic of Korea
M.S., Department of Artificial Intelligence
Mar 2021 – Feb 2023
Advisor: Prof. Jongwuk Lee
Thesis: A Dual Document Encoder Based on Sparse Representations for First-stage Retrieval

Sungkyunkwan University, Republic of Korea
B.S., Department of Computer Science and Engineering & B.A., Department of Global Economics
Mar 2017 – Feb 2021

For more info

Please download my CV here.
Visit our lab homepage: DIAL Lab