Abstract
We study keyphrase extraction (KPE) from Web documents. Our key contribution is encoding Web documents to leverage structure, such as title or anchors, by building a graph of words representing both (a) position-based proximity and (b) structural relations. We evaluate KPE performance on real-world search engine NAVER and human-annotated KPE benchmarks, and ours outperforms state-of-the-arts in both tasks.
Original language | English |
---|---|
Title of host publication | SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval |
Publisher | Association for Computing Machinery, Inc |
Pages | 1823-1827 |
Number of pages | 5 |
ISBN (Electronic) | 9781450380379 |
DOIs | |
Publication status | Published - 2021 Jul 11 |
Event | 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 - Virtual, Online, Canada Duration: 2021 Jul 11 → 2021 Jul 15 |
Publication series
Name | SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval |
---|
Conference
Conference | 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 |
---|---|
Country/Territory | Canada |
City | Virtual, Online |
Period | 21/7/11 → 21/7/15 |
Bibliographical note
Funding Information:This work was supported by NAVER-SQR program from NAVER corporation, and IITP funded by MSIT (No. 2017-0-01779, XAI).
Publisher Copyright:
© 2021 ACM.
All Science Journal Classification (ASJC) codes
- Software
- Computer Graphics and Computer-Aided Design
- Information Systems