TY - JOUR
T1 - Designing and developing an automatic interactive keyphrase extraction system with Unified Modeling Language (UML)
AU - Song, Min
AU - Song, Il Yeol
AU - Hu, Xiaohua
PY - 2004/11
Y1 - 2004/11
N2 - Designing and developing a system that assists the users in digesting and understanding information available has been a difficult challenge. In this paper, we discuss the design and development of an automatic interactive keyphrase extraction system, called KPSpotter, which is capable of processing various formats of data such as XML, HTML, and plain text through Internet. KPSpotter combines Information Gain data mining measure and several Natural Language Processing (NLP) techniques, such as Part of Speech (POS) technique and First Occurrence of Term. To improve extraction accuracy, WordNet is incorporated into KPSpotter. In designing and developing KPSpotter we utilized Unified Modeling Language (UML). UML modeling helps in the formalization of the preliminary analysis model and accomplishes iterative system design and development. We also conducted experiments for system performance testing by comparing keyphrases extracted by KPSPotter and KEA, a well-known naïve Baysiean-based keyphrase extraction system. The experiments show that KPSpotter outperforms KEA in most test cases.
AB - Designing and developing a system that assists the users in digesting and understanding information available has been a difficult challenge. In this paper, we discuss the design and development of an automatic interactive keyphrase extraction system, called KPSpotter, which is capable of processing various formats of data such as XML, HTML, and plain text through Internet. KPSpotter combines Information Gain data mining measure and several Natural Language Processing (NLP) techniques, such as Part of Speech (POS) technique and First Occurrence of Term. To improve extraction accuracy, WordNet is incorporated into KPSpotter. In designing and developing KPSpotter we utilized Unified Modeling Language (UML). UML modeling helps in the formalization of the preliminary analysis model and accomplishes iterative system design and development. We also conducted experiments for system performance testing by comparing keyphrases extracted by KPSPotter and KEA, a well-known naïve Baysiean-based keyphrase extraction system. The experiments show that KPSpotter outperforms KEA in most test cases.
UR - http://www.scopus.com/inward/record.url?scp=34247225709&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247225709&partnerID=8YFLogxK
U2 - 10.1002/meet.1450410143
DO - 10.1002/meet.1450410143
M3 - Article
AN - SCOPUS:34247225709
SN - 1550-8390
VL - 41
SP - 367
EP - 372
JO - Proceedings of the ASIST Annual Meeting
JF - Proceedings of the ASIST Annual Meeting
ER -