A methodology to measure the semantic similarity between words based on the formal concept analysis

Yewon Jeong, Yiyeon Yoon, Dongkyu Jeon, Youngsang Cho, Wooju Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recently, web users feel difficult to find the desired information on the internet despite a lot of useful information since it takes more time and effort to find it. In order to solve this problem, the query expansion is considered as a new alternative. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Although there are a few techniques of query expansion, synonym identification is one of them. Therefore, this paper proposes the method to measure the semantic similarity between two words by using the keyword-based web documents. The formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. As the results, the average of similarity between synonym pairs is much higher than random pairs. Also, our method shows the remarkable performance in comparison with other method. Therefore, the suggested method in this paper has the contribution to find the synonym among a lot of candidate words.

Original languageEnglish
Title of host publicationWEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies
PublisherSciTePress
Pages313-321
Number of pages9
Volume2
ISBN (Print)9789897580246
Publication statusPublished - 2014 Jan 1
Event10th International Conference on Web Information Systems and Technologies, WEBIST 2014 - Barcelona, Spain
Duration: 2014 Apr 32014 Apr 5

Other

Other10th International Conference on Web Information Systems and Technologies, WEBIST 2014
CountrySpain
CityBarcelona
Period14/4/314/4/5

Fingerprint

Formal concept analysis
Semantics
Information retrieval
Internet
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Cite this

Jeong, Y., Yoon, Y., Jeon, D., Cho, Y., & Kim, W. (2014). A methodology to measure the semantic similarity between words based on the formal concept analysis. In WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies (Vol. 2, pp. 313-321). SciTePress.
Jeong, Yewon ; Yoon, Yiyeon ; Jeon, Dongkyu ; Cho, Youngsang ; Kim, Wooju. / A methodology to measure the semantic similarity between words based on the formal concept analysis. WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies. Vol. 2 SciTePress, 2014. pp. 313-321
@inproceedings{cf19da45b752467f891a4c8bbf44fd1e,
title = "A methodology to measure the semantic similarity between words based on the formal concept analysis",
abstract = "Recently, web users feel difficult to find the desired information on the internet despite a lot of useful information since it takes more time and effort to find it. In order to solve this problem, the query expansion is considered as a new alternative. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Although there are a few techniques of query expansion, synonym identification is one of them. Therefore, this paper proposes the method to measure the semantic similarity between two words by using the keyword-based web documents. The formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. As the results, the average of similarity between synonym pairs is much higher than random pairs. Also, our method shows the remarkable performance in comparison with other method. Therefore, the suggested method in this paper has the contribution to find the synonym among a lot of candidate words.",
author = "Yewon Jeong and Yiyeon Yoon and Dongkyu Jeon and Youngsang Cho and Wooju Kim",
year = "2014",
month = "1",
day = "1",
language = "English",
isbn = "9789897580246",
volume = "2",
pages = "313--321",
booktitle = "WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies",
publisher = "SciTePress",

}

Jeong, Y, Yoon, Y, Jeon, D, Cho, Y & Kim, W 2014, A methodology to measure the semantic similarity between words based on the formal concept analysis. in WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies. vol. 2, SciTePress, pp. 313-321, 10th International Conference on Web Information Systems and Technologies, WEBIST 2014, Barcelona, Spain, 14/4/3.

A methodology to measure the semantic similarity between words based on the formal concept analysis. / Jeong, Yewon; Yoon, Yiyeon; Jeon, Dongkyu; Cho, Youngsang; Kim, Wooju.

WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies. Vol. 2 SciTePress, 2014. p. 313-321.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - A methodology to measure the semantic similarity between words based on the formal concept analysis

AU - Jeong, Yewon

AU - Yoon, Yiyeon

AU - Jeon, Dongkyu

AU - Cho, Youngsang

AU - Kim, Wooju

PY - 2014/1/1

Y1 - 2014/1/1

N2 - Recently, web users feel difficult to find the desired information on the internet despite a lot of useful information since it takes more time and effort to find it. In order to solve this problem, the query expansion is considered as a new alternative. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Although there are a few techniques of query expansion, synonym identification is one of them. Therefore, this paper proposes the method to measure the semantic similarity between two words by using the keyword-based web documents. The formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. As the results, the average of similarity between synonym pairs is much higher than random pairs. Also, our method shows the remarkable performance in comparison with other method. Therefore, the suggested method in this paper has the contribution to find the synonym among a lot of candidate words.

AB - Recently, web users feel difficult to find the desired information on the internet despite a lot of useful information since it takes more time and effort to find it. In order to solve this problem, the query expansion is considered as a new alternative. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Although there are a few techniques of query expansion, synonym identification is one of them. Therefore, this paper proposes the method to measure the semantic similarity between two words by using the keyword-based web documents. The formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. As the results, the average of similarity between synonym pairs is much higher than random pairs. Also, our method shows the remarkable performance in comparison with other method. Therefore, the suggested method in this paper has the contribution to find the synonym among a lot of candidate words.

UR - http://www.scopus.com/inward/record.url?scp=84902380329&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84902380329&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84902380329

SN - 9789897580246

VL - 2

SP - 313

EP - 321

BT - WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies

PB - SciTePress

ER -

Jeong Y, Yoon Y, Jeon D, Cho Y, Kim W. A methodology to measure the semantic similarity between words based on the formal concept analysis. In WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies. Vol. 2. SciTePress. 2014. p. 313-321