Time gap analysis by the topic model-based temporal technique

Do Heon Jeong, Min Song

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

This study proposes a temporal analysis method to utilize heterogeneous resources such as papers, patents, and web news articles in an integrated manner. We analyzed the time gap phenomena between three resources and two academic areas by conducting text mining-based content analysis. To this end, a topic modeling technique, Latent Dirichlet Allocation (LDA) was used to estimate the optimal time gaps among three resources (papers, patents, and web news articles) in two research domains. The contributions of this study are summarized as follows: firstly, we propose a new temporal analysis method to understand the content characteristics and trends of heterogeneous multiple resources in an integrated manner. We applied it to measure the exact time intervals between academic areas by understanding the time gap phenomena. The results of temporal analysis showed that the resources of the medical field had more up-to-date property than those of the computer field, and thus prompter disclosure to the public. Secondly, we adopted a power-law exponent measurement and content analysis to evaluate the proposed method. With the proposed method, we demonstrate how to analyze heterogeneous resources more precisely and comprehensively.

Original languageEnglish
Pages (from-to)776-790
Number of pages15
JournalJournal of Informetrics
Volume8
Issue number3
DOIs
Publication statusPublished - 2014 Jul

Fingerprint

resources
patent
content analysis
news
time
Law
trend

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Library and Information Sciences

Cite this

@article{52e7293ab11440ac8a734d31fd7562f2,
title = "Time gap analysis by the topic model-based temporal technique",
abstract = "This study proposes a temporal analysis method to utilize heterogeneous resources such as papers, patents, and web news articles in an integrated manner. We analyzed the time gap phenomena between three resources and two academic areas by conducting text mining-based content analysis. To this end, a topic modeling technique, Latent Dirichlet Allocation (LDA) was used to estimate the optimal time gaps among three resources (papers, patents, and web news articles) in two research domains. The contributions of this study are summarized as follows: firstly, we propose a new temporal analysis method to understand the content characteristics and trends of heterogeneous multiple resources in an integrated manner. We applied it to measure the exact time intervals between academic areas by understanding the time gap phenomena. The results of temporal analysis showed that the resources of the medical field had more up-to-date property than those of the computer field, and thus prompter disclosure to the public. Secondly, we adopted a power-law exponent measurement and content analysis to evaluate the proposed method. With the proposed method, we demonstrate how to analyze heterogeneous resources more precisely and comprehensively.",
author = "Jeong, {Do Heon} and Min Song",
year = "2014",
month = "7",
doi = "10.1016/j.joi.2014.07.005",
language = "English",
volume = "8",
pages = "776--790",
journal = "Journal of Informetrics",
issn = "1751-1577",
publisher = "Elsevier BV",
number = "3",

}

Time gap analysis by the topic model-based temporal technique. / Jeong, Do Heon; Song, Min.

In: Journal of Informetrics, Vol. 8, No. 3, 07.2014, p. 776-790.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Time gap analysis by the topic model-based temporal technique

AU - Jeong, Do Heon

AU - Song, Min

PY - 2014/7

Y1 - 2014/7

N2 - This study proposes a temporal analysis method to utilize heterogeneous resources such as papers, patents, and web news articles in an integrated manner. We analyzed the time gap phenomena between three resources and two academic areas by conducting text mining-based content analysis. To this end, a topic modeling technique, Latent Dirichlet Allocation (LDA) was used to estimate the optimal time gaps among three resources (papers, patents, and web news articles) in two research domains. The contributions of this study are summarized as follows: firstly, we propose a new temporal analysis method to understand the content characteristics and trends of heterogeneous multiple resources in an integrated manner. We applied it to measure the exact time intervals between academic areas by understanding the time gap phenomena. The results of temporal analysis showed that the resources of the medical field had more up-to-date property than those of the computer field, and thus prompter disclosure to the public. Secondly, we adopted a power-law exponent measurement and content analysis to evaluate the proposed method. With the proposed method, we demonstrate how to analyze heterogeneous resources more precisely and comprehensively.

AB - This study proposes a temporal analysis method to utilize heterogeneous resources such as papers, patents, and web news articles in an integrated manner. We analyzed the time gap phenomena between three resources and two academic areas by conducting text mining-based content analysis. To this end, a topic modeling technique, Latent Dirichlet Allocation (LDA) was used to estimate the optimal time gaps among three resources (papers, patents, and web news articles) in two research domains. The contributions of this study are summarized as follows: firstly, we propose a new temporal analysis method to understand the content characteristics and trends of heterogeneous multiple resources in an integrated manner. We applied it to measure the exact time intervals between academic areas by understanding the time gap phenomena. The results of temporal analysis showed that the resources of the medical field had more up-to-date property than those of the computer field, and thus prompter disclosure to the public. Secondly, we adopted a power-law exponent measurement and content analysis to evaluate the proposed method. With the proposed method, we demonstrate how to analyze heterogeneous resources more precisely and comprehensively.

UR - http://www.scopus.com/inward/record.url?scp=84924525150&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84924525150&partnerID=8YFLogxK

U2 - 10.1016/j.joi.2014.07.005

DO - 10.1016/j.joi.2014.07.005

M3 - Article

AN - SCOPUS:84924525150

VL - 8

SP - 776

EP - 790

JO - Journal of Informetrics

JF - Journal of Informetrics

SN - 1751-1577

IS - 3

ER -