Analyzing topic evolution in bioinformatics: investigation of dynamics of the field with conference data in DBLP

Min Song, Go Eun Heo, Su Yeon Kim

Research output: Contribution to journalArticle

19 Citations (Scopus)

Abstract

In this paper we analyze topic evolution over time within bioinformatics to uncover the underlying dynamics of that field, focusing on the recent developments in the 2000s. We select 33 bioinformatics related conferences indexed in DBLP from 2000 to 2011. The major reason for choosing DBLP as the data source instead of PubMed is that DBLP retains most bioinformatics related conferences, and to study dynamics of the field, conference papers are more suitable than journal papers. We divide a period of a dozen years into four periods: period 1 (2000–2002), period 2 (2003–2005), period 3 (2006–2008) and period 4 (2009–2011). To conduct topic evolution analysis, we employ three major procedures, and for each procedure, we develop the following novel technique: the Markov Random Field-based topic clustering, automatic cluster labeling, and topic similarity based on Within-Period Cluster Similarity and Between-Period Cluster Similarity. The experimental results show that there are distinct topic transition patterns between different time periods. From period 1 to period 3, new topics seem to have emerged and expanded, whereas from period 3 to period 4, topics are merged and display more rigorous interaction with each other. This trend is confirmed by the collaboration pattern over time.

Original languageEnglish
Pages (from-to)397-428
Number of pages32
JournalScientometrics
Volume101
Issue number1
DOIs
Publication statusPublished - 2014 Jan 1

Fingerprint

Bioinformatics
Labeling
trend
interaction
time

All Science Journal Classification (ASJC) codes

  • Social Sciences(all)
  • Computer Science Applications
  • Library and Information Sciences

Cite this

@article{9b26920f8e074531aecff1030cd60702,
title = "Analyzing topic evolution in bioinformatics: investigation of dynamics of the field with conference data in DBLP",
abstract = "In this paper we analyze topic evolution over time within bioinformatics to uncover the underlying dynamics of that field, focusing on the recent developments in the 2000s. We select 33 bioinformatics related conferences indexed in DBLP from 2000 to 2011. The major reason for choosing DBLP as the data source instead of PubMed is that DBLP retains most bioinformatics related conferences, and to study dynamics of the field, conference papers are more suitable than journal papers. We divide a period of a dozen years into four periods: period 1 (2000–2002), period 2 (2003–2005), period 3 (2006–2008) and period 4 (2009–2011). To conduct topic evolution analysis, we employ three major procedures, and for each procedure, we develop the following novel technique: the Markov Random Field-based topic clustering, automatic cluster labeling, and topic similarity based on Within-Period Cluster Similarity and Between-Period Cluster Similarity. The experimental results show that there are distinct topic transition patterns between different time periods. From period 1 to period 3, new topics seem to have emerged and expanded, whereas from period 3 to period 4, topics are merged and display more rigorous interaction with each other. This trend is confirmed by the collaboration pattern over time.",
author = "Min Song and Heo, {Go Eun} and Kim, {Su Yeon}",
year = "2014",
month = "1",
day = "1",
doi = "10.1007/s11192-014-1246-2",
language = "English",
volume = "101",
pages = "397--428",
journal = "Scientometrics",
issn = "0138-9130",
publisher = "Springer Netherlands",
number = "1",

}

Analyzing topic evolution in bioinformatics : investigation of dynamics of the field with conference data in DBLP. / Song, Min; Heo, Go Eun; Kim, Su Yeon.

In: Scientometrics, Vol. 101, No. 1, 01.01.2014, p. 397-428.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Analyzing topic evolution in bioinformatics

T2 - investigation of dynamics of the field with conference data in DBLP

AU - Song, Min

AU - Heo, Go Eun

AU - Kim, Su Yeon

PY - 2014/1/1

Y1 - 2014/1/1

N2 - In this paper we analyze topic evolution over time within bioinformatics to uncover the underlying dynamics of that field, focusing on the recent developments in the 2000s. We select 33 bioinformatics related conferences indexed in DBLP from 2000 to 2011. The major reason for choosing DBLP as the data source instead of PubMed is that DBLP retains most bioinformatics related conferences, and to study dynamics of the field, conference papers are more suitable than journal papers. We divide a period of a dozen years into four periods: period 1 (2000–2002), period 2 (2003–2005), period 3 (2006–2008) and period 4 (2009–2011). To conduct topic evolution analysis, we employ three major procedures, and for each procedure, we develop the following novel technique: the Markov Random Field-based topic clustering, automatic cluster labeling, and topic similarity based on Within-Period Cluster Similarity and Between-Period Cluster Similarity. The experimental results show that there are distinct topic transition patterns between different time periods. From period 1 to period 3, new topics seem to have emerged and expanded, whereas from period 3 to period 4, topics are merged and display more rigorous interaction with each other. This trend is confirmed by the collaboration pattern over time.

AB - In this paper we analyze topic evolution over time within bioinformatics to uncover the underlying dynamics of that field, focusing on the recent developments in the 2000s. We select 33 bioinformatics related conferences indexed in DBLP from 2000 to 2011. The major reason for choosing DBLP as the data source instead of PubMed is that DBLP retains most bioinformatics related conferences, and to study dynamics of the field, conference papers are more suitable than journal papers. We divide a period of a dozen years into four periods: period 1 (2000–2002), period 2 (2003–2005), period 3 (2006–2008) and period 4 (2009–2011). To conduct topic evolution analysis, we employ three major procedures, and for each procedure, we develop the following novel technique: the Markov Random Field-based topic clustering, automatic cluster labeling, and topic similarity based on Within-Period Cluster Similarity and Between-Period Cluster Similarity. The experimental results show that there are distinct topic transition patterns between different time periods. From period 1 to period 3, new topics seem to have emerged and expanded, whereas from period 3 to period 4, topics are merged and display more rigorous interaction with each other. This trend is confirmed by the collaboration pattern over time.

UR - http://www.scopus.com/inward/record.url?scp=84893661858&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893661858&partnerID=8YFLogxK

U2 - 10.1007/s11192-014-1246-2

DO - 10.1007/s11192-014-1246-2

M3 - Article

AN - SCOPUS:84893661858

VL - 101

SP - 397

EP - 428

JO - Scientometrics

JF - Scientometrics

SN - 0138-9130

IS - 1

ER -