Tracing database usage

Detecting main paths in database link networks

Qi Yu, Ying Ding, Min Song, Sungjeon Song, Jianhua Liu, Bin Zhang

Research output: Contribution to journal › Article

7 Citations (Scopus)

Abstract

This paper presents a database link network to measure the impact of databases on biological research. To this end, we used the 20,861 full-text articles from PubMed Central in the field of Bioinformatics. We then extracted databases from the methodology sections of these articles and their references. The list of databases was built with The 2013 Nucleic Acids Research Molecular Biology Database Collection (available online), which includes 1512 databases. The database link network was constructed from sets of pairs of databases mentioned in the methodology sections of full-text PubMed Central articles. The edges of the database link network represent the link relationships between two databases. The weight of each edge is determined either by the link frequency of the two databases (i.e., in the link-weighted database link network) or the topic similarity between two databases (i.e., in the similarity-weighted database link network). With the database link network, we analyzed the topological structure and main paths of the database link network to trace the usage, connection, and evolution of databases. We also conducted content analysis by comparing content similarities among the papers citing databases.
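The link-weighted network described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each article has already been reduced to the list of database names found in its methodology section, and it treats a link as an undirected co-mention within one article, which is a simplification. The function name build_link_weighted_network and the toy input are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_link_weighted_network(article_databases):
    """Count how often each unordered pair of databases appears together.

    article_databases: iterable of lists of database names, one list per
    article (assumed to come from an upstream extraction step).
    Returns a Counter mapping (db_a, db_b) tuples to link frequencies.
    """
    edge_weights = Counter()
    for databases in article_databases:
        # Deduplicate and sort so every pair has one canonical ordering.
        for pair in combinations(sorted(set(databases)), 2):
            edge_weights[pair] += 1
    return edge_weights


# Hypothetical toy input: database mentions per article.
articles = [
    ["GenBank", "UniProt", "PDB"],
    ["GenBank", "UniProt"],
    ["PDB", "UniProt", "UniProt"],
    ["KEGG"],
]

network = build_link_weighted_network(articles)
for (db_a, db_b), weight in sorted(network.items()):
    print(db_a, db_b, weight)
```

An article that mentions a single database contributes no edge, and repeated mentions within one article are counted once. A similarity-weighted variant would replace the frequency count with a topic-similarity score per pair, which this sketch does not attempt.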

Original language: English
Pages (from-to): 1-15
Number of pages: 15
Journal: Journal of Informetrics
Volume: 9
Issue number: 1
DOI: 10.1016/j.joi.2014.10.002
Publication status: Published - 2015 Jan 1

Fingerprint

methodology
biology
content analysis
Molecular biology
Nucleic acids
Bioinformatics

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Library and Information Sciences

Cite this

Yu, Qi; Ding, Ying; Song, Min; Song, Sungjeon; Liu, Jianhua; Zhang, Bin. / Tracing database usage: Detecting main paths in database link networks. In: Journal of Informetrics. 2015; Vol. 9, No. 1. pp. 1-15.
@article{2d2bc942ca94426a8d1cdab9b1935061,
title = "Tracing database usage: Detecting main paths in database link networks",
abstract = "This paper presents a database link network to measure the impact of databases on biological research. To this end, we used the 20,861 full-text articles from PubMed Central in the field of Bioinformatics. We then extracted databases from the methodology sections of these articles and their references. The list of databases was built with The 2013 Nucleic Acids Research Molecular Biology Database Collection (available online), which includes 1512 databases. The database link network was constructed from sets of pairs of databases mentioned in the methodology sections of full-text PubMed Central articles. The edges of the database link network represent the link relationships between two databases. The weight of each edge is determined either by the link frequency of the two databases (i.e., in the link-weighted database link network) or the topic similarity between two databases (i.e., in the similarity-weighted database link network). With the database link network, we analyzed the topological structure and main paths of the database link network to trace the usage, connection, and evolution of databases. We also conducted content analysis by comparing content similarities among the papers citing databases.",
author = "Qi Yu and Ying Ding and Min Song and Sungjeon Song and Jianhua Liu and Bin Zhang",
year = "2015",
month = "1",
day = "1",
doi = "10.1016/j.joi.2014.10.002",
language = "English",
volume = "9",
pages = "1--15",
journal = "Journal of Informetrics",
issn = "1751-1577",
publisher = "Elsevier BV",
number = "1",

}

TY - JOUR

T1 - Tracing database usage

T2 - Detecting main paths in database link networks

AU - Yu, Qi

AU - Ding, Ying

AU - Song, Min

AU - Song, Sungjeon

AU - Liu, Jianhua

AU - Zhang, Bin

PY - 2015/1/1

Y1 - 2015/1/1

AB - This paper presents a database link network to measure the impact of databases on biological research. To this end, we used the 20,861 full-text articles from PubMed Central in the field of Bioinformatics. We then extracted databases from the methodology sections of these articles and their references. The list of databases was built with The 2013 Nucleic Acids Research Molecular Biology Database Collection (available online), which includes 1512 databases. The database link network was constructed from sets of pairs of databases mentioned in the methodology sections of full-text PubMed Central articles. The edges of the database link network represent the link relationships between two databases. The weight of each edge is determined either by the link frequency of the two databases (i.e., in the link-weighted database link network) or the topic similarity between two databases (i.e., in the similarity-weighted database link network). With the database link network, we analyzed the topological structure and main paths of the database link network to trace the usage, connection, and evolution of databases. We also conducted content analysis by comparing content similarities among the papers citing databases.

UR - http://www.scopus.com/inward/record.url?scp=84912011894&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84912011894&partnerID=8YFLogxK

U2 - 10.1016/j.joi.2014.10.002

DO - 10.1016/j.joi.2014.10.002

M3 - Article

VL - 9

SP - 1

EP - 15

JO - Journal of Informetrics

JF - Journal of Informetrics

SN - 1751-1577

IS - 1

ER -