Toward a coronavirus knowledge graph

Peng Zhang, Yi Bu, Peng Jiang, Xiaowen Shi, Bing Lun, Chongyan Chen, Arida Ferti Syafiandini, Ying Ding, Min Song

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

This study builds a coronavirus knowledge graph (KG) by merging two information sources. The first source is Analytical Graph (AG), which integrates more than 20 different public datasets related to drug discovery. The second source is CORD-19, a collection of published scientific articles related to COVID-19. We combined the chemogenomic entities in AG with entities extracted from CORD-19 to expand knowledge in the COVID-19 domain. Before populating the KG with those entities, we performed entity disambiguation on the CORD-19 collection using Wikidata. Our newly built KG contains at least 21,700 genes, 2,500 diseases, 94,000 phenotypes, and other biological entities (e.g., compounds, species, and cell lines). We defined 27 relationship types and used them to label each edge in our KG. We present two case studies to evaluate the KG’s usability: analyzing a subgraph (ego-centered network) around the angiotensin-converting enzyme (ACE), and revealing paths between pairs of biological entities (hydroxychloroquine and IL-6 receptor; chloroquine and STAT1). The ego-centered network captured information related to COVID-19. Based on our path evaluation, we also found significant COVID-19-related information in top-ranked paths with a depth of three.
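The two usability cases in the abstract (an ego-centered network around one entity, and paths of bounded depth between two entities) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy edges, the placeholder nodes (`gene_X`, `disease_Y`, `phenotype_Z`), the relation labels (`rel_1` to `rel_5`), and the function names are hypothetical and are not taken from the KG or from the authors' implementation. The sketch also does not attempt the path ranking mentioned in the abstract, since the abstract does not describe how paths are scored.

```python
from collections import defaultdict, deque

# Toy labeled edges (head, relation, tail). These are invented for
# illustration only and do NOT reflect facts stored in the actual KG.
TOY_EDGES = [
    ("hydroxychloroquine", "rel_1", "gene_X"),
    ("gene_X", "rel_2", "disease_Y"),
    ("disease_Y", "rel_3", "IL-6 receptor"),
    ("hydroxychloroquine", "rel_4", "IL-6 receptor"),
    ("ACE", "rel_2", "gene_X"),
    ("ACE", "rel_5", "phenotype_Z"),
]


def build_adjacency(edges):
    """Build an undirected adjacency map, ignoring relation labels."""
    adj = defaultdict(set)
    for head, _relation, tail in edges:
        adj[head].add(tail)
        adj[tail].add(head)
    return adj


def ego_network(adj, center, radius=1):
    """Return the set of nodes within `radius` hops of `center` (BFS)."""
    seen = {center}
    frontier = deque([(center, 0)])
    while frontier:
        node, dist = frontier.popleft()
        if dist == radius:
            continue  # do not expand beyond the requested radius
        for neighbor in adj.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, dist + 1))
    return seen


def simple_paths(adj, source, target, max_depth=3):
    """Enumerate simple paths from source to target with at most max_depth edges."""
    paths = []

    def dfs(node, path):
        if node == target:
            paths.append(list(path))
            return
        if len(path) - 1 == max_depth:
            return  # edge budget exhausted without reaching the target
        for neighbor in sorted(adj.get(node, ())):
            if neighbor not in path:  # keep the path simple (no repeats)
                path.append(neighbor)
                dfs(neighbor, path)
                path.pop()

    dfs(source, [source])
    return paths


if __name__ == "__main__":
    adj = build_adjacency(TOY_EDGES)
    print(sorted(ego_network(adj, "ACE")))
    for path in simple_paths(adj, "hydroxychloroquine", "IL-6 receptor"):
        print(" -> ".join(path))
```

On a graph of the size reported in the abstract, exhaustive enumeration like this would likely need an indexed graph store and a scoring step to surface the top-ranked paths; the depth bound of three is what keeps the search tractable in this sketch.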

Original language: English
Article number: 998
Journal: Genes
Volume: 12
Issue number: 7
DOIs
Publication status: Published - 2021 Jul

Bibliographical note

Funding Information:
This research was funded by a National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT), grant number NRF-2019R1A2C2002577, and by the National Science Foundation in the United States, grant number NSF Rapid 2028717.

Publisher Copyright:
© 2021 by the authors. Licensee MDPI, Basel, Switzerland.

All Science Journal Classification (ASJC) codes

  • Genetics
  • Genetics (clinical)
