CGM: A biomedical text categorization approach using concept graph mining

Said Bleik, Min Song, Aaron Smalter, Jun Huan, Gerald Lushington

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Text Categorization is used to organize and manage biomedical text databases that are growing at an exponential rate. Feature representations for documents are a crucial factor for the performance of text categorization. Most of the successful existing techniques use a vector representation based on key entities extracted from the text. In this paper we investigate a new direction where we represent a document as a graph. In this representation we identify high level concepts and build a rich graph structure that contains additional concepts and relationships. We then use graph kernel techniques to perform text categorization. The results show a significant improvement in accuracy when compared to categorization based on only the extracted concepts.

Original languageEnglish
Title of host publicationProceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009
Pages38-43
Number of pages6
DOIs
Publication statusPublished - 2009
Event2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009 - Washington, DC, United States
Duration: 2009 Nov 12009 Nov 4

Publication series

NameProceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009

Other

Other2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009
CountryUnited States
CityWashington, DC
Period09/11/109/11/4

All Science Journal Classification (ASJC) codes

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Fingerprint Dive into the research topics of 'CGM: A biomedical text categorization approach using concept graph mining'. Together they form a unique fingerprint.

  • Cite this

    Bleik, S., Song, M., Smalter, A., Huan, J., & Lushington, G. (2009). CGM: A biomedical text categorization approach using concept graph mining. In Proceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009 (pp. 38-43). [5332134] (Proceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009). https://doi.org/10.1109/BIBMW.2009.5332134