Use of keyword hierarchies to interpret gene expression patterns

D R Masys; J B Welsh; J Lynn Fink; M Gribskov; I Klacansky; J Corbeil

doi:10.1093/bioinformatics/17.4.319

Bioinformatics. 2001 Apr;17(4):319-26. doi: 10.1093/bioinformatics/17.4.319.

Authors

D R Masys¹, J B Welsh, J Lynn Fink, M Gribskov, I Klacansky, J Corbeil

Affiliation

¹ Department of Medicine, UCSD Cancer Center, University of California, San Diego, San Diego, CA 92093, USA.

PMID: 11301300

Abstract

Motivation: High-density microarray technology permits the quantitative and simultaneous monitoring of thousands of genes. The interpretation challenge is to extract relevant information from this large amount of data. A growing variety of statistical analysis approaches is available to identify clusters of genes that share common expression characteristics, but these approaches provide no information regarding the biological similarities of genes within clusters. The published literature provides a potential source of information to assist in interpretation of clustering results.

Results: We describe a data mining method that uses indexing terms ('keywords') from the published literature linked to specific genes to present a view of the conceptual similarity of genes within a cluster or group of interest. The method takes advantage of the hierarchical nature of Medical Subject Headings used to index citations in the MEDLINE database, and the registry numbers applied to enzymes.
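
As an illustration only, the general idea of rolling gene-linked keywords up a hierarchy can be sketched in Python. This is a minimal sketch under stated assumptions, not the authors' implementation: the gene names, the tree numbers, and the functions `ancestors_and_self` and `keyword_profile` are all hypothetical. It assumes only that each keyword carries a dot-separated tree number whose prefixes identify its ancestors, as MeSH tree numbers do.

```python
from collections import Counter

# Hypothetical gene -> keyword tree-number annotations (illustrative only;
# not taken from the paper and not real MeSH assignments).
GENE_KEYWORDS = {
    "geneA": {"X01.100.200", "X02.300"},
    "geneB": {"X01.100.250"},
    "geneC": {"X01.100.200", "X03.400.500"},
}


def ancestors_and_self(tree_number):
    """Yield a tree number and each of its ancestors, most specific first.

    Example: "X01.100.200" yields "X01.100.200", "X01.100", "X01".
    """
    parts = tree_number.split(".")
    for depth in range(len(parts), 0, -1):
        yield ".".join(parts[:depth])


def keyword_profile(cluster, annotations):
    """Count, for every hierarchy node, how many genes in the cluster fall under it."""
    counts = Counter()
    for gene in cluster:
        nodes = set()
        for tree_number in annotations.get(gene, ()):
            nodes.update(ancestors_and_self(tree_number))
        # A gene contributes at most once to each node, however many
        # of its keywords sit beneath that node.
        counts.update(nodes)
    return counts


profile = keyword_profile(["geneA", "geneB", "geneC"], GENE_KEYWORDS)
for node, n_genes in sorted(profile.items()):
    print(node, n_genes)
```

In this toy data no single leaf keyword is shared by all three genes, yet all three fall under the node `X01.100`. Shared concepts of this kind only become visible once the hierarchy is taken into account.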

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Abstracting and Indexing
Databases, Factual*
Gene Expression Profiling*
Information Storage and Retrieval
MEDLINE
Oligonucleotide Array Sequence Analysis
