Relational analysis of CpG islands methylation and gene expression in human lymphomas using possibilistic C-means clustering and modified cluster fuzzy density

IEEE/ACM Trans Comput Biol Bioinform. 2007 Apr-Jun;4(2):176-89. doi: 10.1109/TCBB.2007.070205.

Abstract

Heterogeneous genetic and epigenetic alterations are commonly found in human non-Hodgkin's lymphomas (NHL). One such epigenetic alteration is aberrant methylation of gene promoter-related CpG islands, where hypermethylation frequently results in transcriptional inactivation of target genes, while a decrease or loss of promoter methylation (hypomethylation) is frequently associated with transcriptional activation. Discovering genes with these relationships in NHL or other types of cancers could lead to a better understanding of the pathobiology of these diseases. The simultaneous analysis of promoter methylation using Differential Methylation Hybridization (DMH) and its associated gene expression using Expressed CpG Island Sequence Tag (ECIST) microarrays generates a large volume of methylation-expression relational data. To analyze this data, we propose a set of algorithms based on fuzzy sets theory, in particular Possibilistic c-Means (PCM) and cluster fuzzy density. For each gene, these algorithms calculate measures of confidence of various methylation-expression relationships in each NHL subclass. Thus, these tools can be used as a means of high volume data exploration to better guide biological confirmation using independent molecular biology methods.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Artificial Intelligence
  • Biomarkers, Tumor / genetics*
  • Cluster Analysis
  • Computer Simulation
  • CpG Islands / genetics*
  • DNA Methylation
  • Data Interpretation, Statistical
  • Fuzzy Logic
  • Gene Expression Profiling / methods*
  • Humans
  • Lymphoma, Non-Hodgkin / genetics*
  • Models, Genetic
  • Models, Statistical
  • Neoplasm Proteins / genetics*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Pattern Recognition, Automated / methods*
  • Statistics as Topic

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins