Deducing corticotropin-releasing hormone receptor type 1 signaling networks from gene expression data by usage of genetic algorithms and graphical Gaussian models

Dietrich Trümbach; Cornelia Graf; Benno Pütz; Claudia Kühne; Marcus Panhuysen; Peter Weber; Florian Holsboer; Wolfgang Wurst; Gerhard Welzl; Jan M Deussing

doi:10.1186/1752-0509-4-159

Deducing corticotropin-releasing hormone receptor type 1 signaling networks from gene expression data by usage of genetic algorithms and graphical Gaussian models

BMC Syst Biol. 2010 Nov 19:4:159. doi: 10.1186/1752-0509-4-159.

Authors

Dietrich Trümbach¹, Cornelia Graf, Benno Pütz, Claudia Kühne, Marcus Panhuysen, Peter Weber, Florian Holsboer, Wolfgang Wurst, Gerhard Welzl, Jan M Deussing

Affiliation

¹ Max Planck Institute of Psychiatry, Kraepelinstr, 2-10, 80804 Munich, Germany.

Abstract

Background: Dysregulation of the hypothalamic-pituitary-adrenal (HPA) axis is a hallmark of complex and multifactorial psychiatric diseases such as anxiety and mood disorders. About 50-60% of patients with major depression show HPA axis dysfunction, i.e. hyperactivity and impaired negative feedback regulation. The neuropeptide corticotropin-releasing hormone (CRH) and its receptor type 1 (CRHR1) are key regulators of this neuroendocrine stress axis. Therefore, we analyzed CRH/CRHR1-dependent gene expression data obtained from the pituitary corticotrope cell line AtT-20, a well-established in vitro model for CRHR1-mediated signal transduction. To extract significantly regulated genes from a genome-wide microarray data set and to deduce underlying CRHR1-dependent signaling networks, we combined supervised and unsupervised algorithms.

Results: We present an efficient variable selection strategy by consecutively applying univariate as well as multivariate methods followed by graphical models. First, feature preselection was used to exclude genes not differentially regulated over time from the dataset. For multivariate variable selection a maximum likelihood (MLHD) discriminant function within GALGO, an R package based on a genetic algorithm (GA), was chosen. The topmost genes representing major nodes in the expression network were ranked to find highly separating candidate genes. By using groups of five genes (chromosome size) in the discriminant function and repeating the genetic algorithm separately four times we found eleven genes occurring at least in three of the top ranked result lists of the four repetitions. In addition, we compared the results of GA/MLHD with the alternative optimization algorithms greedy selection and simulated annealing as well as with the state-of-the-art method random forest. In every case we obtained a clear overlap of the selected genes independently confirming the results of MLHD in combination with a genetic algorithm. With two unsupervised algorithms, principal component analysis and graphical Gaussian models, putative interactions of the candidate genes were determined and reconstructed by literature mining. Differential regulation of six candidate genes was validated by qRT-PCR.

Conclusions: The combination of supervised and unsupervised algorithms in this study allowed extracting a small subset of meaningful candidate genes from the genome-wide expression data set. Thereby, variable selection using different optimization algorithms based on linear classifiers as well as the nonlinear random forest method resulted in congruent candidate genes. The calculated interacting network connecting these new target genes was bioinformatically mapped to known CRHR1-dependent signaling pathways. Additionally, the differential expression of the identified target genes was confirmed experimentally.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Analysis of Variance
Animals
Cell Line
Computational Biology / methods*
Data Mining
Gene Expression Profiling*
Gene Regulatory Networks
Humans
Likelihood Functions
Linear Models
Models, Biological*
Normal Distribution
Principal Component Analysis
Rats
Receptors, Corticotropin-Releasing Hormone / metabolism*
Reproducibility of Results
Signal Transduction*

Substances

Receptors, Corticotropin-Releasing Hormone
CRF receptor type 1