ARHMM: Difference between revisions

From Jstacs
Jump to navigationJump to search
mNo edit summary
mNo edit summary
Line 4: Line 4:
== Description ==
== Description ==
=== Motivation ===
=== Motivation ===
Gene expression profiles of tumors are locally biased by deletions and duplications of chromosomal regions. Such local chromosomal dependencies affect expression levels of genes in close chromosomal proximity and should be utilized to improve the analysis of individual tumor expression profiles.
Changes in gene expression programs play a central role in the development of cancer. Deletions and duplications of chromosomal regions directly influence the expression levels of affected genes. Such local chromosomal dependencies lead to strong positive correlations of gene expression levels of neighboring genes. These dependencies should be utilized to improve the modeling and the analysis of individual tumor expression profiles.


=== Results ===
=== Results ===
We develop autoregressive higher-order Hidden Markov Models (ARHMMs) that exploit data-dependent local chromosomal dependencies to improve the identification of differentially expressed genes in tumors. ARHMMs combine enhanced local chromosomal modeling capabilities of higher-order state-transitions with direct modeling of chromosomal dependencies between successive gene expression levels using autoregressive emissions. We train ARHMMs using a specifically designed Bayesian Baum-Welch algorithm enabling the integration of prior knowledge on the distribution of differentially expressed genes. We apply ARHMMs to the analysis of breast cancer and brain tumor gene expression data and find that these models substantially improve the identification of differentially expressed genes with underlying copy number changes in comparison to existing methods. This is further supported by the identification of well-known and previously unreported hotspots of differential expression in glioblastoma multiforme highlighting the efficacy of ARHMMs for the analysis of individual tumor expression profiles.
We develop autoregressive higher-order Hidden Markov Models (HMMs) that carefully exploit local data-dependent chromosomal dependencies to improve the identification of differentially expressed genes in individual tumors. Autoregressive higher-order HMMs overcome generally existing limitations of standard first-order HMMs in the modeling of dependencies between genes in close chromosomal proximity by the usage of higher-order state-transitions in combination with autoregressive emissions as novel model features. We train autoregressive higher-order HMMs by a specifically developed Bayesian Baum-Welch algorithm that enables to integrate prior knowledge on the measurement distribution of genes. We apply autoregressive higher-order HMMs to the analysis of breast cancer and glioblastoma gene expression data and perform in-depth model evaluation studies. We find that autoregressive higher-order HMMs clearly improve the identification of overexpressed genes with underlying gene copy number duplications in breast cancer in comparison to mixture models, standard first- and higher-order HMMs, and other existing related methods. This performance benefit is attributed to the simultaneous usage of higher-order state-transitions in combination with autoregressive emissions, and could not be reached by using each of these two features independently. We also find that autoregressive higher-order HMMs are better able to identify differentially expressed genes in tumor independent of the underlying gene copy number status in comparison to the majority of related methods. This is further supported by the identification of well-known and previously unreported hotspots of differential expression in glioblastomas demonstrating the efficacy of autoregressive higher-order HMMs for the analysis of individual tumor expression profiles.


== Paper ==
== Paper ==
The paper '''''Exploiting local chromosomal dependencies in the analysis of tumor expression profiles by autoregressive higher-order Hidden Markov Models''''' has been submitted to [http://bioinformatics.oxfordjournals.org/ Bioinformatics].
The paper '''''Autoregressive Higher-Order Hidden Markov Models: Exploiting Local Chromosomal Dependencies in the Analysis of Tumor Expression Profiles''''' has been submitted to [http://www.ploscompbiol.org/ PLoS Computational Biology].


== Download ==
== Download ==
* [http://imc.zih.tu-dresden.de/downloads/ARHMM/ARHMM_Trainer.zip ARHMM]: A ZIP file containing a JAR file for analyzing tumor gene expression data sets by ARHMMs. This file also contains the breast cancer and the glioma gene expression data sets considered in our study.
* [http://imc.zih.tu-dresden.de/downloads/ARHMM/ARHMM_Trainer.zip ARHMM]: A ZIP file containing a JAR file for analyzing tumor gene expression data sets by ARHMMs. This file also contains the considered breast cancer and the glioblastoma gene expression data sets.


== Related Projects ==
== Related Projects ==

Revision as of 11:42, 27 January 2014

by Michael Seifert, Khalil Abou-El-Aradat, Barbara Klink, and Andreas Deutsch

Description

Motivation

Changes in gene expression programs play a central role in the development of cancer. Deletions and duplications of chromosomal regions directly influence the expression levels of affected genes. Such local chromosomal dependencies lead to strong positive correlations of gene expression levels of neighboring genes. These dependencies should be utilized to improve the modeling and the analysis of individual tumor expression profiles.

Results

We develop autoregressive higher-order Hidden Markov Models (HMMs) that carefully exploit local data-dependent chromosomal dependencies to improve the identification of differentially expressed genes in individual tumors. Autoregressive higher-order HMMs overcome generally existing limitations of standard first-order HMMs in the modeling of dependencies between genes in close chromosomal proximity by the usage of higher-order state-transitions in combination with autoregressive emissions as novel model features. We train autoregressive higher-order HMMs by a specifically developed Bayesian Baum-Welch algorithm that enables to integrate prior knowledge on the measurement distribution of genes. We apply autoregressive higher-order HMMs to the analysis of breast cancer and glioblastoma gene expression data and perform in-depth model evaluation studies. We find that autoregressive higher-order HMMs clearly improve the identification of overexpressed genes with underlying gene copy number duplications in breast cancer in comparison to mixture models, standard first- and higher-order HMMs, and other existing related methods. This performance benefit is attributed to the simultaneous usage of higher-order state-transitions in combination with autoregressive emissions, and could not be reached by using each of these two features independently. We also find that autoregressive higher-order HMMs are better able to identify differentially expressed genes in tumor independent of the underlying gene copy number status in comparison to the majority of related methods. This is further supported by the identification of well-known and previously unreported hotspots of differential expression in glioblastomas demonstrating the efficacy of autoregressive higher-order HMMs for the analysis of individual tumor expression profiles.

Paper

The paper Autoregressive Higher-Order Hidden Markov Models: Exploiting Local Chromosomal Dependencies in the Analysis of Tumor Expression Profiles has been submitted to PLoS Computational Biology.

Download

  • ARHMM: A ZIP file containing a JAR file for analyzing tumor gene expression data sets by ARHMMs. This file also contains the considered breast cancer and the glioblastoma gene expression data sets.

Related Projects

  • DSHMM: exploiting prior knowledge and gene distances in the analysis of tumor expression profiles
  • SHMM: utilizing gene-pair orientations for improved analysis of ChIP-chip promoter array data
  • PHHMM: improved analysis of Array-CGH data
  • MeDIP-HMM: HMM-based analysis of DNA methylation profiles
  • HMM Book: Hidden Markov Models with Applications in Computational Biology

Follow Me