iRDA: a new filter towards predictive, stable, and enriched candidate genes

Article


Lai, H., Albrecht, A. and Steinhofel, K. 2015. iRDA: a new filter towards predictive, stable, and enriched candidate genes. BMC Genomics. 16. https://doi.org/10.1186/s12864-015-2129-5
TypeArticle
TitleiRDA: a new filter towards predictive, stable, and enriched candidate genes
AuthorsLai, H., Albrecht, A. and Steinhofel, K.
Abstract

Background: Gene expression profiling using high-throughput screening (HTS) technologies allows clinical researchers to find prognosis gene signatures that could better discriminate between different phenotypes and serve as potential biological markers in disease diagnoses. In recent years, many feature selection methods have been devised for finding such discriminative genes, and more recently information theoretic filters have also been introduced for capturing feature-to-class relevance and feature-to-feature correlations in microarray-based classification.
Methods: In this paper, we present and fully formulate a new multivariate filter, iRDA, for the discovery of HTS gene-expression candidate genes. The filter constitutes a four-step framework and includes feature relevance, feature redundancy, and feature interdependence in the context of feature-pairs. The method is based upon approximate Markov blankets, information theory, several heuristic search strategies with forward, backward and insertion phases, and the method is aiming at higher order gene interactions.
Results: To show the strengths of iRDA, three performance measures, two evaluation schemes, two stability index sets, and the gene set enrichment analysis (GSEA) are all employed in our experimental studies. Its effectiveness has been validated by using seven well-known cancer gene-expression benchmarks and four other disease experiments, including a comparison to three popular information theoretic filters. In terms of classification performance, candidate genes selected by iRDA perform better than the sets discovered by the other three filters. Two stability measures indicate that iRDA is the most robust with the least variance. GSEA shows that iRDA produces more statistically enriched gene sets on five out of the six benchmark datasets.
Conclusions: Through the classification performance, the stability performance, and the enrichment analysis, iRDA is a promising filter to find predictive, stable, and enriched gene-expression candidate genes.

KeywordsCancer phenotype prediction; Feature selection and classification; Microarray; Prognosis gene signature; Transcriptomic profiling
PublisherBioMed Central
JournalBMC Genomics
ISSN1471-2164
Publication dates
Print09 Dec 2015
Publication process dates
Deposited10 Dec 2015
Accepted22 Oct 2015
Output statusPublished
Additional information

Article number: 1041

Web address (URL)http://www.biomedcentral.com/1471-2164/16/1041
Digital Object Identifier (DOI)https://doi.org/10.1186/s12864-015-2129-5
LanguageEnglish
Permalink -

https://repository.mdx.ac.uk/item/86156

  • 11
    total views
  • 0
    total downloads
  • 1
    views this month
  • 0
    downloads this month

Export as