Genetic algorithm-based feature selection in high-resolution NMR spectra

Hyun Woo Cho, Seoung Bum Kim, Myong K. Jeong, Youngja Park, Thomas R. Ziegler, Dean P. Jones

Research output: Contribution to journalArticle

32 Citations (Scopus)

Abstract

High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.

Original languageEnglish
Pages (from-to)967-975
Number of pages9
JournalExpert Systems with Applications
Volume35
Issue number3
DOIs
Publication statusPublished - 2008 Oct 1
Externally publishedYes

Fingerprint

Feature extraction
Genetic algorithms
Nuclear magnetic resonance
Plasma (human)
Discriminant analysis
Biological systems
Nutrition
Metabolites
Nuclear magnetic resonance spectroscopy
Pattern recognition

Keywords

  • Discrimination
  • Feature selection
  • Genetic algorithm (GA)
  • Metabolomics
  • Nuclear magnetic resonance (NMR)
  • Orthogonal signal correction filter

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications

Cite this

Genetic algorithm-based feature selection in high-resolution NMR spectra. / Cho, Hyun Woo; Kim, Seoung Bum; Jeong, Myong K.; Park, Youngja; Ziegler, Thomas R.; Jones, Dean P.

In: Expert Systems with Applications, Vol. 35, No. 3, 01.10.2008, p. 967-975.

Research output: Contribution to journalArticle

Cho, Hyun Woo ; Kim, Seoung Bum ; Jeong, Myong K. ; Park, Youngja ; Ziegler, Thomas R. ; Jones, Dean P. / Genetic algorithm-based feature selection in high-resolution NMR spectra. In: Expert Systems with Applications. 2008 ; Vol. 35, No. 3. pp. 967-975.
@article{02099a2db3c74e819a3b62b5c435f054,
title = "Genetic algorithm-based feature selection in high-resolution NMR spectra",
abstract = "High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.",
keywords = "Discrimination, Feature selection, Genetic algorithm (GA), Metabolomics, Nuclear magnetic resonance (NMR), Orthogonal signal correction filter",
author = "Cho, {Hyun Woo} and Kim, {Seoung Bum} and Jeong, {Myong K.} and Youngja Park and Ziegler, {Thomas R.} and Jones, {Dean P.}",
year = "2008",
month = "10",
day = "1",
doi = "10.1016/j.eswa.2007.08.050",
language = "English",
volume = "35",
pages = "967--975",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",
number = "3",

}

TY - JOUR

T1 - Genetic algorithm-based feature selection in high-resolution NMR spectra

AU - Cho, Hyun Woo

AU - Kim, Seoung Bum

AU - Jeong, Myong K.

AU - Park, Youngja

AU - Ziegler, Thomas R.

AU - Jones, Dean P.

PY - 2008/10/1

Y1 - 2008/10/1

N2 - High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.

AB - High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means for detection and recognition of metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns from NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine major metabolite features to play a significant role in discrimination of samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of NMR spectra in order to remove any unwanted variation of the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and the partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.

KW - Discrimination

KW - Feature selection

KW - Genetic algorithm (GA)

KW - Metabolomics

KW - Nuclear magnetic resonance (NMR)

KW - Orthogonal signal correction filter

UR - http://www.scopus.com/inward/record.url?scp=44949219266&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=44949219266&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2007.08.050

DO - 10.1016/j.eswa.2007.08.050

M3 - Article

AN - SCOPUS:44949219266

VL - 35

SP - 967

EP - 975

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

IS - 3

ER -