Abstract
This study presents three feature selection methods for identifying the metabolite features in nuclear magnetic resonance spectra that contribute to the distinction of samples among varying nutritional conditions. Principal component analysis, Fisher discriminant analysis, and Partial Least Square Discriminant Analysis (PLS-DA) were used to calculate the importance of individual metabolite feature in spectra. Moreover, an Orthogonal Signal Correction (OSC) filter was used to eliminate unnecessary variations in spectra. We evaluated the presented methods by comparing the ability of classification based on the features selected by each method. The result showed that the best classification was achieved from an OSC-PLS-DA model.
Original language | English |
---|---|
Pages (from-to) | 176-192 |
Number of pages | 17 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 2 |
Issue number | 2 |
DOIs | |
Publication status | Published - 2008 Jun |
Externally published | Yes |
Keywords
- Bioinformatics
- Data mining
- Feature selection
- Metabolomics
- Multivariate statistical analysis
- NMR
- Nuclear magnetic resonance
- OSC
- Orthogonal signal correction
ASJC Scopus subject areas
- Information Systems
- Biochemistry, Genetics and Molecular Biology(all)
- Library and Information Sciences