Classification of multiple cancer types by multicategory support vector machines using gene expression data

Yoonkyung Lee, Cheol-Koo Lee

Research output: Contribution to journalArticle

226 Citations (Scopus)

Abstract

Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods.

Original languageEnglish
Pages (from-to)1132-1139
Number of pages8
JournalBioinformatics
Volume19
Issue number9
DOIs
Publication statusPublished - 2003 Jun 12
Externally publishedYes

Fingerprint

Gene Expression Data
Gene expression
Support vector machines
Support Vector Machine
Cancer
Gene Expression
Tumors
Tumor
Neoplasms
Cancer Classification
DNA Microarray
Gene Expression Profile
Leukemia
Multi-class
Microarrays
DNA
Genes
Flexibility
Binary
Gene

ASJC Scopus subject areas

  • Clinical Biochemistry
  • Computer Science Applications
  • Computational Theory and Mathematics

Cite this

Classification of multiple cancer types by multicategory support vector machines using gene expression data. / Lee, Yoonkyung; Lee, Cheol-Koo.

In: Bioinformatics, Vol. 19, No. 9, 12.06.2003, p. 1132-1139.

Research output: Contribution to journalArticle

@article{c792a47bef594261bbc75cb68f0fae1e,
title = "Classification of multiple cancer types by multicategory support vector machines using gene expression data",
abstract = "Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods.",
author = "Yoonkyung Lee and Cheol-Koo Lee",
year = "2003",
month = "6",
day = "12",
doi = "10.1093/bioinformatics/btg102",
language = "English",
volume = "19",
pages = "1132--1139",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "9",

}

TY - JOUR

T1 - Classification of multiple cancer types by multicategory support vector machines using gene expression data

AU - Lee, Yoonkyung

AU - Lee, Cheol-Koo

PY - 2003/6/12

Y1 - 2003/6/12

N2 - Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods.

AB - Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods.

UR - http://www.scopus.com/inward/record.url?scp=0038729565&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0038729565&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btg102

DO - 10.1093/bioinformatics/btg102

M3 - Article

VL - 19

SP - 1132

EP - 1139

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 9

ER -