An efficient and effective ensemble of support vector machines for anti-diabetic drug failure prediction

Seokho Kang, Pilsung Kang, Taehoon Ko, Sungzoon Cho, Su Jin Rhee, Kyung Sang Yu

Research output: Contribution to journal › Article

29 Citations (Scopus)


The treatment of patients with type 2 diabetes is based mostly on drug therapies that aim to manage glucose levels appropriately. As the number of patients with type 2 diabetes continues to increase worldwide, predicting drug treatment failure has become an important issue. The support vector machine (SVM) can be a good method for the anti-diabetic drug failure prediction problem; however, it is difficult to train an SVM directly on large-scale medical datasets because of its high training time complexity, O(N^3), where N is the number of training data points. To address this limitation, we propose an efficient and effective ensemble of SVMs, called E3-SVM. The proposed method excludes superfluous data points when constructing an SVM ensemble, thereby yielding better classification performance. The method consists of two phases. The first phase selects the data points that are likely to be support vectors by applying data selection methods. The second phase constructs an SVM ensemble using the selected data points. We demonstrated the efficiency and effectiveness of the proposed method on a real-world dataset for the anti-diabetic drug failure prediction problem in type 2 diabetes. Experimental results show that the proposed method requires less training time than conventional SVM ensembles while achieving comparable performance. Moreover, the proposed method obtains more reliable prediction results across independent runs of constructing an ensemble. In conclusion, first, the proposed method provides an efficient and effective way to use SVMs on large-scale datasets. Second, we confirmed the suitability of SVMs for the anti-diabetic drug failure prediction problem, with an accuracy of about 80%.
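The two-phase idea in the abstract can be illustrated with a minimal sketch. This is not the authors' E3-SVM implementation: the abstract does not name the data selection methods used, so the neighbourhood heuristic below (keep a point if any of its k nearest neighbours has the opposite label) is an assumption, as are the synthetic two-class data, the bootstrap-based ensemble, the linear SVM trained by subgradient descent, and every function name and parameter value.

```python
import random

def make_data(n_per_class, rng):
    """Toy stand-in for a real dataset: two overlapping 2-D Gaussian classes."""
    X, y = [], []
    for label, centre in ((-1, -1.0), (1, 1.0)):
        for _ in range(n_per_class):
            X.append((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)))
            y.append(label)
    return X, y

def select_boundary_points(X, y, k=5):
    """Phase 1 (assumed heuristic): keep the indices of points that have an
    opposite-class point among their k nearest neighbours, since such points
    lie near the class boundary and are plausible support vectors."""
    keep = []
    for i, xi in enumerate(X):
        dists = sorted(
            ((xi[0] - xj[0]) ** 2 + (xi[1] - xj[1]) ** 2, j)
            for j, xj in enumerate(X) if j != i
        )
        if any(y[j] != y[i] for _, j in dists[:k]):
            keep.append(i)
    return keep

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Linear SVM fitted by batch subgradient descent on the
    L2-regularised hinge loss (a simple substitute for a full SVM solver)."""
    w = [0.0, 0.0, 0.0]  # two feature weights plus a bias term
    n = len(X)
    for _ in range(epochs):
        grad = [lam * w[0], lam * w[1], 0.0]  # the bias is not regularised
        for (x1, x2), label in zip(X, y):
            if label * (w[0] * x1 + w[1] * x2 + w[2]) < 1:  # margin violated
                grad[0] -= label * x1 / n
                grad[1] -= label * x2 / n
                grad[2] -= label / n
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w

def train_ensemble(X, y, selected, n_members=5, rng=None):
    """Phase 2: train each member on a bootstrap sample of the selected points."""
    rng = rng or random.Random(0)
    members = []
    for _ in range(n_members):
        idx = [rng.choice(selected) for _ in selected]
        members.append(train_linear_svm([X[i] for i in idx], [y[i] for i in idx]))
    return members

def predict(members, x):
    """Majority vote over the ensemble members (n_members is odd, so no ties)."""
    votes = sum(1 if w[0] * x[0] + w[1] * x[1] + w[2] >= 0 else -1 for w in members)
    return 1 if votes > 0 else -1

rng = random.Random(42)
X_train, y_train = make_data(200, rng)
X_test, y_test = make_data(200, rng)
selected = select_boundary_points(X_train, y_train)
ensemble = train_ensemble(X_train, y_train, selected, rng=rng)
accuracy = sum(predict(ensemble, x) == t for x, t in zip(X_test, y_test)) / len(y_test)
print(len(selected), "of", len(X_train), "points kept; test accuracy", round(accuracy, 3))
```

Because each member is trained only on the selected subset, training cost depends on the number of retained points rather than on N, which is the source of the speed-up the abstract describes. How much is saved, and whether accuracy holds up, depends on the data and on the selection rule actually used.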

Original language: English
Pages (from-to): 4265-4273
Number of pages: 9
Journal: Expert Systems with Applications
Issue number: 9
Publication status: Published - 2015 Jun 1



Keywords

  • Data selection
  • Drug failure prediction
  • Ensemble
  • Support vector machines
  • Type 2 diabetes

ASJC Scopus subject areas

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence
