TY - JOUR
T1 - Multi-Hypergraph Learning for Incomplete Multimodality Data
AU - Liu, Mingxia
AU - Gao, Yue
AU - Yap, Pew Thian
AU - Shen, Dinggang
N1 - Funding Information:
Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. The investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this study. A complete listing of ADNI investigators can be found online https: //adni.Loni.usc.edu/wp-content/uploads/how-to-apply/ADNI-Acknowledgement-List.pdf. This work was supported by the National Institutes of Health under Grants EB006733, EB008374, EB009634, MH100217, AG041721, AG042599, AG010129, and AG030514.
Funding Information:
Manuscript received January 14, 2017; revised June 23, 2017; accepted July 24, 2017. Date of publication July 25, 2017; date of current version June 29, 2018. This work was supported by the National Institutes of Health under Grants EB006733, EB008374, EB009634, MH100217, AG041721, AG042599, AG010129, and AG030514. (Corresponding author: Dinggang Shen.) M. Liu, Y. Gao, and P.-T. Yap are with the Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599 USA (e-mail: mingxia_liu@med.unc.edu; kevin.gaoy@gmail. com; ptyap@med.unc.edu).
Publisher Copyright:
© 2017 IEEE.
PY - 2018/7
Y1 - 2018/7
N2 - Multi-modality data convey complementary information that can be used to improve the accuracy of prediction models in disease diagnosis. However, effectively integrating multi-modality data remains a challenging problem, especially when the data are incomplete. For instance, more than half of the subjects in the Alzheimer's disease neuroimaging initiative (ADNI) database have no fluorodeoxyglucose positron emission tomography and cerebrospinal fluid data. Currently, there are two commonly used strategies to handle the problem of incomplete data: 1) discard samples having missing features; and 2) impute those missing values via specific techniques. In the first case, a significant amount of useful information is lost and, in the second case, additional noise and artifacts might be introduced into the data. Also, previous studies generally focus on the pairwise relationships among subjects, without considering their underlying complex (e.g., high-order) relationships. To address these issues, in this paper, we propose a multi-hypergraph learning method for dealing with incomplete multimodality data. Specifically, we first construct multiple hypergraphs to represent the high-order relationships among subjects by dividing them into several groups according to the availability of their data modalities. A hypergraph regularized transductive learning method is then applied to these groups for automatic diagnosis of brain diseases. Extensive evaluation of the proposed method using all subjects in the baseline ADNI database indicates that our method achieves promising results in AD/MCI classification, compared with the state-of-the-art methods.
AB - Multi-modality data convey complementary information that can be used to improve the accuracy of prediction models in disease diagnosis. However, effectively integrating multi-modality data remains a challenging problem, especially when the data are incomplete. For instance, more than half of the subjects in the Alzheimer's disease neuroimaging initiative (ADNI) database have no fluorodeoxyglucose positron emission tomography and cerebrospinal fluid data. Currently, there are two commonly used strategies to handle the problem of incomplete data: 1) discard samples having missing features; and 2) impute those missing values via specific techniques. In the first case, a significant amount of useful information is lost and, in the second case, additional noise and artifacts might be introduced into the data. Also, previous studies generally focus on the pairwise relationships among subjects, without considering their underlying complex (e.g., high-order) relationships. To address these issues, in this paper, we propose a multi-hypergraph learning method for dealing with incomplete multimodality data. Specifically, we first construct multiple hypergraphs to represent the high-order relationships among subjects by dividing them into several groups according to the availability of their data modalities. A hypergraph regularized transductive learning method is then applied to these groups for automatic diagnosis of brain diseases. Extensive evaluation of the proposed method using all subjects in the baseline ADNI database indicates that our method achieves promising results in AD/MCI classification, compared with the state-of-the-art methods.
KW - Alzheimer's disease
KW - classification
KW - hypergraph
KW - incomplete data
KW - multi-modality
UR - http://www.scopus.com/inward/record.url?scp=85028942363&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2017.2732287
DO - 10.1109/JBHI.2017.2732287
M3 - Article
C2 - 28749360
AN - SCOPUS:85028942363
SN - 2168-2194
VL - 22
SP - 1197
EP - 1208
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 4
ER -