Review

Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension

Yoonsuh Jung, Jianhua Hu

Research output: Contribution to journalReview article

Abstract

A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.

Original languageEnglish
Pages (from-to)169-178
Number of pages10
JournalJournal of the Korean Statistical Society
Volume48
Issue number2
DOIs
Publication statusPublished - 2019 Jun 1

Fingerprint

High-dimensional
Phenotype
Term
Model
Interaction
Modeling
Microarray
Review
Multiplication
Regression

Keywords

  • ANOVA
  • BIC
  • High dimension
  • Variable selection

ASJC Scopus subject areas

  • Statistics and Probability

Cite this

Review : Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension. / Jung, Yoonsuh; Hu, Jianhua.

In: Journal of the Korean Statistical Society, Vol. 48, No. 2, 01.06.2019, p. 169-178.

Research output: Contribution to journalReview article

@article{7c3ec3ce0993428bb79c0641894ce5a0,
title = "Review: Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension",
abstract = "A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.",
keywords = "ANOVA, BIC, High dimension, Variable selection",
author = "Yoonsuh Jung and Jianhua Hu",
year = "2019",
month = "6",
day = "1",
doi = "10.1016/j.jkss.2018.10.002",
language = "English",
volume = "48",
pages = "169--178",
journal = "Journal of the Korean Statistical Society",
issn = "1226-3192",
publisher = "Korean Statistical Society",
number = "2",

}

TY - JOUR

T1 - Review

T2 - Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension

AU - Jung, Yoonsuh

AU - Hu, Jianhua

PY - 2019/6/1

Y1 - 2019/6/1

N2 - A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.

AB - A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.

KW - ANOVA

KW - BIC

KW - High dimension

KW - Variable selection

UR - http://www.scopus.com/inward/record.url?scp=85056313113&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85056313113&partnerID=8YFLogxK

U2 - 10.1016/j.jkss.2018.10.002

DO - 10.1016/j.jkss.2018.10.002

M3 - Review article

VL - 48

SP - 169

EP - 178

JO - Journal of the Korean Statistical Society

JF - Journal of the Korean Statistical Society

SN - 1226-3192

IS - 2

ER -