TY - JOUR
T1 - A Comparison of Two-Stage Approaches Based on Penalized Regression for Estimating Gene Networks
AU - Lee, Minhyeok
AU - Seok, Junhee
AU - Tae, Donghyun
AU - Zhong, Hua
AU - Han, Sung Won
N1 - Funding Information:
This research was supported by grants from National Research Foundation of Korea (NRF-2014R1A1A2A16050527, NRF-2016R1D1A1B03931077) as well as a grant from Korea Evaluation Institute of Industrial Technology (10073166) and by a Korea University Grant (K1607901).
Publisher Copyright:
© Mary Ann Liebert, Inc. 2017.
PY - 2017/7/1
Y1 - 2017/7/1
N2 - Graphical models are commonly used for illustrating gene networks. However, estimating directed networks are generally challenging because of the limited sample size compared with the dimensionality of an experiment. Many previous studies have provided insight into the problem, and recently, two-stage approaches have shown significant improvements for estimating directed acyclic graphs. These two-stage approaches find neighborhoods in the first stage and determine the directions of the edges in the second stage. However, although numerous methods to find neighborhoods and determine directions exist, the most appropriate method to use with two-stage approaches has not been evaluated. Therefore, we compared such methods through extensive simulations to select effective methods for the first and second stages. Results show that adaptive lasso is the most effective for both stages in most cases. In addition, we compared methods to handle asymmetric entries to estimate an undirected network. Some previous studies indicate that the method used to handle asymmetric entries does not affect performance significantly; however, we found that the selection of the handling method for such edges is a significant factor for finding neighborhoods when using adaptive lasso.
AB - Graphical models are commonly used for illustrating gene networks. However, estimating directed networks are generally challenging because of the limited sample size compared with the dimensionality of an experiment. Many previous studies have provided insight into the problem, and recently, two-stage approaches have shown significant improvements for estimating directed acyclic graphs. These two-stage approaches find neighborhoods in the first stage and determine the directions of the edges in the second stage. However, although numerous methods to find neighborhoods and determine directions exist, the most appropriate method to use with two-stage approaches has not been evaluated. Therefore, we compared such methods through extensive simulations to select effective methods for the first and second stages. Results show that adaptive lasso is the most effective for both stages in most cases. In addition, we compared methods to handle asymmetric entries to estimate an undirected network. Some previous studies indicate that the method used to handle asymmetric entries does not affect performance significantly; however, we found that the selection of the handling method for such edges is a significant factor for finding neighborhoods when using adaptive lasso.
KW - gene network
KW - graphical model
KW - penalized regression
UR - http://www.scopus.com/inward/record.url?scp=85021745755&partnerID=8YFLogxK
U2 - 10.1089/cmb.2017.0052
DO - 10.1089/cmb.2017.0052
M3 - Article
C2 - 28541712
AN - SCOPUS:85021745755
VL - 24
SP - 709
EP - 720
JO - Journal of Computational Biology
JF - Journal of Computational Biology
SN - 1066-5277
IS - 7
ER -