TY - JOUR
T1 - SignatureClust
T2 - A tool for landmark gene-guided clustering
AU - Chopra, Pankaj
AU - Shin, Hanjun
AU - Kang, Jaewoo
AU - Lee, Sunwon
PY - 2012/3
Y1 - 2012/3
N2 - Over the last several years, many clustering algorithms have been applied to gene expression data. However, most clustering algorithms force the user into having one set of clusters, resulting in a restrictive biological interpretation of gene function. It would be difficult to interpret the complex biological regulatory mechanisms and genetic interactions from this restrictive interpretation of microarray expression data. The software package SignatureClust allows users to select a group of functionally related genes (called 'Landmark Genes'), and to project the gene expression data onto these genes. Compared to existing algorithms and software in this domain, our software package offers two unique benefits. First, by selecting different sets of landmark genes, it enables the user to cluster the microarray data from multiple biological perspectives. This encourages data exploration and discovery of new gene associations. Second, most packages associated with clustering provide internal validation measures, whereas our package validates the biological significance of the new clusters by retrieving significant ontology and pathway terms associated with the new clusters. SignatureClust is a free software tool that enables biologists to get multiple views of the microarray data. It highlights new gene associations that were not found using a traditional clustering algorithm. The software package 'SignatureClust' and the user manual can be downloaded from http://infos.korea.ac.kr/sigclust.php.
AB - Over the last several years, many clustering algorithms have been applied to gene expression data. However, most clustering algorithms force the user into having one set of clusters, resulting in a restrictive biological interpretation of gene function. It would be difficult to interpret the complex biological regulatory mechanisms and genetic interactions from this restrictive interpretation of microarray expression data. The software package SignatureClust allows users to select a group of functionally related genes (called 'Landmark Genes'), and to project the gene expression data onto these genes. Compared to existing algorithms and software in this domain, our software package offers two unique benefits. First, by selecting different sets of landmark genes, it enables the user to cluster the microarray data from multiple biological perspectives. This encourages data exploration and discovery of new gene associations. Second, most packages associated with clustering provide internal validation measures, whereas our package validates the biological significance of the new clusters by retrieving significant ontology and pathway terms associated with the new clusters. SignatureClust is a free software tool that enables biologists to get multiple views of the microarray data. It highlights new gene associations that were not found using a traditional clustering algorithm. The software package 'SignatureClust' and the user manual can be downloaded from http://infos.korea.ac.kr/sigclust.php.
UR - http://www.scopus.com/inward/record.url?scp=84856959857&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856959857&partnerID=8YFLogxK
U2 - 10.1007/s00500-011-0725-0
DO - 10.1007/s00500-011-0725-0
M3 - Article
AN - SCOPUS:84856959857
SN - 1432-7643
VL - 16
SP - 411
EP - 418
JO - Soft Computing
JF - Soft Computing
IS - 3
ER -