Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data

BACK TO CONTENTS | PDF | PREVIOUS | NEXT

Title	Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
Authors	Haja N Kadarmideen¹*§ & Nathan S Watson-haigh²§
Affiliation	¹Department of Veterinary Clinical and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, 1870 Frederiksberg C, Copenhagen, Denmark; ²The Australian Wine Research Institute, Waite Institute, P.O. Box 197, Glen Osmond, SA 5064, Australia.
Email	hajak@sund.ku.dk; *Corresponding author
Article Type	Hypothesis
Date	Received September 10, 2012; Accepted September 12, 2012; Published September 21, 2012
Abstract	Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT and node ranks in two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to WGCNA method, PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended.
Citation	Kadarmideen & Watson-haigh, Bioinformation 8(18): 855-861 (2012)
Edited by	P Kangueane
ISSN	0973-2063
Publisher	Biomedical Informatics
License	This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.