GAN-WGCNA: Calculating gene modules to identify key intermediate regulators in cocaine addiction
by Taehyeong Kim, Kyoungmin Lee, Mookyung Cheon, Wookyung Yu
Understanding time-series interplay of genes is essential for diagnosis and treatment of disease. Spatio-temporally enriched NGS data contain important underlying regulatory mechanisms of biological processes. Generative adversarial networks (GANs) have been used to augment biological data to describe hidden intermediate time-series gene expression profiles during specific biological processes. Developing a pipeline that uses augmented time-series gene expression profiles is needed to provide an unbiased systemic-level map of biological processes and test for the statistical significance of the generated dataset, leading to the discovery of hidden intermediate regulators. Two analytical methods, GAN-WGCNA (weighted gene co-expression network analysis) and rDEG (rescued differentially expressed gene), interpreted spatiotemporal information and screened intermediate genes during cocaine addiction. GAN-WGCNA enables correlation calculations between phenotype and gene expression profiles and visualizes time-series gene module interplay. We analyzed a transcriptome dataset of two weeks of cocaine self-administration in C57BL/6J mice. Utilizing GAN-WGCNA, two genes (Alcam and Celf4) were selected as missed intermediate significant genes that showed high correlation with addiction behavior. Their correlation with addictive behavior was observed to be notably significant in aspect of statistics, and their expression and co-regulation were comprehensively mapped in terms of time, brain region, and biological process.