Co-expression networks in Chlamydomonas reveal significant rhythmicity in batch cultures and empower gene function discovery
Data files
Jan 28, 2021 version files 298.47 MB
-
Protocols.zip
939.20 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_1.xlsx
13.53 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_10.xlsx
78.14 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_11.xlsx
416.53 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_12.xlsx
111.11 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_13.xlsx
619.58 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_2.xlsx
10.70 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_3.xlsx
11.80 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_4.xlsx
11.33 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_5.xlsx
38.55 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_6.xlsx
20.62 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_7.xlsx
12.66 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_8.xlsx
18.70 KB
-
TPC2020-RA-00822R1_Supplemental_Data_Set_9.xlsx
94.24 KB
-
TPC2020-RA-00822R1_Supplemental_File_1.txt
59.46 MB
-
TPC2020-RA-00822R1_Supplemental_File_2.txt
10.66 MB
-
TPC2020-RA-00822R1_Supplemental_File_3.txt
22.60 MB
-
TPC2020-RA-00822R1_Supplemental_File_4.txt
60.84 MB
-
TPC2020-RA-00822R1_Supplemental_File_5.txt
8.96 MB
-
TPC2020-RA-00822R1_Supplemental_File_6.txt
19.85 MB
-
TPC2020-RA-00822R1_Supplemental_File_7.txt
55.94 MB
-
TPC2020-RA-00822R1_Supplemental_File_8.txt
57.61 MB
-
TPC2020-RA-00822R1_Supplemental_File_9.txt
141.56 KB
Abstract
The unicellular green alga Chlamydomonas reinhardtii is a choice reference system for the study of photosynthesis and chloroplast metabolism, cilium assembly and function, lipid and starch metabolism, and metal homeostasis. Despite decades of research, the functions of thousands of genes remain largely unknown, and new approaches are needed to categorically assign genes to cellular pathways. Growing collections of transcriptome and proteome data now allow a systematic approach based on integrative co-expression analysis. We used a dataset comprising 518 deep transcriptome samples derived from 58 independent experiments to identify potential co-expression relationships between genes. We visualized co-expression potential with the R package corrplot, to easily assess co-expression and anti-correlation between genes. We extracted several hundred high-confidence genes at the intersection of multiple curated lists involved in cilia, cell division, and photosynthesis, illustrating the power of our method. Surprisingly, Chlamydomonas experiments retained a significant rhythmic component across the transcriptome, suggesting an underappreciated variable during sample collection, even in samples collected in constant light. Our results therefore document substantial residual synchronization in batch cultures, contrary to assumptions of asynchrony. We provide step-by-step protocols for the analysis of co-expression across transcriptome data sets from Chlamydomonas and other species to help foster gene function discovery
For Chlamydomonas, the data set consists of 518 RNA-seq samples derived from 58 independent experiments, most published.
For Arabidospis, the data sets were downloaded from the AtGenExpress project site (http://jsp.weigelworld.org/AtGenExpress/resources/), and collated into a single file that consisted of 34 Arabidopsis accessions, 16 sets of etiolated seedlings exposed to various light treatments, 36 sets of seedlings exposed to pathogens, 13 cell culture samples, 68 sets each for shoots and roots exposed to various abiotic stresses, 79 developmental samples (72 from shoots or leaves, 7 from roots), and 18 sets each for leaves and roots subjected to iron deficiency, with controls included.
For Chlamydomonas, all genes were retained, and the data normalized by calculating the log2(FPKM+1).
For Arabidopsis, control probes were removed manually prior to normalization.