Genomic relationships of Glycine remota, a recently discovered perennial relative of soybean, within the legume genus Glycine
Data files
Mar 21, 2023 version files 16.02 GB
-
ABBA_BABA.zip
11.24 MB
-
Figure_and_table_legends.docx
12.90 KB
-
full_ribosome_RAxML.zip
11.14 KB
-
Glycine_remota_raw_assemblies.zip
49.16 KB
-
heterozygosity_angsd.zip
29.18 KB
-
HistoneH3D_full_set_RAxML.zip
5.18 KB
-
HistoneH3D_pruned_RAxML.zip
4.95 KB
-
ITS_only_RAxML.zip
4.69 KB
-
plastome_RAxML.zip
396.73 KB
-
plastomes_assemblies_pruned_and_annotations.zip
867.26 KB
-
README.md
10.60 KB
-
reference_genome.zip
305.85 MB
-
SNP_filtering.zip
7.30 GB
-
Sorted_BAM_files_for_heterozygosity_and_ploidy.zip
8.40 GB
-
Supplemental_Figure_S1.pdf
81.68 KB
-
Supplemental_Figure_S2.pdf
72.10 KB
-
Supplemental_Figure_S3.pdf
150.68 KB
-
Supplemental_Figure_S4.pdf
143.98 KB
-
Supplemental_Table_1.xlsx
11.86 KB
-
Supplemental_Table_2.xlsx
12.87 KB
-
Supplemental_Table_3.xlsx
271.63 KB
-
SVDquartets.zip
579.68 KB
Abstract
The legume genus, Glycine, which includes the Asian annual cultivated soybean, also includes a group of Australian perennial species comprising the subgenus Glycine. Because the subgenus Glycine represents the tertiary gene pool for one of the world’s most important crops, the group has been the target of collection and study for decades, resulting in a steady growth in the number of formally recognized species, from six in the 1970s to over 20 at present, as well as a number of additional informal taxa. These studies have also produced a system of nuclear diploid “genome groups” corresponding to clades in molecular phylogenies. The aptly named G. remota is known only from a single isolated population in the Kimberley region of northwestern Australia and was named only in 2015. The species is unique within Glycine in having unifoliolate leaves; its discoverers hypothesized that G. remota, if diploid, is related to species of the I-genome that are also native to the Kimberley region. We produced low-coverage short-read genome sequencing data from an herbarium specimen of G. remota. Genome size estimates from the sequencing data suggest that G. remota is a diploid, while ploidy estimation is inconclusive likely due to the history of whole genome duplication in Glycine. Phylogenomic analyses of genome-wide SNPs, as well as phylogenetic analyses of the low copy nuclear gene (histone H3D), the entire ribosomal RNA cistron, and the internal transcribed spacer all placed the species unequivocally in the diploid I-genome clade. A complete plastome sequence was also generated and its placement with a plastome phylogeny is also consistent with membership in the I-genome.
Methods
GBS-SNPs called wtih GATK and filtered with VCFtools
Fasta files aligned with MAFFT
Maximum likelihood phylogeny inference with RAxML
SVDquartets in PAUP*
ABBA-BABA in D-suite