Data for genetic characterization and curation of diploid a-genome wheat species
Data files
Dec 06, 2021 version files 139.78 MB
-
A_genome_core_total_Segregating.SNPs.txt
3.42 MB
-
A-genome_and_wheat_Filtered.SNPs.data.for.clustring.hmp.zip
4.65 MB
-
Agenome_population.structure.input.file.for.FastStructure.str.f.str.zip
9.33 MB
-
Agenome.sample_info.txt
8.83 KB
-
All_Agenome_Fisher.Passed.hmp.txt.zip
11.08 MB
-
All_Agnome_curated.sample.SNPmatrix.for.phylogenetic.cluster_PCA.txt.zip
3.22 MB
-
All_sample_SNP.matrix.for.genetically.identical.accession.hmp.txt
28.84 MB
-
All_sample_SNPs.diversity.computation.hmp.txt
27.06 MB
-
Key_file_WGRC_Diploid_A-Genome_Wheat_Species_Accessions.xlsx
76.79 KB
-
Readme.pdf
154.26 KB
-
SRA_Fastq_Files_WGRC_Diploid_A-Genome_Wheat_Species_Accessions.xlsx
9.56 KB
-
sub.aegilopoides_total_Segregating.SNPs.txt
10.59 MB
-
sub.aeiglopoides.input.Genocore.het.masked.SNPs.file.csv
19.13 MB
-
sub.aeiglopoides.input.Genocore.het.masked.SNPs.matrix.csv.zip
860.99 KB
-
sub.mono_total_Segregating.SNPs.txt
1.73 MB
-
sub.monococcum.input.Genocore.het.masked.SNPs.matrix.csv
1.49 MB
-
T.monococcum_total_Segregating.SNPs.txt
13.77 MB
-
T.monococcum.imputed.vcf.input.to.compute.FST.for.selection.signature.hmp.txt.zip
748.78 KB
-
T.urartu_total_Segregating.SNPs.txt
1.90 MB
-
T.urartu.input.Genocore.het.masked.SNPs.matrix.csv
1.70 MB
Abstract
Diploid A-genome relatives of wheat comprises T. urartu, T. monococcum subsp. monococcum (domesticated einkorn) and T. monococcum subsp. aegilopoides (wild einkorn). About 930 accessions of A-genome diploid wheat species preserved in the gene bank of the Wheat Genetics Resource Center (WGRC) at Kansas State University were genotyped using genotyping-by-sequencing (GBS). We constructed four pooled GBS libraries (384- and 288-plex) using restriction enzymes (Pst1-Msp1) combinations and the libraries were sequenced on the Illumina platform. The sequence data was processed using Tassel 5 GBS v2 pipeline and identified thousands of single nucleotide polymorphisms (SNPs) for downstream genetic and genomic dissections of the tested population. We have used T. urartu pseudomolecule (Tu2.0) as a reference genome to detect the genetic markers. Four fastq files with raw sequence reads information can be obtained at the National Center for Biotechnology Information (NCBI) SRA database with the BioProject accession PRJNA744683 (https://www.ncbi.nlm.nih.gov/sra/PRJNA744683). Provided key file has information for demultiplexing including flowcell, lane number, barcodes and sample names to replicate the analysis. This experiment led us to curate the gene bank through identification of genetically duplicated accessions, miss-classified accessions and miss-classified non-diploid accessions. We were able to observe the unique genetic and evolutionary relationships among the diploid A-genome wheat species along with the unique population structures per species and sub-species.
Methods
The genotyping by sequencing (GBS) was performed inorder to genotype wild wheat accessions. We obtained thousands of SNP markers from the sequencing work which was used to characterized the accessions.