Systematic prediction of EMS-induced mutations in a sorghum mutant population
Data files
May 22, 2022 version; files total 913.80 MB
- README.txt (1.36 KB)
- Supplemental_File_S1.zip (11.97 MB)
- Supplemental_File_S10.zip (13.81 MB)
- Supplemental_File_S2.zip (552.50 MB)
- Supplemental_File_S3.zip (6.23 MB)
- Supplemental_File_S4.zip (77.93 MB)
- Supplemental_File_S5.zip (7.23 MB)
- Supplemental_File_S6.zip (10.87 KB)
- Supplemental_File_S7.zip (216.02 MB)
- Supplemental_File_S8.zip (2.62 MB)
- Supplemental_File_S9.zip (25.48 MB)
Abstract
Sorghum is a next-generation crop species with tremendous potential for the discovery of highly desirable agronomic traits. We describe an improved method for the systematic detection of EMS-induced mutations, applied to previously generated sequencing data from the M3 generation of 600 sorghum BTx623 mutants. We used both SAMtools-based and GATK-based variant-calling algorithms to demonstrate the general utility of the method. The approach also includes a clustering algorithm for detecting likely false-negative EMS-induced mutations. We detected 3,497,654 EMS-induced single nucleotide polymorphisms (SNPs) in 30,285 distinct sorghum genes, and cataloged 10,263 high-impact and 136,639 moderate-impact SNPs. We also implemented a lightweight web portal for searching the mutation database for the 600 sorghum mutants.
NGS data:
Illumina sequencing of individual mutants at 6X coverage. Sequencing data are available at the NCBI SRA (SRA accession number SRP065118).
SNP Calling: SAMtools and GATK
SNP Annotations: snpEff and SIFT4G
Gene Annotation: Phytozome gene annotation for sorghum BTx623
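
As a rough illustration of what screening variant calls for EMS-type changes might look like, the sketch below keeps only single-base substitutions of the kind EMS predominantly induces (G-to-A and C-to-T, i.e. G:C to A:T transitions). This is an assumption drawn from the well-known EMS mutation spectrum, not the filtering criteria used for this dataset; the function names and the chromosome labels in the example are hypothetical, and the input is taken to be tab-delimited VCF text such as SAMtools or GATK would produce.

```python
# Assumption: "EMS-type" means the canonical G:C -> A:T transitions,
# which appear on the reference strand as G>A or C>T.
EMS_TRANSITIONS = {("G", "A"), ("C", "T")}


def is_ems_type_snp(ref, alt):
    """Return True if REF>ALT is a canonical EMS-type transition."""
    return (ref.upper(), alt.upper()) in EMS_TRANSITIONS


def ems_snps_from_vcf_lines(lines):
    """Yield (chrom, pos, ref, alt) for biallelic EMS-type SNPs.

    `lines` is any iterable of VCF text lines. Header lines, indels,
    and multiallelic records are skipped.
    """
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # blank line or VCF header
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # malformed record
        chrom, pos, _id, ref, alt = fields[:5]
        if "," in alt:
            continue  # multiallelic site
        if len(ref) != 1 or len(alt) != 1:
            continue  # indel or other non-SNP record
        if is_ems_type_snp(ref, alt):
            yield chrom, int(pos), ref.upper(), alt.upper()


if __name__ == "__main__":
    # Hypothetical records for illustration only.
    example = [
        "##fileformat=VCFv4.2",
        "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
        "Chr01\t100\t.\tG\tA\t50\tPASS\t.",
        "Chr01\t200\t.\tA\tG\t50\tPASS\t.",
    ]
    for record in ems_snps_from_vcf_lines(example):
        print(record)
```

A real pipeline would also need to consider genotype, depth, and whether a variant is shared across many lines (which would suggest background variation rather than an induced mutation); none of that is modeled here.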