Assessing gene flow between Dicranum scoparium Hedw. and D. bonjeanii De Not. (Dicranaceae) using single nucleotide polymorphisms (SNPs)
Data files
Nov 21, 2024 version files 68.22 KB
-
README.md
3.01 KB
-
Sequencing_data_Dicranum_scoparium_and_Dicranum_bonjeanii.xlsx
65.21 KB
Abstract
While hybridization in vascular plants has received considerable attention, hybridization in bryophytes is still relatively understudied. Here we investigate hybridization between two species from the moss genus Dicranum. Hedw. Dicranum scoparium Hedw. and D. bonjeanii De Not. are two moss species of the D. scoparium species complex with partially overlapping morphology and habitat ranges. This study aimed to investigate a potential hybridization pathway between the two species by using single nucleotide polymorphisms (SNP), morphologically identifying the sex of the specimens and analyzing potential sex-specific SNP-markers, focusing on southern Sweden. The species differ by D. scoparium being polymorphic and D. bonjeanii monomorphic for the used SNP-markers. The SNP-markers genetically separate D. scoparium and D. bonjeanii specimens, though admixture between the species was observed on a limited scale. This admixture appears to originate from unidirectional fertilization of D. bonjeanii by D. scoparium (with a genome skewed towards D. scoparium as a result), possibly followed by back-crossing of first-generation hybrids with D. scoparium. Male expressing specimens were completely absent in the D. bonjeanii sample, making a fertilization of D. bonjeanii by males of D. scoparium more likely. No sex-specificity was observed in the used SNP-markers.
https://doi.org/10.5061/dryad.zpc866tg6
The dataset contains SNP data of Dicranum scoparium Hedw. and Dicranum bonjeanii De Not. The excel file consists of several tabs with information about the samples, used genetic markers and the sequencing results. The file contains the results of 188 samples. Three of these samples were not used in further genetic analyses. One of the samples failed to give results (3212, this one was not usable in further analyses), one turned out to be a duplicate of another sample in the dataset and one turned out to be a species other than one of the two species of interest. The results of these samples are included in the dataset, but are marked yellow in the “Data” tab.
Results were retreived by selecting SNP markers from Lang et al. (2021), specific for distinguishing between the two species of interest. Additional markers were added with a less extreme affinity for either one of the species.
Description of the data and file structure
The excel file contains 5 tabs, “Information”, “summary”, “Maf_confl_samples”, “Maf_freq_all” and “Data”. The tabs contain the following information:
- Information > This tab contains general information regarding the project, the sequencer, the sequencing institution, the used plates and the accuracy of the sequencing.
- Summary > This tab contains stats about the dataset and more importantly, the used genetic markers. The markers are indicated with SNP_# in which # is a unique identifier for that specific marker. For each marker, succes rates are given together with several other stats and comments and recommendations from the sequencing institution. Failed samples are written down underneath the table.
- Maf_confl_samples > This tab does not contain data, because there were no conflicts within the sequencing results.
- Maf_freq_all > This tab contains information about the alleles of each of the different genetic markers. The tab contains allele frequences for each of the different alleles, the total amount of alleles and the amount of homozogytes for each of the alleles. Nota bene! The tissue that was sequenced was haploid.
- Data > This tab contains the sequencing results of each of the sequenced samples. For each of the samples the result per genetic marker is given. NA’s are noted with 0 0. The results given in this tab were used for further cleaning and analyses. When a . was used in the original sample name, this was altered to _ . Spaces were also changed into_. The earlier mentioned sample that did not work, the duplicate sample, and the sample that belonged to another taxon, are marked yellow in this tab.
References
Lang AS, Gehrmann T, Cronberg N, Genetic Diversity and Population Structure in Bryophyte With Facultative Nannandry. Frontiers in Plant Science 2021; 12: 517547
Moss samples of the species Dicranum scoparium Hedw. and Dicranum bonjeanii De Not. were collected in the most southern province of Sweden, Scania. DNA was extracted by using the Qiagen DNeasy Plant mini kit (www.qiagen.com) and an adapted version of the manifacturer's protocol. SNPs were selected from markers retrieved by Lang, Gehrmann and Cronberg (2021).
The samples included in the sequencing were those collected in Sweden as well as older extracts from Lang & Stech (2014) that were stored at Naturalis Biodiversity Center (NL). Sequencing was executed at the Mutation Analysis Facility (Karolinska University Hospital), with an Agena Massarray Analyzer.
References:
- Lang AS, Stech M, What’s in a Name? Disentangling the Dicranum scoparium species complex (Dicranaceae, Bryophyta). Systematic Botany 2014; 39: 369-379
- Lang AS, Gehrmann T, Cronberg N, Genetic Diversity and Population Structure in Bryophyte With Facultative Nannandry. Frontiers in Plant Science 2021; 12: 517547