Skip to main content

Impatiens glandulifera SNP and SilicoDArT genotyping data

Cite this dataset

Korpelainen, Helena (2022). Impatiens glandulifera SNP and SilicoDArT genotyping data [Dataset]. Dryad.


We conducted genomic characterization based on SNP and SilicoDArT markers on the invasive Himalayan balsam (Impatiens glandulifera) plants originating from the native and non-native regions of their distribution. When genetic relationships were explored by PCoA based on SNP and SilicoDArT marker data, the first, second and third principal coordinates explained altogether 37.4% and 31.0% of the variability, respectively. Samples from the UK, Canada and Pakistan grouped together, while Indian plants were clearly distinct based on SNP markers but relatively close to the UK-Canada-Pakistan group based on SilicoDArT markers. Constructed trees differentiated the individuals into clusters resembling the patterns observed by PCoA. The Bayesian BAPS analysis revealed that the individuals were distributed in seven clusters, representing samples from each of the four Finnish populations, India, Pakistan and the combination of the UK and Canada. Similar clustering was visible in the constructed UPGMA tree. The Indian cluster did not display any ancestral gene flow with the others, while the Pakistani cluster showed ancestral gene flow only with the combined UK and Canada cluster. Furthermore, the latter cluster displayed ancestral gene flow with the Finnish populations varying from 0% to 3.1%. The AMOVA analysis showed that 45% and 26% of genetic variation was present among the I. glandulifera groups/populations and the rest within them based on SNP and SilicoDArT markers, respectively. Overall, the Bayesian BAPS analysis and the following gene flow network were the most informative tools for resolving relationships among native and introduced plants.