This data set consists of two parts. Genotype of nSSR is coded with fragment size. Haplotype of cpSSR is coded with capital letter. In all cases, missing data is coded with “NA”. ############################################################# # 1) original_data ############################################################# This directory contains raw data sets of adults and seeds. a) adult27loci170113.txt 27 nSSR genotype and cpSSR haplotype data for 307 adults ID: ID number from 1 to 307 SampleName: individual name Haplotype: cpSSR haplotype stm0002a: first allele of locus stm0002 (a) stm0002b: second allele of locus stm0002 (b) And so on b) genotype_19loci_and_ecological_data170113.csv 19 nSSR genotype, cpSSR haplotype data, results of Newhybrids, assigned genealogical class and other ecological data for 307 adults ID: ID number from 1 to 307 SampleName: individual name Haplotype: cpSSR haplotype stm0002a: first allele of locus stm0002 (a) stm0002b: second allele of locus stm0002 (b) And so on MST: posterior probability assigned to M. stellata estimated by NewHybrids MSA: posterior probability assigned to M. salicifolia estimated by NewHybrids F1: posterior probability assigned to F1 estimated by NewHybrids F2: posterior probability assigned to F2 estimated by NewHybrids BxMST: posterior probability assigned to backcross with M. stellata estimated by NewHybrids BxMSA: posterior probability assigned to backcross with M. salicifolia estimated by NewHybrids MaxP: maximum posterior probability HC19: genealogical class determined using 19 loci information Mother: whether the individual was maternal tree or not (TRUE or FALSE) Flower: whether the individual flowered or not (1, 0 or NA) MaxDBH: maximum DBH (cm) within genets X: X coordinate (m) Y: Y coordinate (m) c) seed15loci161213.txt 15 nSSR genotype for 580 seeds In paternity analysis, we used only 574 seeds that were able to be typed at 7 or more loci of all 12 loci. SampleName: seed sample name (_S) stm0002a: first allele of locus stm0002 (a) stm0002b: second allele of locus stm0002 (b) And so on ############################################################# # 2) gene_dispersal_model ############################################################# This directory contains scripts and data sets for gene dispersal modeling. a) ./data Data sets for modeling i) adult_data_for_stan170113.csv SampleName: individual name Mother: whether the individual was maternal tree or not (TRUE or FALSE) DBH: maximum DBH (cm) within genets HC: assigned genealogical class ii) distance_for_stan170113.csv Distance matrix (m) Dimension is number of maternal trees (37) x number of candidate fathers (238). The order is the same as “adult_data_for_stan170113.csv”. iii) outcrossed_seeds_for_stan170113.csv Number of outcrossed seeds data Dimension is number of maternal trees (37) x number of candidate fathers (238). b) ./model Model file for stan c) R scripts i) r170613NM_4GC_random_free_c_diff_b.r Run stan from R and make trace plots and result object into ./figs and ./output directories, respectively. ii) r170613summarize_NM_free_c_diff_b.r Summarize posteirors. iii) r170613make_figs_b.r Make figure for b parameters. iv) r170613calc_pairwise_diff.r Calculate pairwise differences between c parameters. v) r170613make_figs_free_c_diff_b.r Make figure for c parameters.