Description of the datafile:

StudyNr: Unique identifier for each sample.
2A - 4C: Microsatellite markers STRAf 2A through 4C, respectively. Numbers indicate the number of repeats (ref. de Valk et al., 2005).
n.3: Pos/Neg corresponds to the presence/absence of a specific 1-nt deletion in the flanking region of microsatellite marker 4A (ref. de Valk et al., 2005).
ANXC4: Numbers indicate the allelic variant found at MLST marker ANXC4 (ref. Bain et al., 2007).
BGT1: Numbers indicate the allelic variant found at MLST marker BGT1 (ref. Bain et al., 2007).
CSPtype: Indicates the allelic variant of the CSP gene (ref. Klaassen et al., 2009).
RM1 - RM7: Alleles found at recombination markers 1 through 7, respectively. Allele A corresponds to the allele found in Af293. Allele B corresponds to the allele found in A1163. Allele C corresponds to an as yet unidentified allele. Allele O indicates the absence of an allele for that marker. RM1 corresponds to the mating type locus.
POPULATION: Indicates to which of the five populations each genotype belongs.
Clin/Env: Indicates the origin of the sample. Clin = clinical specimen, Env = environmental specimen.
TR/L98H: Pos/Neg indicates whether the sample contains the TR/L98H genotype or not (ref. Klaassen et al., 2010).
City: City where the sample was collected.
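
The coded columns described above (RM1 - RM7, n.3, Clin/Env, TR/L98H) can be checked mechanically before analysis. The following is a minimal sketch in Python, not part of the original dataset. It assumes the datafile is tab-delimited with a header row whose column names match the names used in this description. The function name find_invalid_codes and the sample rows are hypothetical and for illustration only.

```python
import csv
import io

# Allowed codes, taken from the column descriptions above.
RM_COLUMNS = [f"RM{i}" for i in range(1, 8)]  # RM1 .. RM7
ALLOWED = {
    **{col: {"A", "B", "C", "O"} for col in RM_COLUMNS},
    "n.3": {"Pos", "Neg"},
    "Clin/Env": {"Clin", "Env"},
    "TR/L98H": {"Pos", "Neg"},
}


def find_invalid_codes(rows):
    """Return (StudyNr, column, value) for each value outside the documented codes.

    Columns that are missing from a row are skipped rather than reported.
    """
    problems = []
    for row in rows:
        for column, allowed in ALLOWED.items():
            value = row.get(column)
            if value is not None and value.strip() not in allowed:
                problems.append((row.get("StudyNr"), column, value))
    return problems


# Made-up rows for illustration only; these are NOT values from the dataset.
# Assumption: tab-delimited text with a header row.
SAMPLE = (
    "StudyNr\tn.3\tRM1\tRM2\tClin/Env\tTR/L98H\n"
    "S1\tNeg\tA\tB\tClin\tPos\n"
    "S2\tPos\tX\tO\tEnv\tNeg\n"
)

rows = list(csv.DictReader(io.StringIO(SAMPLE), delimiter="\t"))
problems = find_invalid_codes(rows)
print(problems)  # [('S2', 'RM1', 'X')]
```

To run the same check on the real file, replace io.StringIO(SAMPLE) with an open file handle and adjust the delimiter if the file is not tab-delimited.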