A pan-cetacean MHC amplicon sequencing panel developed and evaluated in combination with genome assemblies
Data files
Jan 05, 2024 version files 1.56 GB
-
Bac-16.fastq
3.80 MB
-
Bac-QLM.fastq
712.56 KB
-
Bamu-U18-0236.fastq
56.29 MB
-
Bed-31-1.fastq
582.76 KB
-
Bed-34-1.fastq
42.22 MB
-
Che11-CB124.fastq
4.16 MB
-
Che12-CB043.fastq
35.41 MB
-
Dde-M1.fastq
29.84 MB
-
Euau-20-AI-022.fastq
3.02 MB
-
Euau-20-AI-030.fastq
9.98 MB
-
Euau-20-AI-031.fastq
4.42 MB
-
Euau-20-AI-042.fastq
6.15 MB
-
Euau-20-AI-055.fastq
2.31 MB
-
Euau-20-AI-089.fastq
4.72 MB
-
Euau-20-AI-097.fastq
4.90 MB
-
Euau-20-AI-100.fastq
1.55 MB
-
Euau-20-AI-108.fastq
8.55 MB
-
Euau-20-AI-116.fastq
27.30 MB
-
Euau-20-AI-117.fastq
21.46 MB
-
Euau-20-AI-127.fastq
7.49 MB
-
Euau-20-AI-128.fastq
7.51 MB
-
Euau-20-AI-147.fastq
5.38 MB
-
Euau-20-AI-149.fastq
6.38 MB
-
Euau-20-AI-159.fastq
5.69 MB
-
Euau-20-AI-161.fastq
8.38 MB
-
Euau-20-AI-164.fastq
9.84 MB
-
Euau-20-AI-170.fastq
11.19 MB
-
Euau-20-AI-171.fastq
9.74 MB
-
Euau-20-AI-196.fastq
6.46 MB
-
Euau-20-AI-203.fastq
13.53 MB
-
Euau-20-AI-207.fastq
10.83 MB
-
Euau-20-AI-208.fastq
9.42 MB
-
Euau-20-AI-209.fastq
5.96 MB
-
Euau-20-AI-210.fastq
2.58 MB
-
Euau-20-AI-211.fastq
5.80 MB
-
Euau-20-AI-212.fastq
12.71 MB
-
Euau-20-AI-213.fastq
691.06 KB
-
Euau-20-AI-221.fastq
2.66 MB
-
Glo93.fastq
7.07 MB
-
Glo99.fastq
28.06 MB
-
Grgr-M1.fastq
30.82 MB
-
Grgr-U19-016.fastq
15.95 MB
-
Kbr-106.fastq
7.67 MB
-
Ksi-NC05-111.fastq
32.91 MB
-
Lob06.fastq
14.07 MB
-
Lob07.fastq
47.57 MB
-
Mede1.fastq
27.54 MB
-
Mede2.fastq
40.80 MB
-
Megr_U18-013.fastq
10.64 MB
-
Megr-U17-093.fastq
36.40 MB
-
Meno-20-NZ12A.fastq
13.59 MB
-
Meno-NC-00-026.fastq
54.45 MB
-
Meno-NC-00-036.fastq
9.37 MB
-
Meno-NC-01-041.fastq
20.09 MB
-
Meno-NC-01-078.fastq
26.24 MB
-
Meno-NC-01-080.fastq
11.63 MB
-
Meno-NC-01-121.fastq
24.92 MB
-
Meno-NC-01-123.fastq
34.36 MB
-
Meno-NC-06-020.fastq
39.55 MB
-
Meno-NC-06-076.fastq
15.97 MB
-
Meno-NC-07-092.fastq
31.94 MB
-
Meno-NC-07-093.fastq
42.54 MB
-
Meno-NC-07-148.fastq
30.99 MB
-
Meno-NC-09-050.fastq
19.31 MB
-
Meno-NC-09-065.fastq
37.56 MB
-
Meno-NC-09-149.fastq
5.52 MB
-
Meno-NC-09-162.fastq
32.06 MB
-
Meno-NC-10-046_and_10-055.fastq
17.65 MB
-
Meno-NC-11-162.fastq
34.97 MB
-
Meno-NC-11-239.fastq
34.74 MB
-
Meno-NC-11-259.fastq
10.56 MB
-
Meno-NC-95-002.fastq
23.06 MB
-
Meno-NC-96-032.fastq
24.69 MB
-
Meno-NC-96-038.fastq
22.75 MB
-
Meno-NC-97-022.fastq
26.15 MB
-
Meno-NC-99-029.fastq
14.96 MB
-
Meu-SW7444.fastq
16.72 MB
-
README.md
461 B
-
Sbr-04-FP03.fastq
26.53 MB
-
Sbr-07_SA03.fastq
9.62 MB
-
Sco-M1.fastq
31.07 MB
-
Ttr05-BOI106.fastq
14.82 MB
-
TtrJB04-05.fastq
30.10 MB
-
TtrU18-059.fastq
13.72 MB
-
Zca-NZ12-99.fastq
35.37 MB
-
Zica-U19-001.fastq
15.49 MB
Feb 26, 2024 version files 1.56 GB
-
assumed_non-functional_DRB_alleles.fastq
10.32 KB
-
Bac-16.fastq
3.80 MB
-
Bac-QLM.fastq
712.56 KB
-
Bamu-U18-0236.fastq
56.29 MB
-
Bed-31-1.fastq
582.76 KB
-
Bed-34-1.fastq
42.22 MB
-
Che11-CB124.fastq
4.16 MB
-
Che12-CB043.fastq
35.41 MB
-
class_I_alleles.fastq
128.69 KB
-
Dde-M1.fastq
29.84 MB
-
Euau-20-AI-022.fastq
3.02 MB
-
Euau-20-AI-030.fastq
9.98 MB
-
Euau-20-AI-031.fastq
4.42 MB
-
Euau-20-AI-042.fastq
6.15 MB
-
Euau-20-AI-055.fastq
2.31 MB
-
Euau-20-AI-089.fastq
4.72 MB
-
Euau-20-AI-097.fastq
4.90 MB
-
Euau-20-AI-100.fastq
1.55 MB
-
Euau-20-AI-108.fastq
8.55 MB
-
Euau-20-AI-116.fastq
27.30 MB
-
Euau-20-AI-117.fastq
21.46 MB
-
Euau-20-AI-127.fastq
7.49 MB
-
Euau-20-AI-128.fastq
7.51 MB
-
Euau-20-AI-147.fastq
5.38 MB
-
Euau-20-AI-149.fastq
6.38 MB
-
Euau-20-AI-159.fastq
5.69 MB
-
Euau-20-AI-161.fastq
8.38 MB
-
Euau-20-AI-164.fastq
9.84 MB
-
Euau-20-AI-170.fastq
11.19 MB
-
Euau-20-AI-171.fastq
9.74 MB
-
Euau-20-AI-196.fastq
6.46 MB
-
Euau-20-AI-203.fastq
13.53 MB
-
Euau-20-AI-207.fastq
10.83 MB
-
Euau-20-AI-208.fastq
9.42 MB
-
Euau-20-AI-209.fastq
5.96 MB
-
Euau-20-AI-210.fastq
2.58 MB
-
Euau-20-AI-211.fastq
5.80 MB
-
Euau-20-AI-212.fastq
12.71 MB
-
Euau-20-AI-213.fastq
691.06 KB
-
Euau-20-AI-221.fastq
2.66 MB
-
Glo93.fastq
7.07 MB
-
Glo99.fastq
28.06 MB
-
Grgr-M1.fastq
30.82 MB
-
Grgr-U19-016.fastq
15.95 MB
-
Kbr-106.fastq
7.67 MB
-
Ksi-NC05-111.fastq
32.91 MB
-
Lob06.fastq
14.07 MB
-
Lob07.fastq
47.57 MB
-
Mede1.fastq
27.54 MB
-
Mede2.fastq
40.80 MB
-
Megr_U18-013.fastq
10.64 MB
-
Megr-U17-093.fastq
36.40 MB
-
Meno-20-NZ12A.fastq
13.59 MB
-
Meno-NC-00-026.fastq
54.45 MB
-
Meno-NC-00-036.fastq
9.37 MB
-
Meno-NC-01-041.fastq
20.09 MB
-
Meno-NC-01-078.fastq
26.24 MB
-
Meno-NC-01-080.fastq
11.63 MB
-
Meno-NC-01-121.fastq
24.92 MB
-
Meno-NC-01-123.fastq
34.36 MB
-
Meno-NC-06-020.fastq
39.55 MB
-
Meno-NC-06-076.fastq
15.97 MB
-
Meno-NC-07-092.fastq
31.94 MB
-
Meno-NC-07-093.fastq
42.54 MB
-
Meno-NC-07-148.fastq
30.99 MB
-
Meno-NC-09-050.fastq
19.31 MB
-
Meno-NC-09-065.fastq
37.56 MB
-
Meno-NC-09-149.fastq
5.52 MB
-
Meno-NC-09-162.fastq
32.06 MB
-
Meno-NC-10-046_and_10-055.fastq
17.65 MB
-
Meno-NC-11-162.fastq
34.97 MB
-
Meno-NC-11-239.fastq
34.74 MB
-
Meno-NC-11-259.fastq
10.56 MB
-
Meno-NC-95-002.fastq
23.06 MB
-
Meno-NC-96-032.fastq
24.69 MB
-
Meno-NC-96-038.fastq
22.75 MB
-
Meno-NC-97-022.fastq
26.15 MB
-
Meno-NC-99-029.fastq
14.96 MB
-
Meu-SW7444.fastq
16.72 MB
-
README.md
804 B
-
Sbr-04-FP03.fastq
26.53 MB
-
Sbr-07_SA03.fastq
9.62 MB
-
Sco-M1.fastq
31.07 MB
-
Ttr05-BOI106.fastq
14.82 MB
-
TtrJB04-05.fastq
30.10 MB
-
TtrU18-059.fastq
13.72 MB
-
Zca-NZ12-99.fastq
35.37 MB
-
Zica-U19-001.fastq
15.49 MB
Abstract
The major histocompatibility complex (MHC) is a highly polymorphic gene family that is crucial in immunity, and its diversity can be effectively used as a fitness marker for populations. Despite this, MHC remains poorly characterised in non-model species (e.g., cetaceans: whales, dolphins and porpoises) as high gene copy number variation, especially in the fast-evolving class I region, makes analyses of genomic sequences difficult. To date, only small sections of class I and IIa genes have been used to assess functional diversity in cetacean populations. Here, we undertook a systematic characterisation of the MHC class I and IIa regions in available cetacean genomes. We extracted full-length gene sequences to design pan-cetacean primers that amplified the complete exon2 from MHC class I and IIa genes in one combined sequencing panel. We validated this panel in 19 cetacean species and described 354 alleles for both classes. Furthermore, we identified likely assembly artefacts for many MHC class I assemblies based on the presence of class I genes in the amplicon data compared to missing genes from genomes. Finally, we investigated MHC diversity using the panel in 25 humpback and 30 southern right whales, including four paternity trios for humpback whales. This revealed copy-number variable class I haplotypes in humpback whales, which is likely a common phenomenon across cetaceans. These MHC alleles will form the basis for a cetacean branch of the Immuno-Polymorphism Database (IPD-MHC), a curated resource intended to aid in the systematic compilation of MHC alleles across several species, to support conservation initiatives.
README: Merged paired end Illumina reads from five MHC loci for 85 cetaceans and their class I and assumed non-functional DRB alleles
https://doi.org/10.5061/dryad.wh70rxwvb
The dataset contains 85 fastq files. Each file contains reads of amplicons from five MHC loci (DQA, DQB, DRA, DRB, and class I genes) combined across separate sequencing runs from a single cetacean. Details on individual cetacean sample abbreviations can be found in the manuscript. Reads are paired and merged with the Illumina adapter removed.
It also contains one fastq file with all class I alleles found and one fastq file with non-functional DRB alleles found. Alleles are labeled with four letter species abbreviation followed by locus designation (DRB or N for class I) and are numbered in the order they were discovered.
Methods
A total of 85 tissue samples were taken from individual animals across several cetacean species. The type of tissue were either from strandings or biopsies. Stranding samples in New Zealand were taken by the Department of Conservation New Zealand and sent to the New Zealand Cetacean Tissue Archive (NZCeTA) housed at the University of Auckland Waipapa Taumata Rau with approval from mana whenua (Māori indigenous groups). Biopsy samples from New Zealand cetaceans include two Hector’s dolphin (Chephalorhyncus hectori) (Hamner et al., 2017) and two bottlenose dolphins (T. truncatus) (Tezanos-Pinto et al., 2009). Further biopsies include two rough-toothed dolphins (Steno bredanensis) and two Blainville beaked whales (Mesoplodon densirostris) from French-Polynesia (Albertson et al., 2017; Oremus et al., 2012). Details on samples and associated permit numbers can be found in the published manuscript.
DNA was extracted from tissue samples and genomic DNA underwent PCR for five Major Histocompatibility Complex loci. PCR products were pooled for each individual, indexed with Nextera indexes supplied by IDT, and sequenced on Illumina NanoSeq and MiSeq. Each individual amplicon-pool was sequenced multiple times on different sequencing runs. The reads provided here are the paired and merged reads from several sequencing runs combined for each individual in a fastq file.