The mimetic butterflies Heliconius erato and H. melpomene have undergone parallel radiations to form a near-identical patchwork of over 20 different wing pattern races across the Neotropics. Previous molecular phylogenetic work on these radiations has suggested that similar but geographically disjunct color patterns arose multiple times independently in each species. The neutral markers used in these studies, however, can move freely across color pattern boundaries and therefore might not represent the history of the adaptive traits as accurately as markers linked to color pattern genes. To assess this, we compared relationships among races within H. erato and within H. melpomene using a series of unlinked genes, genes linked to color pattern loci, and optix - a gene recently shown to control red color pattern variation. We found that while unlinked genes partition populations by geographic region, optix had a different history, structuring lineages by red color patterns and supporting a single origin of red-rayed patterns within each species. Genes closely linked (80-250 KB) to optix exhibited only weak associations with color pattern. This study empirically demonstrates the necessity of examining phenotype-determining genomic regions to understand the history of adaptive change in rapidly radiating lineages. With these refined relationships, we resolve a long-standing debate about the origins of the races within each species, supporting the hypothesis that the red-rayed Amazonian pattern evolved recently and expanded, causing disjunctions of more ancestral patterns.
2654eratoPHASED
DNA sequence data for a Proteasome 54kD subunit protein-coding sequence (referred to as 2654 in the manuscript) for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
2654melpomenePHASED
DNA sequence data for a Proteasome 54kD subunit protein-coding sequence (referred to as 2654 in the manuscript) for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
bveseratoPHASED
DNA sequence data for a partial protein-coding sequence of the gene bves for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
bvesmelpPHASED
DNA sequence data for a partial protein-coding sequence of the gene bves for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
CATeratoPHASED
DNA sequence data for a partial protein-coding sequence of the gene Catalase (Cat) for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
CATmelpomenePHASED
DNA sequence data for a partial protein-coding sequence of the gene Catalase (Cat) for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
GPCReratoPHASED
DNA sequence data for a partial protein-coding sequence of the gene G protein coupled receptor for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
GPCRmelpomenePHASED
DNA sequence data for a partial protein-coding sequence of the gene G protein coupled receptor for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
kin_melpPHASED
DNA sequence data for the gene kinesin for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
kineratoPHASED
DNA sequence data for the gene kinesin for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
mterato
DNA sequence data for a sequenced fragment including mitochondrial COI-tRNALeu-COII for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing. Sequences can also be obtained in Genbank. Phylogenetic trees from neighbor-joining and Bayesian analyses are attached as a treeblock.
mtmelpomene
DNA sequence data for a sequenced fragment including mitochondrial COI-tRNALeu-COII for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing. Sequences can also be obtained in Genbank. Phylogenetic trees from neighbor-joining and Bayesian analyses are attached as a treeblock.
SUMOHeratoPHASED
DNA sequence data for the gene SUMO-1 activating enzyme (sae-1) for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
SUMOHmelpomenePHASED
DNA sequence data for the gene SUMO-1 activating enzyme (sae-1) for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
Suz12eratoPHASED
DNA sequence data for the gene Su(z)12 for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
Suz12melpomenePHASED
DNA sequence data for the gene Su(z)12 for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
VanGogheratoPHASED
DNA sequence data for the gene VanGogh for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
VanGoghmelpomenePHASED
DNA sequence data for the gene VanGogh for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. The file includes a treeblock containing a neighbor-joining tree constructed from this data using PAUP* and used in the study.
optixeratoBayes
DNA sequence data for a 3'UTR plus a patial protein coding sequence of the gene optix for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. Identical haplotypes were consolidated into one haplotype sequence. The resulting Bayesian tree is attached as a treeblock.
optixmelpPHASEDHaplos
DNA sequence data for a 3'UTR plus a patial protein coding sequence of the gene optix for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. Identical haplotypes were consolidated into one haplotype sequence. The resulting Bayesian tree is attached as a treeblock.
optixeratoPHASEDNJ
DNA sequence data for a 3'UTR plus a patial protein coding sequence of the gene optix for multiple individuals for multiple races of Heliconius erato. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. A neighbor-joining tree inferred using PAUP* is attached as a treeblock.
optixmelpPHASEDNJ
DNA sequence data for a 3'UTR plus a patial protein coding sequence of the gene optix for multiple individuals for multiple races of Heliconius melpomene. The data is derived from direct sequencing, inferring polymorphisms from double peaks in the chromatograms. Haplotypes were determined by analytically phasing the data across all sequences using the program PHASE. Inferred sequence before phasing can be obtained in Genbank. A neighbor-joining tree inferred using PAUP* is attached as a treeblock.