This article documents the public availability of transcriptome sequence data, assembled contigs, annotated contigs and molecular markers for Hazelnut tree (Corylus avellana).
Functionally annotated Corylus assembly
This file lists all 13,690 Coryllus avellana contigs with their annotation. We considered, for each contig (col 1), the best blast hit found in the Swissprot database (col2) and in the Non-redundant nucleotide database (col3). We associated to these contigs GO terms. GO terms IDs are listed in col4 and GO terms definition is listed in col 5 (MF: Molecular function; CC: Cellular component; BP: Biological process).
corylus_assembly_with_ALL_funcat_final.txt
Corylus avellana CAV2 transcriptomic assembly
In this file, the 13,690 CAV2 contigs sequences are listed in fasta format.
corylus_CLCBIO_meta_assembly_cdhitest_0.95.fasta
C. avellana root and ECM-root sequences with T. melanosporum similarity
Root ("Root_" prefix) and ECM-roots ("ECM_Roots_" prefix) sequences (col 1) showing similarities with the Tuber melanosporum genome are listed in this file. Each Corylus read is associated with at least one T. melanosporum scaffold (col2).
Hazel_Root_ECM_454reads_with_Tuber_hits.csv
Mapping of the Corylus avellana input sequences against the CAV2 assembly.
Alignments of the 836,641 Corylus avellana Roche sequences against the 13,690 CAV2 contigs in bam format.
mapping_all_454_renamed_on_CLCBIO_assembly.bam