A plant virus differentially alters DNA methylation in two cryptic species of a hemipteran vector
Data files
Jul 15, 2024 version files 13.38 GB
-
all_data_filtered_gene_body.csv.gz
783.50 KB
-
all_data_filtered_promoter.csv.gz
1.36 MB
-
BC1.CpG_report.txt.gz
223.70 MB
-
BC2.CpG_report.txt.gz
224.87 MB
-
BC3.CpG_report.txt.gz
223.66 MB
-
BC4.CpG_report.txt.gz
223.32 MB
-
BC6.CpG_report.txt.gz
224.42 MB
-
BV1.CpG_report.txt.gz
222.35 MB
-
BV2.CpG_report.txt.gz
221.73 MB
-
BV3.CpG_report.txt.gz
223.20 MB
-
BV4.CpG_report.txt.gz
222.97 MB
-
BV5.CpG_report.txt.gz
223.71 MB
-
MEAM1_3.28.23.smk.gz
4.89 GB
-
MEAM1_cds_IDs.txt.gz
1.06 MB
-
MEAM1_Feature_Report.txt.gz
601.63 KB
-
MEAM1_promoter_IDs.txt.gz
183.35 KB
-
MEAM1_scaffold_v1.2.fa.mod.EDTA.intact.gff3.gz
490.03 KB
-
MEAM1_scaffold_v1.2.fa.mod.EDTA.TElib.fa.gz
1.77 MB
-
MEAM1_trend_cds_5000_2.txt.gz
12.48 KB
-
MEAM1_trend_TE_5000_2.txt.gz
12.64 KB
-
MED_3.28.23.smk.gz
3.84 GB
-
MED_cds_IDs.txt.gz
1.14 MB
-
MED_cds_v1_0.box
180.12 MB
-
MED_Feature_Report.txt.gz
713.53 KB
-
MED_genome_v1.0.fa.mod.EDTA.intact.gff3.gz
520.14 KB
-
MED_genome_v1.0.fa.mod.EDTA.TElib.fa.gz
1.90 MB
-
MED_promoter_IDs.txt.gz
300.23 KB
-
MED_trend_cds_5000_2.txt.gz
12.47 KB
-
MED_trend_TE_5000_2.txt.gz
12.73 KB
-
QC1.CpG_report.txt.gz
223.43 MB
-
QC2.CpG_report.txt.gz
220.29 MB
-
QC3.CpG_report.txt.gz
221.57 MB
-
QC4.CpG_report.txt.gz
220.13 MB
-
QC5.CpG_report.txt.gz
223.33 MB
-
QV1.CpG_report.txt.gz
222.79 MB
-
QV2.CpG_report.txt.gz
222.22 MB
-
QV3.CpG_report.txt.gz
223.33 MB
-
QV4.CpG_report.txt.gz
221.63 MB
-
QV5.CpG_report.txt.gz
224.49 MB
-
README.md
3.41 KB
-
unique_combinations.tsv.gz
1.06 KB
Abstract
This study investigated DNA methylation patterns in two cryptic species (B and Q) of the sweet potato whitefly, Bemisia tabaci (Gennadius), following the acquisition of the tomato yellow curl virus, a single-stranded DNA virus. The methylation levels in genomic features such as promoters, gene bodies, and transposable elements in both cryptic species were described in this study. While overall trends were found to be similar, specific differences in methylation levels were observed. Virus-induced differentially methylated regions (DMRs) were associated with different genes in each cryptic species and were negatively correlated with differential gene expression. These DMRs were analyzed for changes in gene expression and alternative splicing, revealing clusters of hyper- and hypomethylated genes related to virus-vector interactions, immune functions, and detoxification processes. These methylation differences may help explain the distinct biological and physiological traits observed between the B and Q cryptic species.
https://doi.org/10.5061/dryad.jdfn2z3jn
Description of the data and file structure
The deposited dataset includes methylation calling files from whitefly B and Q cryptic species, from non-viruliferous and viruliferous samples.
The OmicsBox (.box) file contains gene annotations for Q cryptic species and can be accessed using OmicsBox.
Methylation reports are formatted as , see https://felixkrueger.github.io/Bismark/bismark/methylation_extraction/ for more information.
SeqMonk files (.smk) with calls for methylation can be accessed using SeqMonk.
Transposable element annotations have been deposited and can be found in the files with "EDTA" in their file names, see https://github.com/oushujun/EDTA for more information.
Files "all_data_filtered_promoter.csv.gz" and "all_data_filtered_gene_body.csv.gz" contain the Position, ID, Methylation level (units: percent methylation within a genomic feature), CpGoe, Group, Sequence, and Description for promoters and gene bodies, respectively.
Code/Software
Custom R scripts with comments have been deposited. The scripts are named after their associated figures. All input files can also be found within the supplementary tables or additional files as called in the scripts. Files can be extracted from their compressed format (.gz) using gunzip.
Acronyms
- MEAM1 or B: Middle East Asia Minor 1 (B cryptic species of Bemisia tabaci)
- MED or Q: Mediterranean (Q cryptic species of Bemisia tabaci)
- c: control/non-viruliferous
- v: viruliferous
- CpG: Cytosine-phosphate-Guanine (DNA methylation)
- cds: Coding Sequence
- EDTA: Extensive de-novo Transposable Element Annotator (a tool for identifying transposable elements)
- TE: Transposable Element
- LTR: Long Terminal Repeat
- SINE: Short Interspersed Nuclear Element
- LINE: Long Interspersed Nuclear Element
- MITEs: Miniature Inverted-repeat Transposable Elements
- TIR: Terminal Inverted Repeat
- DTA: A superfamily of DNA transposons named after hobo, Ac, and Tam3 elements (hAT)
- DTC: A superfamily of DNA transposons also known as CACTA
- DTH: Transposable elements belonging to the PIF/Harbinger superfamily
- DTM: A superfamily of DNA transposons also known as Mutator
- DTT: A superfamily of transposable elements also known as Tc1/Mariner
- Helitron: A type of rolling-circle transposable element
File Types
- gff3: General Feature Format version 3 (a file format for describing genes and other features of DNA, RNA, and protein sequences)
- fa: FASTA (a text-based format for nucleotide sequences)
- box: Contains gene annotations (File format compatible with OmicsBox)
- smk: Contains methylation data (File format comparable with SeqMonk)
- csv: Comma-Separated Values
- tsv: Tab-Separated Values
- gz: Gzip (a file compression format)
This dataset was generated from bisulfite-converted sequencing of DNA from non-viruliferous and viruliferous whiteflies.