Beneficial insect assemblages in floral strips and comparison fields
Data files
May 22, 2025 version files 43.99 MB
- 2019_landscape_data_MK_oct_30_2023.csv (48.40 KB)
- 2020_2021_2022_landscape_data_MK_july_31_2023.csv (143.60 KB)
- 2020_2021_2022_vegetation_data_summarized_by_transect_and_site_June_20_2024.csv (244.40 KB)
- bluevane.bee.matrix.20.21.MK.feb.19.2024.csv (121 KB)
- bluevane.insects.20.21.22.MK.feb.19.2024.csv (440.72 KB)
- bluevane.syrphid.matrix.20.21.MK.feb.19.2024.csv (38.88 KB)
- netted.bee.matrix.20.21.22.MK.feb.19.2024.csv (220.92 KB)
- netted.insects.20.21.22.MK.feb.19.2024.csv (870.64 KB)
- netted.syrphid.matrix.20.21.22.MK.feb.19.2024.csv (66.40 KB)
- pantrap.bee.matrix.20.21.22.MK.feb.19.2024.csv (1.14 MB)
- pantrap.insects.20.21.22.MK.feb.19.2024.csv (5.74 MB)
- pantrap.syrphid.matrix.20.21.22.MK.feb.19.2024.csv (447.77 KB)
- pinned_specimen_data_MK_feb_12_2024.csv (32.12 MB)
- pitfall.genus.matrix.20.21.MK.feb.19.2024.csv (119.19 KB)
- pitfall.insects.20.21.22.MK.feb.19.2024.csv (1.27 MB)
- README.md (29.86 KB)
- sample_dates_feb_16_2024.csv (655.09 KB)
- site.information.feb.7.2022.csv (269.78 KB)
Abstract
We installed thirteen floral strips next to rotationally managed agricultural fields in 2019 in Manitoba, Canada, and then monitored beneficial insects on field edges and within fields 1–3 years after establishment (2019–2022). Treatments included strip (crop fields with a floral strip located on one edge), control (crop fields with naturally occurring vegetation on the field edges), and unmanaged natural sites (no crop or enhancements). We measured ground beetle active density and Shannon diversity using pitfall traps. Bee and syrphid fly capture rates and Shannon diversity were measured with net transects, blue vane traps, and pan traps. We compared insect capture rates and diversities using generalized linear mixed-effects models and compared insect communities using permutational analysis of variance. [...] within strips in 2020, but declined to levels similar to those at control sites in 2021. Though no effect of treatment was found, bees were more abundant in strips compared to other treatments. Both bee abundance and diversity at strip sites increased over time. Syrphid abundance and diversity were not affected by treatment. Spillover of bees into adjacent blooming and non-blooming crops was twice as high in strip fields as in the control comparisons, although no differences in spillover were found between treatments for syrphids or ground beetles. Floral strips increased the abundance of pollinators within adjacent crops, including blooming canola, which may lead to increased ecosystem services within crops. Our research supports the use of floral strips in rotational agriculture to manage local insect populations.
https://doi.org/10.5061/dryad.xgxd254sh
Description of the data and file structure
Data were collected in 2019–2022 in Manitoba, Canada, within sites of three treatments in a clustered design. Sites of each treatment were located within the same landscape and were spatially separated (generally more than 1 km but less than 5 km apart). Treatments included strip: crop fields with a floral strip installed along one edge; control: crop fields without a floral enhancement and with a grassy field margin of naturally occurring vegetation; and natural: sites that did not have crops or enhancements and were unmanaged for at least 5 years prior to sampling. Some of these data (specimen occurrence records) were used in a previous manuscript (https://doi.org/10.1111/afe.12595), and the data attached here are used with a manuscript that is currently in press with the journal Agricultural and Forest Entomology: Killewald et al. 2025 “Floral strips increase the abundance and spillover of beneficial insects in rotational crop fields” (manuscript number AFE(2024)5015.R2).
Files and variables
File: site.information.feb.7.2022.csv
Description: Variables that are associated with sites or sampling events at each site. This data file is used to merge information with other files during analysis.
Variables
- site: name of the individual crop field that was sampled
- site.code: unique code describing the location of the sample, based on site, sample round, year, and sampling location
- old.site.code: same as site.code, but an older version that was retained for redundancy
- grower: manager of all sites within a specific cluster (statistical block)
- treatment: strip, control, natural, or on-site control (OSC) treatment
- new.treat: same as treatment, but control and OSC are merged together as control
- transect.loc: where was the sample located within the field
- distance: how far away from the edge was the sample located in metres
- year: what year was the sample collected in
- sample.round: the number of visits to the site each year, starting with 1 and increasing from there
- net.sample: was a net sample collected? (binary)
- pantrap.sample: was a pan trap sample collected? (binary)
- pitfall.sample: was a pitfall sample collected? (binary)
- num.pitfall: how many pitfall traps are included in this sample?
- num.pan.trap: how many pan trap bowls were included in this sample?
- pt_hours_setup: how many hours were the pan traps actively collecting insects for this sample? used as an offset to standardize collection effort
- management: farm management, conventional or organic
- project: which larger project do these data belong to (should all be crop here)
- notes: free-form notes about this sample
- blooming_cover: average percent ground cover of blooming forbs
- bare: average percent ground cover of bare ground
- forb: average percent ground cover of all forbs (blooming or non-blooming)
- grass: average percent ground cover of grasses
- other: average percent ground cover of anything that does not fit into the above categories
- num_plant_sp: average blooming plant richness
- month: collection month for this sample
- day: collection day for this sample
- time_start: time of day for the start of collection for this sample
- date: date of this sample
- doy: numerical day of year for this sample
- temp: ambient temperature during this sample in °C
- wind_ave: average wind speed during this sample in KPH
- wind_max: max wind speed during this sample in KPH
- cloud_cover: cloud cover during this sample
- pt_month: collection month for pan traps
- pt_day: collection day for pan traps
- pt_time_start: time of day for the start of collection for pan traps
- pt_date: date of pan trap collections
- pt_doy: numerical day of year for pan trap collections
- pt_temp: ambient temperature during pan trap collections in °C
- pt_wind_ave: average wind speed during pan trap collections in KPH
- pt_wind_max: max wind speed during pan trap collections in KPH
- pt_cloud_cover: cloud cover during pan trap collections
- net_month: month of net collections
- net_day: day of net collections
- net_time_start: time of day when net collections began
- net_date: date of net collections
- net_doy: numerical day of year for net collections
- net_temp: ambient air temperature during net collections in °C
- net_wind_ave: average wind speed during net collections in KPH
- net_wind_max: max wind speed during net collections in KPH
- net_cloud_cover: cloud cover during net collections
- pitfall_month: month of pitfall collection
- pitfall_day: day of pitfall collection
- pitfall_date: date of pitfall collection
- pitfall_doy: numerical day of year of pitfall collection
- net.bee.total: total number of netted bees during the net transects
- net.bee.rich: species richness of bees collected during the net transects
- net.bee.shan: Shannon diversity of netted bees during the net transects
- net.bee.invsimp: inverse Simpson diversity of netted bees during the net transects
- pt.bee.total: total number of bees collected from pan traps
- pt.bee.rich: bee species richness collected from pan traps
- pt.bee.shan: bee Shannon diversity from pan traps
- pt.bee.invsimp: bee inverse Simpson diversity from pan traps
- pt.syrphid.total: number of syrphids collected from pan traps
- pt.syrphid.rich: syrphid species richness collected from pan traps
- pt.syrphid.shan: syrphid Shannon diversity collected from pan traps
- pt.syrphid.invsimp: syrphid inverse Simpson diversity collected from pan traps
- pf.carabid.total: total number of carabids collected from pitfall traps
- pf.carabid.rich: carabid richness collected from pitfall traps. Values are species richness if the collection year is 2019 and genus richness if the year is 2020, 2021, or 2022
- pf.carabid.shan: carabid Shannon diversity collected from pitfall traps. Values are calculated at the species level if the collection year is 2019 and at the genus level if the year is 2020, 2021, or 2022
- pf.carabid.invsimp: carabid inverse Simpson diversity collected from pitfall traps. Values are calculated at the species level if the collection year is 2019 and at the genus level if the year is 2020, 2021, or 2022
The landscape statistics below share a common format: a prefix naming the statistic and a suffix naming the buffer zone. Take “shan_500” as an example: shan indicates that the variable measures Shannon diversity, and 500 indicates that the measurement is at the 500 metre spatial scale away from sites. The descriptions below are given for the 500 metre buffer zone, but the variables are repeated for 1, 1.5, 2, 2.5, and 3 kilometres away from the sites.
- shan_500: Shannon landscape diversity as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
- edge_dens_500: edge density of landscape cover types as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
- natural_500: the percent cover of natural landcover types as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
- developed_500: the percent cover of developed land cover types as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
- rewrd.ag_500: the percent cover of rewarding agricultural land cover types (those that produce a flower frequently visited by insects for floral rewards) as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
- nonrewrd.ag_500: the percent cover of non-rewarding agricultural land cover types (those that do not produce a flower frequently visited by insects for floral rewards, primarily cereal crops) as measured with a 500 metres buffer away from the sites. Other variables include this same measurement but at different spatial scales, which are indicated by the suffix of the variable name.
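The prefix and suffix convention described above can be handled programmatically. The sketch below is a minimal illustration in Python (the original analysis used R); the function name is hypothetical. It splits on the last underscore so that prefixes containing dots or underscores, such as rewrd.ag or edge_dens, stay intact. Because this README does not state how the kilometre-scale suffixes are spelled, the suffix is returned as text rather than converted to a number.

```python
def split_landscape_name(name):
    """Split a landscape variable name such as 'shan_500' into (prefix, suffix).

    The split is made at the LAST underscore, so 'edge_dens_500' yields
    ('edge_dens', '500').
    """
    prefix, sep, suffix = name.rpartition("_")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a prefix_suffix variable name: {name!r}")
    return prefix, suffix


for column in ["shan_500", "edge_dens_500", "rewrd.ag_500"]:
    print(column, "->", split_landscape_name(column))
```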
These variables were not included in the analysis for the attached manuscripts, but have a similar structure to the variables named above, where the prefix determines the variable type and the suffix is the buffer size.
- edge.len.natural.0.5: total length of edges of the natural landcover category within the buffer region
- edge.len.nonreward_ag.0.5: total length of edges of non-rewarding agriculture as defined above within the buffer region
- edge.len.reward_ag.0.5: total length of edges of rewarding agriculture (defined above) within the buffer region
- edge.len.urban.0.5: same as above, but for the urban landcover category
- num.patch.natural.0.5: number of patches of the natural landcover category within the buffer region
- num.patch.nonreward_ag.0.5: same as above, but for the non-rewarding land cover category
- num.patch.reward_ag.0.5: same as above for the rewarding agricultural land cover category
- num.patch.urban.0.5: same as above, but for the urban landcover category
File: bluevane.insects.20.21.22.MK.feb.19.2024.csv
Description: This data file summarizes the collection of bees and syrphids from blue vane traps. Data from this file were primarily analyzed with generalized linear models in R.
Variables
Variables that are already described above for other data files are omitted here; please refer to the variable lists above.
- trap.code: unique code identifying each sample from an individual trap (a blue vane trap in this file; other trap types in other files)
- transect_type: what type of sample was collected for these data?
- trap_or_transect_num: unique number assigned to each trap so traps collected within the same sampling area could be separated
- sample_type: what type of sample was collected for these data?
- blooming_crop: was the adjacent crop in bloom?
- num.bluevane.insects: how many insects in total were collected from this bluevane trap?
- site_mean_forb: average site level percent forb ground cover
- site_sd_er_forb: standard error associated with the site mean forb calculation
- site_mean_grass: average site level percent grass ground cover
- site_sd_er_grass: standard error associated with the site mean grass calculation
- site_mean_bare: average site level percent bare ground cover
- site_sd_er_bare: standard error associated with the site mean bare calculation
- site_mean_other: average site level percent other ground cover
- site_sd_er_other: standard error associated with the site mean other calculation
- site_bloom_rich_mean: average site-level blooming forb richness
- site_bloom_rich_sd_er: standard error associated with the site bloom rich calculation
- site.bloom.veg.rich: site blooming vegetation richness (sum)
- site.bloom.veg.shan: site-level Shannon diversity of blooming forbs
- site.bloom.veg.invsimp: site-level inverse Simpson diversity of blooming forbs
- decimalLatitude: coordinates of the site
- decimalLongitude: coordinates of the site
- site.2: combination of site, treatment, and year used for sorting purposes
- year2: alternative to year, where year2 indicates the year of the project and not calendar year. formatted as an integer value
- year3: alternative to year2 where year3 is formatted as a factor
File: netted.insects.20.21.22.MK.feb.19.2024.csv
Description: This data file compiles the data for the netted insects (bees and syrphids) collected along the net transects. Variables are described in the variable lists above.
File: sample_dates_feb_16_2024.csv
Description: This data file records when sites were visited so that all sampling events could be merged into the data files, including visits where a sample was taken but no specimens were collected.
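The purpose of this file can be sketched as a zero-filling merge: every visited sample keeps a row, and samples with no specimens receive a count of zero instead of disappearing. The Python below is a minimal illustration only (the original analysis used R); the function name and the sample codes are hypothetical and do not reflect the actual code format in the data files.

```python
def fill_missing_visits(visited_codes, counts_by_code):
    """Return a count for every visited sample, using 0 where nothing was caught.

    visited_codes: iterable of sample codes for all sampling events
    counts_by_code: dict mapping sample code -> specimen count (only samples
                    with at least one specimen appear here)
    """
    return {code: counts_by_code.get(code, 0) for code in visited_codes}


# Hypothetical codes: three visits, but specimens were recorded for only two
visits = ["A_strip_1_2020", "A_control_1_2020", "A_natural_1_2020"]
specimen_counts = {"A_strip_1_2020": 14, "A_natural_1_2020": 3}

print(fill_missing_visits(visits, specimen_counts))
```

Without this step, empty samples would be absent from count summaries and mean capture rates would be biased upward.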
File: 2020_2021_2022_landscape_data_MK_july_31_2023.csv
Description: This data file summarizes the landscape data from 2020–2022 for all of the sites and was used to merge this information with the summary statistics of the vegetation and insect count data
Variables
- site.code: unique code to indicate the sample time for each site. This is a combination of site, treatment, round, and year. A site.code contains multiple sample.codes
- sample.code: same as site.code, but also includes the location within the site where the individual sample was collected. Each sample code should be unique
- grower: grower for the site cluster/manager of the strip and control fields
- month: month when the sample was collected
- day: day the sample was collected
- year: year the sample was collected
- temp: ambient air temperature when the sample was collected in °C
- ave_wind: average wind speed during sample collection in KPH
- max_wind: max wind speed during sample collection in KPH
- cloud_cover: cloud cover during sample collection
- time_start: time of day that the sample collection started
- round: sampling round; resets to one each year and increases by 1 with each site visit for sampling
- treatment: treatment of the site: strip, control, or natural
- transect.type: type of transect that was used at the site. For strip, control, and natural sites, this should match treatment. Natural sites for the pasture project (not reported in Killewald et al. 2025 “Floral strips increase the abundance and spillover of beneficial insects in rotational crop fields”) are either selective, where transects were placed by the researcher in areas where bees were likely to be collected, or random, where the transect location was predetermined with a random generator before visiting the site
- transect: transect number
The tran_ variables below are vegetation survey summary statistics at the transect level (each is a summary of eleven 0.5 x 1 metre quadrats). In the variable names, mean is the average and sd_er is the standard error.
- tran_mean_forb: average forb cover
- tran_sd_er_forb: standard error of forb cover
- tran_mean_grass: average grass cover
- tran_sd_er_grass: standard error of grass cover
- tran_mean_bare: average bare ground cover
- tran_sd_er_bare: standard error of bare ground cover
- tran_mean_other: average cover of anything not in the categories above (this category was rarely, if ever, used)
- tran_sd_er_other: standard error of other cover
- tran_bloom_rich_mean: average blooming forb richness
- tran_bloom_rich_sd_er: standard error of blooming forb richness
The site_ variables below are the same as the tran_ variables listed above, but summarized at the site level: two transects per site for strip, control, and natural sites in the crop project (Killewald et al. 2025 “Floral strips increase the abundance and spillover of beneficial insects in rotational crop fields”), and six transects per site for the pasture project (not published here).
- site_mean_forb: average forb cover
- site_sd_er_forb: standard error of forb cover
- site_mean_grass: average grass cover
- site_sd_er_grass: standard error of grass cover
- site_mean_bare: average bare ground cover
- site_sd_er_bare: standard error of bare ground cover
- site_mean_other: average cover of anything not in the categories above
- site_sd_er_other: standard error of other cover
- site_bloom_rich_mean: average blooming forb richness
- site_bloom_rich_sd_er: standard error of blooming forb richness
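As an illustration of how a mean and standard error pair such as tran_mean_forb and tran_sd_er_forb could be derived from quadrat data, the sketch below computes both from a list of cover values. It is written in Python for illustration (the original analysis used R). It assumes the standard error is the sample standard deviation (n - 1 denominator) divided by the square root of the number of quadrats, which this README does not confirm. The function name and cover values are hypothetical.

```python
import math
import statistics


def mean_and_standard_error(values):
    """Return (mean, standard error) for a list of percent-cover values.

    Assumption: standard error = sample standard deviation / sqrt(n).
    """
    n = len(values)
    if n < 2:
        raise ValueError("at least two values are needed for a standard error")
    mean = statistics.fmean(values)
    std_error = statistics.stdev(values) / math.sqrt(n)
    return mean, std_error


# Hypothetical forb cover (%) from the eleven quadrats of one transect
forb_cover = [10, 25, 0, 40, 15, 30, 5, 20, 35, 10, 30]
mean_forb, se_forb = mean_and_standard_error(forb_cover)
print(round(mean_forb, 2), round(se_forb, 2))
```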
File: pitfall.insects.20.21.22.MK.feb.19.2024.csv
Description: This data file summarizes the beetle assemblages that were collected from the pitfall traps between 2020 and 2022.
File: pantrap.insects.20.21.22.MK.feb.19.2024.csv
Description: This file summarizes the counts and diversity of insects that were collected using the pan traps between 2020 and 2022.
File: 2019_landscape_data_MK_oct_30_2023.csv
Description: This data file summarizes the landscape data that was collected for sites visited during 2019. Similar to the other landscape file, but contains sites from a different sampling year, and the landscape information is associated with a different sampling year as well.
File: 2020_2021_2022_vegetation_data_summarized_by_transect_and_site_June_20_2024.csv
Description: This data file summarizes the vegetation survey data and organizes the information by transect and site so that these data may be merged with the insect counts and landscape data.
File: bluevane.bee.matrix.20.21.MK.feb.19.2024.csv
File: bluevane.syrphid.matrix.20.21.MK.feb.19.2024.csv
File: netted.bee.matrix.20.21.22.MK.feb.19.2024.csv
File: netted.syrphid.matrix.20.21.22.MK.feb.19.2024.csv
File: pantrap.syrphid.matrix.20.21.22.MK.feb.19.2024.csv
File: pantrap.bee.matrix.20.21.22.MK.feb.19.2024.csv
File: pitfall.genus.matrix.20.21.MK.feb.19.2024.csv (genus ~ site matrix)
Description: The above datasets are species ~ site matrices (except the file: pitfall.genus.matrix.20.21.MK.feb.19.2024.csv, as this is a genus ~ site matrix) where each row represents a sample and columns represent species counts from that sample. Each dataset was used in ordination and PERMANOVA analysis to determine community differences between treatment types. For associations between row names (site sample codes) and landscape/site variables, see site.information.feb.7.2022.csv
Variables
- (unnamed first column): the sample code that is unique to each sample
All remaining columns are scientific species names (genus names in the pitfall matrix), where cell values represent counts of individuals per sample
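The diversity indices reported elsewhere in this dataset (Shannon and inverse Simpson) can be computed from a single row of one of these matrices. The sketch below is a minimal Python illustration (the original analysis used R); it assumes the natural logarithm for Shannon diversity, which this README does not state. The function names and example counts are hypothetical.

```python
import math


def _proportions(counts):
    """Convert raw counts to proportions, ignoring taxa with zero individuals."""
    total = sum(counts)
    if total == 0:
        raise ValueError("sample contains no individuals")
    return [c / total for c in counts if c > 0]


def richness(counts):
    """Number of taxa with at least one individual."""
    return sum(1 for c in counts if c > 0)


def shannon(counts):
    """Shannon diversity H' = -sum(p * ln p), natural log assumed."""
    return -sum(p * math.log(p) for p in _proportions(counts))


def inverse_simpson(counts):
    """Inverse Simpson diversity 1 / sum(p^2)."""
    return 1.0 / sum(p * p for p in _proportions(counts))


# Hypothetical row of a species ~ site matrix: counts for four species
row = [5, 5, 0, 0]
print(richness(row), shannon(row), inverse_simpson(row))
```

Raising an error for empty samples is a choice of this sketch; how empty samples were treated in the original analysis is not described here.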
File: pinned_specimen_data_MK_feb_12_2024.csv
Description: This is the raw data that is associated with the counts of species per collection event. This data file is curated by Jason Gibbs at the University of Manitoba (jason.gibbs@umanitoba.ca) and is included in the museum of entomology at the university. Each row in this data file is linked to one unique specimen.
Variables
- JBWMBarcode: unique code assigned to each specimen
- fix: used for data cleaning- does this entry require assistance
- record.number: number to indicate which entries were entered in what order- used to retain entry order after sorting data
- Prepared.by: who prepared the data entry
- grower: farmer who manages the site cluster of all treatments
- treatment: treatment of the field (strip, control, natural, or OSC- on-site control)
- location.of.transect: where was the transect located within the field
- distance: distance from field edge in metres. 0 indicates the sample was collected on the edge itself
- transect_type: type of sample
- transect_or_trap_num: if a sampling location contains multiple of the same transect type, then a unique number is assigned to each individual sample to keep them separate
- colour: colour of the pan trap
- sample.round: how many times was that site visited that year? starts with 1
- month.start: month of sampling start
- day.start: day of sampling start
- month.end: month when sampling is completed
- day.end: day when sampling is completed
- year: year of data collection
- collector: who collected the sample?
- method: how was the sample collected
- Host.common.name: common name of the host of the specimen (flowers generally for bees) may be left blank if no host data exists
- crop: what crop was the specimen collected on or near- may be left blank if it does not exist
- perc_bloom: percent bloom of adjacent crop- may be left blank if it does not exist
- site: combination of grower and treatment
- site.code: combination of grower, treatment, transect location, round, and year. used for sorting data
- trap.code: unique code associated with each individual sampling trap/transect
- Start.date.Y.M.D: start date of collection effort in year, month, day format
- End.date: end date of collection effort in year, month, day format
- Country: country where the sample was collected
- Prov: province where the sample was collected
- Region: region where the sample was collected
- Region.prefix: prefix of the region (if applicable) where the sample was collected
- Locality: location of the sample in relation to the nearest town
- label_loc: location of the sample to include on the insect label
- project: what project is the specimen associated with
- loc: code associated with the label location
- treatment.code: single letter associated with the treatment of the site
- management: site management, whether that is conventional or organic
- lat.rounded: site coordinates rounded to three decimal places
- long.rounded: site coordinates rounded to three decimal places
- Latitude: site coordinates
- Longitude: site coordinates
- datum: where did the site coordinate originate
- host.family: host taxonomic information, if known
- host.genus: host taxonomic information, if known
- host.species: host taxonomic information, if known
- host.spp: combination of host genus and host species if known
- Taxon.order: what taxonomic order does the specimen belong to
- Taxon.family: what taxonomic family does the specimen belong to
- Subfamily: what taxonomic subfamily does the specimen belong to
- Tribe: what taxonomic tribe does the specimen belong to
- Genus: what taxonomic genus does the specimen belong to
- Subgenus: what taxonomic subgenus does the specimen belong to
- Species: what taxonomic species does the specimen belong to
- author: which taxonomist described the species and what year?
- Sex: sex of the specimen
- Determiner: who determined the identity of the specimen
- sp: combination of Genus and species
- Subspecies: subspecies of the specimen, if known
- Subspecies.author: taxonomic author of the subspecies
- Depository: where is the specimen located?
- stem.number: if the specimen was removed from a stem nest, then what was the number of the stem? cross-referenced to a data file for an unrelated project
- cell.number: same as stem number, but the number of the cell within the stem, if known
- emerged.died: for stem nest data- did the specimen emerge on its own, die inside the stem, etc.
- stem_processed_date: when was the stem processed (for stem nest data only)
- label_printed: has the specimen label been printed
- label: the specimen label to be printed
- measurent.mag: associated trait data for this specimen: magnification of measurement
- measurement.ratio.pix.per.mm.: associated trait data for this specimen: how many pixels per mm?
- measurement.pixel.Count: associated trait data for this specimen: number of pixels measurement
- measurement.mm: associated trait data for this specimen: conversion of pixels to overall mm
- notes: notes on the specimen that should be recorded. free-form
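The relationship between the measurement columns above can be sketched as a simple unit conversion: a length in pixels divided by the pixels-per-millimetre ratio gives a length in millimetres. This is an assumed reading of the column descriptions, not a confirmed account of how measurement.mm was produced, and the Python function name and example numbers are hypothetical.

```python
def pixels_to_mm(pixel_count, pixels_per_mm):
    """Convert a measured length in pixels to millimetres."""
    if pixels_per_mm <= 0:
        raise ValueError("pixels_per_mm must be positive")
    return pixel_count / pixels_per_mm


# Hypothetical specimen: 500 pixels measured at a scale of 250 pixels per mm
print(pixels_to_mm(500, 250))
```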
The columns below are extra and are only retained within the data file for redundancy. These are not utilized for the analysis
- Subspecies.y:
- Subspecies.author.y:
- Note.y:
- Taxon.order.y:
- Taxon.family.y:
- Subfamily.y:
- Tribe.y:
- Genus2:
- Subgenus.y:
- Species2:
- author.y:
Code/software
The code can be viewed using R or RStudio. All code for this project was written and run in RStudio.