This BEECH_HRusticaEggDatareadme.txt file was generated on 2022-05-26 by IRIS LEVIN


GENERAL INFORMATION

1. Title of Dataset: High within-clutch repeatability of eggshell phenotype in Barn Swallows (Hirundo rustica erythrogaster) despite less maculated last-laid eggs

2. Author Information
	A. Principal Investigator Contact Information
		Name: Iris I Levin
		Institution: Kenyon College
		Address: Dept of Biology, 202 N College Rd, Gambier, Ohio 43022
		Email: levin1@kenyon.edu

	B. Associate or Co-investigator Contact Information
		Name: Ava-Rose Beech
		Institution: Kenyon College
		Address:
		Email: beech1@kenyon.edu

	C. Alternate Contact Information
		Name:
		Institution:
		Address:
		Email:

3. Date of data collection (single date, range, approximate date): 2018-05-01 - 2021-07-01

4. Geographic location of data collection: GA, OH, USA

5. Information about funding sources that supported the collection of the data: NSF (IOS-1856254), Kenyon College, Agnes Scott College


SHARING/ACCESS INFORMATION

1. Licenses/restrictions placed on the data: Reuse with permission

2. Links to publications that cite or use the data: Ornithology, in press

3. Links to other publicly accessible locations of the data: NA

4. Links/relationships to ancillary data sets: NA

5. Was data derived from another source? (yes/no): No
	A. If yes, list source(s):

6. Recommended citation for this dataset:


DATA & FILE OVERVIEW

1. File List:
	first.v.replacement.NPM.csv: First clutch vs. replacement clutch egg match ranks generated by NaturePatternMatch
	first.v.replacement.SpotEgg.csv: First clutch vs. replacement clutch egg phenotype data generated by SpotEgg
	full.SpotEgg.csv: Complete egg phenotype dataset generated using SpotEgg
	lay.order.NPM.csv: Ranks to clutch for eggs of known lay order generated by NaturePatternMatch
	lay.order.SpotEgg.csv: Egg phenotype for eggs with known lay order generated by SpotEgg
	sides.of.egg.NPM.csv: Ranks to egg for photos of different sides of the same egg generated by NaturePatternMatch
	sides.of.egg.SpotEgg.csv: Egg phenotype for photos of different sides of the same egg generated by SpotEgg

2. Relationship between files, if important: NA

3. Additional related data collected that was not included in the current data package:

4. Are there multiple versions of the dataset? (yes/no): No
	A. If yes, name of file(s) that was updated:
		i. Why was the file updated?
		ii. When was the file updated?


METHODOLOGICAL INFORMATION

1. Description of methods used for collection/generation of data: Data were generated from photographs of eggs

2. Methods for processing the data: See methods in paper

3. Instrument- or software-specific information needed to interpret the data: See methods in paper for SpotEgg and NaturePatternMatch methods

4. Standards and calibration information, if appropriate:

5. Environmental/experimental conditions: NA

6. Describe any quality-assurance procedures performed on the data: NA

7. People involved with sample collection, processing, analysis and/or submission: Iris Levin, Toshi Tsunekage, Emily Smith, Yujie Liu, Mattheus Santos, Ava-Rose Beech, Ben Berejka


DATA-SPECIFIC INFORMATION FOR: first.v.replacement.NPM.csv

1. Number of variables: 6

2. Number of cases/rows: 104

3. Variable List:
	File Name: Name of egg (site, nest number, egg number)
	Nest: Nest number
	Clutch Size: Number of eggs in that nest
	Rank to Nest: NPM rank for how egg matched back to nest of origin
	Paired Nest: Second clutch laid by same female
	Rank to Paired: NPM rank for how egg matched back to second clutch

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: first.v.replacement.SpotEgg.csv

1. Number of variables: 13

2. Number of cases/rows: 24

3. Variable List:
	Clutch: First vs. replacement
	Female: Identity of the female who laid the eggs
	Site: Where eggs were sampled
	Nest: Nest number
	Clutch size: Number of eggs laid in that nest
	MeanVolume: Clutch mean for egg volume generated by SpotEgg
	MeanArea: Clutch mean for egg area generated by SpotEgg
	MeanWidth: Clutch mean of egg width generated by SpotEgg
	MeanLength: Clutch mean of egg length generated by SpotEgg
	MeanSphericity: Clutch mean of egg sphericity generated by SpotEgg
	MeanNumSpots: Clutch mean number of spots generated by SpotEgg
	Mean TotAreaSpots: Clutch mean for total area of eggshell covered in spots
	Mean AvgSpotSize: Clutch mean for average spot size on eggshell

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: full.SpotEgg.csv

1. Number of variables: 15

2. Number of cases/rows: 705

3. Variable List:
	FileName: Name of egg (site, nest number, egg number)
	Year: Year egg laid and photographed
	Location: State of collection
	Prexlude: Y = egg could be sampled from a female already with eggs in data set
	Site: Site name where egg was laid
	Nest: Nest ID
	Clutch size: Number of eggs in that clutch
	Volume: Egg volume generated by SpotEgg
	Area: Egg area generated by SpotEgg
	Length: Egg length generated by SpotEgg
	Width: Egg width generated by SpotEgg
	Sphericity: Egg sphericity, B/L from data above
	NumSpots: Number of spots on eggshell generated by SpotEgg
	TotAreaSpots: Total area of eggshell (in %) covered in spots generated by SpotEgg
	AvgSpotSize: Average spot size generated by SpotEgg

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: lay.order.NPM.csv

1. Number of variables: 13

2. Number of cases/rows: 33

3. Variable List:
	Nest: Nest ID
	Number of Eggs: Number of eggs in the clutch with known lay order
	E1 rank: Rank of first egg to nest of origin generated by NaturePatternMatch
	E2 rank: Rank of second egg to nest of origin generated by NaturePatternMatch
	E3 rank: Rank of third egg to nest of origin generated by NaturePatternMatch
	E4 rank: Rank of fourth egg to nest of origin generated by NaturePatternMatch
	E5 rank: Rank of fifth egg to nest of origin generated by NaturePatternMatch
	E6 rank: Rank of sixth egg to nest of origin generated by NaturePatternMatch
	Average Rank: Mean rank of eggs to clutch
	Average of all but first: Mean rank of all eggs to clutch excluding the first-laid egg
	Final egg rank: Rank of last-laid egg to clutch
	Average of all but last: Mean rank of all eggs to clutch excluding the last-laid egg
	Average of middle: Mean rank of all middle eggs excluding first- and last-laid eggs

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: lay.order.SpotEgg.csv

1. Number of variables: 12

2. Number of cases/rows: 191

3. Variable List:
	File name: Name of egg (site, nest number, egg number)
	Lay order: Order in which eggs were laid
	Nest: Nest ID
	Clutch size: Number of eggs in the clutch
	Volume: Egg volume generated by SpotEgg
	Area: Egg area generated by SpotEgg
	Length: Egg length generated by SpotEgg
	Width: Egg width generated by SpotEgg
	Sphericity: Egg sphericity, B/L from data above
	NumSpots: Number of spots on eggshell generated by SpotEgg
	TotAreaSpots: Total area of eggshell (in %) covered in spots generated by SpotEgg
	AvgSpotSize: Average spot size generated by SpotEgg

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: sides.of.egg.NPM.csv

1. Number of variables: 4

2. Number of cases/rows: 148

3. Variable List:
	File name: Name of egg (site, nest number, egg number)
	Nest: Nest ID
	Rank to nest: Rank to nest of origin generated by NaturePatternMatch
	Rank to egg: Whether the best matching egg was the photo of the other side of that same egg

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:


DATA-SPECIFIC INFORMATION FOR: sides.of.egg.SpotEgg.csv

1. Number of variables: 14

2. Number of cases/rows: 148

3. Variable List:
	File name: Name of egg (site, nest number, egg number)
	Side: One vs. two, for photos of sides of eggs rotated 180 deg
	Egg: Egg number
	Site: Where egg was laid and photographed
	Nest: Nest ID
	NestEgg: Unique nest & egg identifier
	Clutch size: Number of eggs laid in the clutch
	Volume: Egg volume generated by SpotEgg
	Area: Egg area generated by SpotEgg
	Length: Egg length generated by SpotEgg
	Width: Egg width generated by SpotEgg
	Sphericity: Egg sphericity, B/L from data above
	NumSpots: Number of spots on eggshell generated by SpotEgg
	TotAreaSpots: Total area of eggshell (in %) covered in spots generated by SpotEgg
	AvgSpotSize: Average spot size generated by SpotEgg

4. Missing data codes: NA

5. Specialized formats or other abbreviations used:
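

EXAMPLE: CHECKING FILES AGAINST THIS README

The variable and row counts documented above can be compared against the downloaded files. The following is a minimal sketch, not part of the original data package. It assumes that each file is comma-delimited with a single header row, that "cases/rows" excludes that header, and that all seven files sit in one directory. The function name check_shapes and the DOCUMENTED_SHAPES table are hypothetical names introduced here for illustration. Note that the variable list for sides.of.egg.SpotEgg.csv names 15 variables while the stated count is 14, so a mismatch reported for that file may reflect this README rather than the data.

```python
import csv
from pathlib import Path

# (number of variables, number of rows) exactly as stated in this README.
DOCUMENTED_SHAPES = {
    "first.v.replacement.NPM.csv": (6, 104),
    "first.v.replacement.SpotEgg.csv": (13, 24),
    "full.SpotEgg.csv": (15, 705),
    "lay.order.NPM.csv": (13, 33),
    "lay.order.SpotEgg.csv": (12, 191),
    "sides.of.egg.NPM.csv": (4, 148),
    "sides.of.egg.SpotEgg.csv": (14, 148),
}


def check_shapes(data_dir, documented=DOCUMENTED_SHAPES):
    """Return a list of discrepancy messages (empty if everything matches).

    Assumption: one header row per file; data rows are all remaining
    non-blank lines.
    """
    problems = []
    for name, (n_vars, n_rows) in documented.items():
        path = Path(data_dir) / name
        if not path.is_file():
            problems.append(f"{name}: file not found")
            continue
        # utf-8-sig tolerates a byte-order mark left by spreadsheet exports.
        with path.open(newline="", encoding="utf-8-sig") as handle:
            rows = [row for row in csv.reader(handle) if row]  # skip blank lines
        if not rows:
            problems.append(f"{name}: file is empty")
            continue
        header, body = rows[0], rows[1:]
        if len(header) != n_vars:
            problems.append(
                f"{name}: {len(header)} variables found, {n_vars} documented"
            )
        if len(body) != n_rows:
            problems.append(
                f"{name}: {len(body)} rows found, {n_rows} documented"
            )
    return problems
```

Calling check_shapes with the path to the folder holding the CSV files returns one message per discrepancy, or an empty list if every file matches the counts given in this README.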