Automated time-lapse cameras can facilitate reliable and consistent monitoring of wild animal populations. In this report, data from 73,802 images taken by 15 different Penguin Watch cameras are presented, capturing the dynamics of penguin (Spheniscidae; Pygoscelis spp.) breeding colonies across the Antarctic Peninsula, South Shetland Islands and South Georgia (03/2012 to 01/2014). Citizen science provides a means by which large and otherwise intractable photographic data sets can be processed, and here we describe the methodology associated with the Zooniverse project Penguin Watch, and provide validation of the method. We present anonymised volunteer classifications for the 73,802 images, alongside the associated metadata (including date/time and temperature information). In addition to the benefits for ecological monitoring, such as easy detection of animal attendance patterns, this type of annotated time-lapse imagery can be employed as a training tool for machi
ne
learning algorithms to automate data extraction, and we encourage the use of this data set for computer vision development.
Raw images - NEKO
Raw image (JPEG) files for photographs captured in NEKO Harbour (NEKO). There are 6 folders in total (images are separated by camera and year). Date, time, moon phase and temperature are shown within each image. Image names are in the following format: SITExYEARx_imagenumber.JPG. Please see the associated paper for more information.
Raw_images_NEKO.zip
Raw images - PETE.1
Raw images (JPEG) files for photographs captured on Petermann Island (PETE). There are 2 folders in total (images are separated by year); additional PETE images can be found in 'PETE.2'. Date, time, moon phase and temperature are shown within each image. Image names are in the following format: SITExYEARx_imagenumber.JPG. Please see the associated paper for more information.
Raw_images_PETE.1.zip
Raw images - PETE.2
Raw image (JPEG) files for photographs captured on Petermann Island (PETE). There are 2 folders in total (images are separated by camera and year); additional PETE images can be found in 'PETE.1'. Date, time, moon phase and temperature are shown within each image. Image names are in the following format: SITExYEARx_imagenumber.JPG. Please see the associated paper for more information.
Raw_images_PETE.2_v2.zip
Raw images - SPIG
Raw image (JPEG) files for photographs captured on Spigot Peak (SPIG). There are 3 folders in total (images are separated by year). Date, time, moon phase and temperature are shown within each image. Image names are in the following format: SITExYEARx_imagenumber.JPG. Please see the associated paper for more information.
Raw_images_SPIG.zip
PW Anonymised Raw Classifications and Metadata
This folder contains 32 csv files, each providing the metadata (including - but not limited to - 'user_name' (anonymised), 'subject_zooniverse_id', 'temperature_f', 'timestamp', and raw 'x' and 'y' coordinates for volunteer clicks) associated with each of the 63,070 Penguin Watch subjects (photographs) provided in this repository. Please see the associated paper for further information and an explanation of terms. The metadata are divided by camera/year, although some are subdivided for clarity and consistency with the 'PW Consensus Click Data' files. For information on how to filter the raw coordinates prior to performing a cluster analysis, please see the 'Clustering Algorithm' section of the accompanying 'Data Descriptor'. Please note that any missing information is absent because it could not be obtained from the cameras, or was missing from the Penguin Watch database - all available metadata have been uploaded to the repository.
Anonymised_PW_metadata_v2.zip
PW Consensus Click Data
This folder comprises 32 csv files, each containing 'consensus click' data relating to the 63,070 Penguin Watch subjects (photographs) also published in this repository. These data are produced via a clustering algorithm (see the 'Methods' section of the associated paper for more information), the code for which can be found on GitHub at the following address: https://github.com/zooniverse/aggregation/blob/master/penguins/aggregate.py (a static version is also archived on Figshare at: https://doi.org/10.6084/m9.figshare.5472544.v1). Please also see the associated paper for an explanation of terms. Please note that the data for some cameras have been subdivided into multiple files - this is to increase clarity, and for consistency with respect to the 'Anonymised Raw Classifications and Metadata' files.
Consensus_clicks_v2.zip
Raw images - DAMO-MAIV
Raw images (JPEG) files from cameras at Damoy Point, George's Point, Half Moon Island, Port Lockroy and Maiviken - DAMO (1), GEOR (1), HALF (2), LOCK (1) and MAIV (4), respectively. Numbers in parentheses denote the number of separate folders associated with each camera location. Each image shows the date, time, moon phase and temperature alongside the photograph. Individual images are named in the following format: SITExYEARx_imagenumber.JPG. For more information please refer to the associated paper.
Raw_images_DAMO-MAIV.zip
Erratum
Description of change between version 1 and version 2.