Sharks and rays swimming in a large public aquarium
Data files
Feb 13, 2024 version files 53.77 GB
-
README.md
1.55 KB
-
shark_ray_tracking.zip
53.77 GB
Abstract
Large Public Aquaria are complex ecosystems that require constant monitoring to detect and correct anomalies that may affect the habitat and their species. Many of those anomalies can be directly or indirectly spotted by monitoring the behavior of fish. This can be a quite laborious task to be done by biologists alone. Automated fish tracking methods, specially of the non-intrusive type, can help biologists in the timely detection of such events. These systems require annotated data of fish to be trained. We used footage collected from the main aquarium of Oceanário de Lisboa to create a novel dataset with fish annotations from the shark and ray species. The dataset has the following characteristics:
- 66 shark training tracks with a total of 15812 bounding boxes
- 88 shark testing tracks with a total of 15978 bounding boxes
- 133 ray training tracks with a total of 28168 bounding boxes
- 192 ray testing tracks with a total of 31529 bounding boxes
The training set corresponds to a calm environment where fish swim slowly. On the other hand, the test set is a challenging environment where divers enter the tank to feed the fish causing a frenzy.
README: Sharks and rays swimming in a large public aquarium
Each set has 2 folders: gt and img1. The gt folder contains 3 txt files: gt, gt_out and labels. The gt and gt_out files contain the bounding box annotations sorted in two distinct ways. The former has the annotations sorted by frame number, while the latter is sorted by the track ID. Each line of the ground truth files represents one bounding box of a fish trajectory. The bounding boxes are represented with the following format: frame id, track id, x, y, w, h, not ignored, class id, visibility. The folder img1 contains all the annotated frames.
Description of the bounding boxes variables:
frame id points to the frame where the bounding box was obtained;
track id identifies the track of a fish with which the bonding box is associated;
x and y are the pixel coordinates of the top left corner of the bounding box;
w and h are the width and height of the bounding box respectively. These variables are measured in terms of pixels of the image;
not ignored is a flag to define if the bounding box is meant to be used in the processing. When the value is 1 the bounding box should be considered and when the value is 0 the bounding box should not be considered;
class id contains the class of the object of the bounding box. This id can be crossed with the labels file to obtain the name of the object category;
visibility identifies if the object is visible on the screen in that frame. When the value is 1 the object is visible and when the value is 0 the object is not visible.
Methods
The dataset was collected using a stationary camera positioned outside the main tank of Oceanário de Lisboa aiming at the fish.
Additionally, this data was processed using the CVAT annotation tool to create the sharks and rays annotations.