Skip to main content
Dryad

Data from: Reinforcement learning-based framework for whale rendezvous via autonomous sensing robots

Data files

Nov 02, 2024 version files 494.36 MB

Abstract

Rendezvous with sperm whales for biological observations is made challenging by their prolonged dive patterns. In this paper, we propose an algorithmic framework that co-develops multi-agent reinforcement learning-based routing (autonomy module) and Synthetic Aperture Radar-based Very High Frequency (VHF) signal-based bearing estimation (sensing module) for maximizing rendezvous opportunities of autonomous robots with sperm whales. The sensing module is compatible with low-energy VHF tags, commonly used for tracking wildlife. The autonomy module leverages in-situ noisy bearing measurements of whale vocalizations, VHF tags, and whale dive behaviors to enable time–critical rendezvous of a robot team with multiple whales in simulation. We conduct experiments at sea in the native habitat of sperm whales using an "engineered whale" - a speedboat equipped with a VHF-emitting tag, emulating five distinct whale tracks, with different whale motions. The sensing module shows a median bearing error of 10.55 degrees to the tag. Using bearing measurements to the engineered whale from an acoustic sensor and our sensing module, our autonomy module gives an aggregate successful rendezvous of 81.3% for a 500-meter rendezvous distance using three robots in post-processing. A second class of fielded experiments that use acoustic-only bearing measurements to three untagged sperm whales, show an aggregate successful rendezvous of 68.68% for a 1000-meter rendezvous distance using two robots in post-processing. We further validate these algorithms with several ablation studies using a sperm whale visual encounter dataset collected by marine biologists.