HeadlinesBriefing favicon HeadlinesBriefing.com

Google DeepMind’s DolphinGemma Model Decodes Whale‑Like Sounds

Google DeepMind Blog •
×

Google DeepMind unveiled DolphinGemma, a 400‑million‑parameter model that learns the syntax of Atlantic spotted dolphin vocalizations. Built on SoundStream tokenization, the system predicts next sounds much like language models predict words. Researchers from the Wild Dolphin Project in the Bahamas have supplied the years‑long, hand‑labeled audio set that anchors the model’s training for future analysis and field deployments in 2025.

Field teams now run DolphinGemma on Google Pixel 9 phones, letting researchers spot recurring acoustic patterns in real time. The model feeds into the CHAT system, which pairs synthetic whistles with objects like sargassum or scarves, training dolphins to signal requests. Pixel devices cut hardware costs and enable bone‑conducting underwater alerts, speeding response times for continuous studies across multiple years.

By opening DolphinGemma to the scientific community, DeepMind hopes other cetacean researchers can fine‑tune the model for species such as bottlenose or spinner dolphins. The collaboration demonstrates how machine learning can accelerate pattern discovery in complex animal communication, reducing manual analysis time and opening doors to practical interspecies dialogue tools for future researchers in the ocean world to decipher marine.