A benchmark for computational analysis of animal behavior, using animal-borne tags

Read original: arXiv:2305.10740 - Published 4/12/2024 by Benjamin Hoffman, Maddie Cusimano, Vittorio Baglione, Daniela Canestrari, Damien Chevallier, Dominic L. DeSantis, Lor`ene Jeantet, Monique A. Ladds, Takuya Maekawa, Vicente Mata-Silva and 7 others

👁️

Overview

Animal-borne sensors, known as "bio-loggers," can collect a variety of data about an animal's movement and environment, which can provide insights into the animal's physiology and support conservation efforts.
Machine learning techniques are used to interpret the large amounts of data collected by bio-loggers, but there is no common framework for comparing the different machine learning approaches in this domain.
To address this, the researchers present the Bio-logger Ethogram Benchmark (BEBE), a collection of datasets with behavioral annotations, as well as a modeling task and evaluation metrics.
BEBE is the largest and most taxonomically diverse publicly available benchmark of its kind, containing 1654 hours of data from 149 individuals across nine taxa.
The researchers also test a novel self-supervised learning approach to identifying animal behaviors using bio-logger data, and show that it outperforms common alternatives, especially when there is limited training data.

Plain English Explanation

Animal-borne sensors, or "bio-loggers," are devices that can be attached to animals to record a wide range of data about their movement and environment. This data can provide valuable insights into the animals' physiology and behavior, which can then be used to support conservation efforts.

To interpret the large amounts of data collected by bio-loggers, researchers often use machine learning techniques. However, there has been no common framework for comparing the different machine learning approaches used in this domain. This makes it difficult to evaluate the effectiveness of various methods and identify the best approaches for working with bio-logger data.

To address this gap, the researchers have developed the Bio-logger Ethogram Benchmark (BEBE), which is a collection of datasets with detailed behavioral annotations. BEBE also includes a standardized modeling task and evaluation metrics that researchers can use to test and compare their machine learning techniques.

BEBE is the largest and most diverse publicly available benchmark of its kind, containing over 1,600 hours of data collected from 149 individual animals across nine different species. This diversity helps ensure that the benchmark is representative of the real-world challenges researchers may face when working with bio-logger data.

In addition to creating BEBE, the researchers also tested a novel self-supervised learning approach for identifying animal behaviors from bio-logger data. Self-supervised learning is a type of machine learning where the model learns to recognize patterns in the data without being explicitly trained on labeled examples. The researchers found that this approach outperformed more common machine learning methods, especially when there was limited training data available.

Technical Explanation

The researchers present the Bio-logger Ethogram Benchmark (BEBE), a collection of datasets with behavioral annotations, as well as a modeling task and evaluation metrics, to address the lack of a common framework for comparing machine learning techniques used to interpret bio-logger data.

BEBE includes 1654 hours of data collected from 149 individuals across nine taxa, making it the largest and most taxonomically diverse publicly available benchmark of its kind. The datasets contain a variety of kinematic and environmental data recorded by animal-borne bio-loggers, along with detailed annotations of the animals' behaviors.

Using BEBE, the researchers test a novel self-supervised learning approach for identifying animal behaviors from bio-logger data. This approach involves training a deep neural network on a large, unlabeled dataset of human wrist-worn accelerometer data, and then fine-tuning the pre-trained model on the BEBE datasets. The researchers show that this self-supervised approach outperforms common supervised learning techniques, especially in settings with limited training data.

The datasets, models, and evaluation code for BEBE are made publicly available at https://github.com/earthspecies/BEBE to enable the broader research community to use the benchmark as a point of comparison in the development of new machine learning methods for working with bio-logger data.

Critical Analysis

The BEBE benchmark and the researchers' self-supervised learning approach represent important contributions to the field of animal behavior research using bio-loggers. By providing a standardized dataset and evaluation framework, BEBE can help catalyze progress in this area by enabling more direct comparisons between different machine learning techniques.

However, the paper does not address some potential limitations of the BEBE dataset and the self-supervised learning approach. For example, the dataset may not be representative of all possible animal behaviors or environments, and the self-supervised learning method may be sensitive to the specific characteristics of the human wrist-worn accelerometer data used for pre-training.

Additionally, the paper does not delve into the ethical considerations of using bio-loggers on animals, such as potential impacts on animal welfare or the need for responsible data collection and use. As the use of these technologies becomes more widespread, it will be important for the research community to grapple with these issues.

Overall, the BEBE benchmark and the self-supervised learning approach represent valuable contributions to the field, but there is still room for further research and discussion to address the limitations and ethical considerations of this work. Readers are encouraged to critically evaluate the research and consider the broader implications for animal behavior and conservation science.

Conclusion

The researchers have developed the Bio-logger Ethogram Benchmark (BEBE), a comprehensive dataset and evaluation framework for testing machine learning techniques used to interpret data collected by animal-borne bio-loggers. BEBE is the largest and most taxonomically diverse publicly available benchmark of its kind, and the researchers have also tested a novel self-supervised learning approach that outperforms common alternatives, especially when training data is limited.

By providing a standardized platform for comparing different machine learning methods, BEBE has the potential to accelerate progress in the field of animal behavior research using bio-loggers. The public release of the datasets, models, and evaluation code will enable the broader research community to build upon this work and develop ever-more sophisticated techniques for extracting insights from the rich data collected by these animal-mounted sensors.

As the use of bio-loggers becomes more widespread, this research will also have important implications for conservation efforts, as the data and insights enabled by these technologies can inform strategies to protect vulnerable species and their habitats. However, the ethical considerations surrounding the use of bio-loggers on animals must also be carefully addressed as this field continues to evolve.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

👁️

A benchmark for computational analysis of animal behavior, using animal-borne tags

Benjamin Hoffman, Maddie Cusimano, Vittorio Baglione, Daniela Canestrari, Damien Chevallier, Dominic L. DeSantis, Lor`ene Jeantet, Monique A. Ladds, Takuya Maekawa, Vicente Mata-Silva, V'ictor Moreno-Gonz'alez, Eva Trapote, Outi Vainio, Antti Vehkaoja, Ken Yoda, Katherine Zacarian, Ari Friedlaender

Animal-borne sensors ('bio-loggers') can record a suite of kinematic and environmental data, which can elucidate animal ecophysiology and improve conservation efforts. Machine learning techniques are used for interpreting the large amounts of data recorded by bio-loggers, but there exists no common framework for comparing the different machine learning techniques in this domain. To address this, we present the Bio-logger Ethogram Benchmark (BEBE), a collection of datasets with behavioral annotations, as well as a modeling task and evaluation metrics. BEBE is to date the largest, most taxonomically diverse, publicly available benchmark of this type, and includes 1654 hours of data collected from 149 individuals across nine taxa. In addition, using BEBE, we test a novel self-supervised learning approach to identifying animal behaviors based on bio-logger data, using a deep neural network pre-trained with self-supervision on data collected from human wrist-worn accelerometers. We show that this approach out-performs common alternatives, especially in a setting with a low amount of training data. Datasets, models, and evaluation code are made publicly available at https://github.com/earthspecies/BEBE, to enable community use of BEBE as a point of comparison in methods development.

4/12/2024

🏷️

Accelerometer-Based Multivariate Time-Series Dataset for Calf Behavior Classification

Oshana Dissanayake, Sarah E. McPherson, Joseph Allyndree, Emer Kennedy, Padraig Cunningham, Lucile Riaboff

Getting new insights on pre-weaned calf behavioral adaptation to routine challenges (transport, group relocation, etc.) and diseases (respiratory diseases, diarrhea, etc.) is a promising way to improve calf welfare in dairy farms. A classic approach to automatically monitoring behavior is to equip animals with accelerometers attached to neck collars and to develop machine learning models from accelerometer time-series. However, to be used for model development, data must be equipped with labels. Obtaining these labels requires annotating behaviors from direct observation or videos, a time-consuming and labor-intensive process. To address this challenge, we propose the ActBeCalf (Accelerometer Time-Series for Calf Behaviour classification) dataset: 30 pre-weaned dairy calves (Holstein Friesian and Jersey) were equipped with a 3D-accelerometer sensor attached to a neck-collar from one week of birth for 13 weeks. The calves were simultaneously filmed with a camera in each pen. At the end of the trial, behaviors were manually annotated from the videos using the Behavioral Observation Research Interactive Software (BORIS) by 3 observers using an ethogram with 23 behaviors. ActBeCalf contains 27.4 hours of accelerometer data aligned adequately with calf behaviors. The dataset includes the main behaviors, like lying, standing, walking, and running, and less prominent behaviors, such as sniffing, social interaction, and grooming. Finally, ActBeCalf was used for behavior classification with machine learning models: (i)two classes of behaviors, [active and inactive; model 1] and (ii)four classes of behaviors [running, lying, drinking milk, and 'other' class; model 2] to demonstrate its reliability. We got a balanced accuracy of 92% [model1] and 84% [model2]. ActBeCalf is a comprehensive and ready-to-use dataset for classifying pre-weaned calf behaviour from the acceleration time series.

9/4/2024

BirdSet: A Multi-Task Benchmark for Classification in Computational Avian Bioacoustics

Lukas Rauch, Raphael Schwinger, Moritz Wirth, Ren'e Heinrich, Denis Huseljic, Jonas Lange, Stefan Kahl, Bernhard Sick, Sven Tomforde, Christoph Scholz

Deep learning (DL) models have emerged as a powerful tool in avian bioacoustics to assess environmental health. To maximize the potential of cost-effective and minimal-invasive passive acoustic monitoring (PAM), DL models must analyze bird vocalizations across a wide range of species and environmental conditions. However, data fragmentation challenges a comprehensive evaluation of generalization performance. Therefore, we introduce the BirdSet dataset, comprising approximately 520,000 global bird recordings for training and over 400 hours of PAM recordings for testing. Our benchmark offers baselines for several DL models to enhance comparability and consolidate research across studies, along with code implementations that include comprehensive training and evaluation protocols.

6/18/2024

🔄

Development of a digital tool for monitoring the behaviour of pre-weaned calves using accelerometer neck-collars

Oshana Dissanayake (UCD), Sarah E. Mcpherson (Teagasc, WUR), Joseph Allyndr'ee (UCD), Emer Kennedy (Teagasc), P'adraig Cunningham (UCD), Lucile Riaboff (GenPhySE, INRAE)

Automatic monitoring of calf behaviour is a promising way of assessing animal welfare from their first week on farms. This study aims to (i) develop machine learning models from accelerometer data to classify the main behaviours of pre-weaned calves and (ii) set up a digital tool for monitoring the behaviour of pre-weaned calves from the models' prediction. Thirty pre-weaned calves were equipped with a 3-D accelerometer attached to a neck-collar for two months and filmed simultaneously. The behaviours were annotated, resulting in 27.4 hours of observation aligned with the accelerometer data. The time-series were then split into 3 seconds windows. Two machine learning models were tuned using data from 80% of the calves: (i) a Random Forest model to classify between active and inactive behaviours using a set of 11 hand-craft features [model 1] and (ii) a RidgeClassifierCV model to classify between lying, running, drinking milk and other behaviours using ROCKET features [model 2]. The performance of the models was tested using data from the remaining 20% of the calves. Model 1 achieved a balanced accuracy of 0.92. Model 2 achieved a balanced accuracy of 0.84. Behavioural metrics such as daily activity ratio and episodes of running, lying, drinking milk, and other behaviours expressed over time were deduced from the predictions. All the development was finally embedded into a Python dashboard so that the individual calf metrics could be displayed directly from the raw accelerometer files.

6/26/2024