FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation

2403.10916

Published 7/1/2024 by Moseli Mots'oehli, Anton Nikolaev, Wawan B. IGede, John Lynham, Peter J. Mous, Peter Sadowski

🤿

Abstract

Fish stock assessment often involves manual fish counting by taxonomy specialists, which is both time-consuming and costly. We propose FishNet, an automated computer vision system for both taxonomic classification and fish size estimation from images captured with a low-cost digital camera. The system first performs object detection and segmentation using a Mask R-CNN to identify individual fish from images containing multiple fish, possibly consisting of different species. Then each fish species is classified and the length is predicted using separate machine learning models. To develop the model, we use a dataset of 300,000 hand-labeled images containing 1.2M fish of 163 different species and ranging in length from 10cm to 250cm, with additional annotations and quality control methods used to curate high-quality training data. On held-out test data sets, our system achieves a 92% intersection over union on the fish segmentation task, a 89% top-1 classification accuracy on single fish species classification, and a 2.3cm mean absolute error on the fish length estimation task.

Create account to get full access

Overview

This paper presents a template for citing AI research papers in the PRIME AI style, which includes key details like author names, title, page numbers, and DOI.
The template is generated using LaTeXML, a tool for converting LaTeX documents to various output formats.
The paper provides a structured layout with sections, subsections, and links to navigate the content.

Plain English Explanation

This paper provides a template or example for how to properly cite and format the bibliographic information for an AI research paper. [The template is available at https://aimodels.fyi/papers/arxiv/low-cost-machine-vision-insect-classification.] It includes the standard components you would expect to see in a citation, like the author names, paper title, page numbers, and a unique digital object identifier (DOI).

The template was generated using a tool called LaTeXML, which can convert documents written in the LaTeX typesetting language into other formats, like HTML. This allows the citation information to be presented in a clean, structured way with headings, links, and other formatting to make it easy to read and navigate.

Overall, this paper is simply offering a standardized way to properly cite AI research papers that follows best practices and industry conventions. Having a consistent citation format makes it easier for researchers, readers, and search engines to find and reference the original work.

Technical Explanation

The paper presents a template for formatting bibliographic citations in the PRIME AI style. It includes sections for the paper title, author names, page numbers, and a DOI (digital object identifier) that provides a unique reference to the published work.

The template is generated using LaTeXML, a tool for converting LaTeX documents - a common typesetting language used in academic publishing - into other formats like HTML. This allows the citation information to be presented in a structured, hyperlinked format that is easy for readers to navigate.

The document includes various HTML elements and Bootstrap CSS styling to create a visually appealing and responsive layout. Sections are delineated using heading tags (e.g. <h1>, <h2>) and a table of contents is provided to allow easy navigation between different parts of the paper.

Overall, this template provides a standardized way to properly cite AI research papers in a format that aligns with industry best practices and conventions. By using consistent citation styles, it becomes easier for researchers, readers, and search engines to locate and reference the original work.

Critical Analysis

The template provided in this paper offers a clear and structured way to cite AI research papers. By following a consistent format, it helps ensure that key bibliographic details are properly documented and accessible to readers.

One potential limitation is that the template is specific to the PRIME AI citation style. While this is a common format, there are other citation styles (e.g. APA, IEEE) that may be preferred in certain research domains or publications. It would be helpful if the paper also provided guidance on adapting the template to other citation conventions.

Additionally, the template is currently generated using LaTeXML, which may not be the most accessible tool for all researchers. Providing alternative methods for generating the citation, such as a simple web-based form or integration with reference management software, could make the template more widely usable.

Overall, this paper provides a valuable contribution by offering a standardized citation format for AI research. With some additional flexibility and accessibility features, the template could be an even more useful resource for the research community.

Conclusion

This paper presents a template for citing AI research papers in the PRIME AI style. The template includes key bibliographic details like author names, paper title, page numbers, and a DOI, all formatted in a structured and hyperlinked layout.

The use of a standardized citation format helps ensure that AI research is properly documented and easily referenced by other researchers, readers, and search engines. While the template is specific to the PRIME AI style, the general principles it demonstrates could be adapted to other common citation conventions.

Overall, this paper provides a valuable contribution to the AI research community by offering a clear and consistent way to cite published works. By following best practices for bibliographic citations, researchers can improve the discoverability and accessibility of their important contributions to the field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

👀

Low Cost Machine Vision for Insect Classification

Danja Brandt, Martin Tschaikner, Teodor Chiaburu, Henning Schmidt, Ilona Schrimpf, Alexandra Stadel, Ingeborg E. Beckers, Frank Hau{ss}er

Preserving the number and diversity of insects is one of our society's most important goals in the area of environmental sustainability. A prerequisite for this is a systematic and up-scaled monitoring in order to detect correlations and identify countermeasures. Therefore, automatized monitoring using live traps is important, but so far there is no system that provides image data of sufficient detailed information for entomological classification. In this work, we present an imaging method as part of a multisensor system developed as a low-cost, scalable, open-source system that is adaptable to classical trap types. The image quality meets the requirements needed for classification in the taxonomic tree. Therefore, illumination and resolution have been optimized and motion artefacts have been suppressed. The system is evaluated exemplarily on a dataset consisting of 16 insect species of the same as well as different genus, family and order. We demonstrate that standard CNN-architectures like ResNet50 (pretrained on iNaturalist data) or MobileNet perform very well for the prediction task after re-training. Smaller custom made CNNs also lead to promising results. Classification accuracy of $>96%$ has been achieved. Moreover, it was proved that image cropping of insects is necessary for classification of species with high inter-class similarity.

4/29/2024

cs.CV cs.LG

Vision-Based Approach for Food Weight Estimation from 2D Images

Chathura Wimalasiri, Prasan Kumar Sahoo

In response to the increasing demand for efficient and non-invasive methods to estimate food weight, this paper presents a vision-based approach utilizing 2D images. The study employs a dataset of 2380 images comprising fourteen different food types in various portions, orientations, and containers. The proposed methodology integrates deep learning and computer vision techniques, specifically employing Faster R-CNN for food detection and MobileNetV3 for weight estimation. The detection model achieved a mean average precision (mAP) of 83.41%, an average Intersection over Union (IoU) of 91.82%, and a classification accuracy of 100%. For weight estimation, the model demonstrated a root mean squared error (RMSE) of 6.3204, a mean absolute percentage error (MAPE) of 0.0640%, and an R-squared value of 98.65%. The study underscores the potential applications of this technology in healthcare for nutrition counseling, fitness and wellness for dietary intake assessment, and smart food storage solutions to reduce waste. The results indicate that the combination of Faster R-CNN and MobileNetV3 provides a robust framework for accurate food weight estimation from 2D images, showcasing the synergy of computer vision and deep learning in practical applications.

5/28/2024

cs.CV cs.AI

Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos

Duc Pham, Matthew Hansen, F'elicie Dhellemmens, Jens Krause, Pia Bideau

Easily accessible sensors, like drones with diverse onboard sensors, have greatly expanded studying animal behavior in natural environments. Yet, analyzing vast, unlabeled video data, often spanning hours, remains a challenge for machine learning, especially in computer vision. Existing approaches often analyze only a few frames. Our focus is on long-term animal behavior analysis. To address this challenge, we utilize classical probabilistic methods for state estimation, such as particle filtering. By incorporating recent advancements in semantic object segmentation, we enable continuous tracking of rapidly evolving object formations, even in scenarios with limited data availability. Particle filters offer a provably optimal algorithmic structure for recursively adding new incoming information. We propose a novel approach for tracking schools of fish in the open ocean from drone videos. Our framework not only performs classical object tracking in 2D, instead it tracks the position and spatial expansion of the fish school in world coordinates by fusing video data and the drone's on board sensor information (GPS and IMU). The presented framework for the first time allows researchers to study collective behavior of fish schools in its natural social and environmental context in a non-invasive and scalable way.

6/13/2024

cs.CV

Fishnets: Information-Optimal, Scalable Aggregation for Sets and Graphs

T. Lucas Makinen, Justin Alsing, Benjamin D. Wandelt

Set-based learning is an essential component of modern deep learning and network science. Graph Neural Networks (GNNs) and their edge-free counterparts Deepsets have proven remarkably useful on ragged and topologically challenging datasets. The key to learning informative embeddings for set members is a specified aggregation function, usually a sum, max, or mean. We propose Fishnets, an aggregation strategy for learning information-optimal embeddings for sets of data for both Bayesian inference and graph aggregation. We demonstrate that i) Fishnets neural summaries can be scaled optimally to an arbitrary number of data objects, ii) Fishnets aggregations are robust to changes in data distribution, unlike standard deepsets, iii) Fishnets saturate Bayesian information content and extend to regimes where MCMC techniques fail and iv) Fishnets can be used as a drop-in aggregation scheme within GNNs. We show that by adopting a Fishnets aggregation scheme for message passing, GNNs can achieve state-of-the-art performance versus architecture size on ogbn-protein data over existing benchmarks with a fraction of learnable parameters and faster training time.

7/1/2024

cs.LG stat.ML