MetaFood3D: Large 3D Food Object Dataset with Nutrition Values

Read original: arXiv:2409.01966 - Published 9/4/2024 by Yuhao Chen, Jiangpeng He, Chris Czarnecki, Gautham Vinod, Talha Ibn Mahmud, Siddeshwar Raghavan, Jinge Ma, Dayou Mao, Saeejith Nair, Pengcheng Xi and 3 others

MetaFood3D: Large 3D Food Object Dataset with Nutrition Values

Overview

Large 3D dataset of food objects with detailed nutrition information
Aims to advance research in food recognition, dietary monitoring, and personalized nutrition
Includes over 10,000 high-quality 3D models of diverse food items with associated nutrient data

Plain English Explanation

The MetaFood3D dataset provides a comprehensive collection of detailed 3D models for a wide variety of food items, accompanied by their corresponding nutrition information. This resource is designed to drive progress in areas such as food recognition, dietary monitoring, and personalized nutrition.

By offering over 10,000 high-quality 3D food models, along with their detailed nutrient data, the dataset aims to enable researchers to develop more accurate and comprehensive solutions for tasks like recognizing different foods in images or estimating the portion sizes of meals. This data can also help advance the understanding of personalized dietary needs and support the creation of tailored nutrition recommendations.

Technical Explanation

The MetaFood3D dataset consists of over 10,000 high-fidelity 3D models of diverse food items, encompassing a wide range of categories such as fruits, vegetables, baked goods, and more. Each 3D model is accompanied by detailed nutrition information, including macronutrients (e.g., carbohydrates, proteins, fats) and micronutrients (e.g., vitamins, minerals).

The dataset was created using a combination of state-of-the-art 3D scanning techniques and carefully curated nutrition data from authoritative sources. The 3D models were generated using a physically-informed approach to ensure accurate geometric representations and realistic material properties.

Critical Analysis

The MetaFood3D dataset provides a valuable resource for advancing research in food-related computer vision and dietary applications. By offering a large, diverse, and well-annotated collection of 3D food models, the dataset has the potential to significantly improve the performance and robustness of food recognition and portion estimation algorithms.

However, the dataset's reliance on 3D scanning and modeling techniques may introduce some limitations. The accuracy and completeness of the 3D models and their associated nutrition data could be influenced by factors such as scanning quality, food preparation methods, and data sources. Additionally, the dataset may not fully capture the natural variability and context-dependent characteristics of real-world food items.

Researchers using the MetaFood3D dataset should be mindful of these potential limitations and consider complementing it with other datasets or real-world observations to ensure the robustness and generalizability of their solutions.

Conclusion

The MetaFood3D dataset represents a significant contribution to the field of food-related computer vision and dietary applications. By providing a large, diverse, and well-annotated collection of 3D food models with detailed nutrition information, the dataset has the potential to drive advancements in areas such as food recognition, dietary monitoring, and personalized nutrition. As researchers continue to explore and utilize this resource, it could lead to new insights and innovative solutions that improve our understanding and management of food-related health and wellness.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

MetaFood3D: Large 3D Food Object Dataset with Nutrition Values

Yuhao Chen, Jiangpeng He, Chris Czarnecki, Gautham Vinod, Talha Ibn Mahmud, Siddeshwar Raghavan, Jinge Ma, Dayou Mao, Saeejith Nair, Pengcheng Xi, Alexander Wong, Edward Delp, Fengqing Zhu

Food computing is both important and challenging in computer vision (CV). It significantly contributes to the development of CV algorithms due to its frequent presence in datasets across various applications, ranging from classification and instance segmentation to 3D reconstruction. The polymorphic shapes and textures of food, coupled with high variation in forms and vast multimodal information, including language descriptions and nutritional data, make food computing a complex and demanding task for modern CV algorithms. 3D food modeling is a new frontier for addressing food-related problems, due to its inherent capability to deal with random camera views and its straightforward representation for calculating food portion size. However, the primary hurdle in the development of algorithms for food object analysis is the lack of nutrition values in existing 3D datasets. Moreover, in the broader field of 3D research, there is a critical need for domain-specific test datasets. To bridge the gap between general 3D vision and food computing research, we propose MetaFood3D. This dataset consists of 637 meticulously labeled 3D food objects across 108 categories, featuring detailed nutrition information, weight, and food codes linked to a comprehensive nutrition database. The dataset emphasizes intra-class diversity and includes rich modalities such as textured mesh files, RGB-D videos, and segmentation masks. Experimental results demonstrate our dataset's significant potential for improving algorithm performance, highlight the challenging gap between video captures and 3D scanned data, and show the strength of the MetaFood3D dataset in high-quality data generation, simulation, and augmentation.

9/4/2024

MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction: Methods and Results

Jiangpeng He, Yuhao Chen, Gautham Vinod, Talha Ibn Mahmud, Fengqing Zhu, Edward Delp, Alexander Wong, Pengcheng Xi, Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva, Jiadong Tang, Dianyi Yang, Yu Gao, Zhaoxiang Liang, Yawei Jueluo, Chengyu Shi, Pengyu Wang

The increasing interest in computer vision applications for nutrition and dietary monitoring has led to the development of advanced 3D reconstruction techniques for food items. However, the scarcity of high-quality data and limited collaboration between industry and academia have constrained progress in this field. Building on recent advancements in 3D reconstruction, we host the MetaFood Workshop and its challenge for Physically Informed 3D Food Reconstruction. This challenge focuses on reconstructing volume-accurate 3D models of food items from 2D images, using a visible checkerboard as a size reference. Participants were tasked with reconstructing 3D models for 20 selected food items of varying difficulty levels: easy, medium, and hard. The easy level provides 200 images, the medium level provides 30 images, and the hard level provides only 1 image for reconstruction. In total, 16 teams submitted results in the final testing phase. The solutions developed in this challenge achieved promising results in 3D food reconstruction, with significant potential for improving portion estimation for dietary assessment and nutritional monitoring. More details about this workshop challenge and access to the dataset can be found at https://sites.google.com/view/cvpr-metafood-2024.

7/15/2024

🖼️

Leveraging Automatic Personalised Nutrition: Food Image Recognition Benchmark and Dataset based on Nutrition Taxonomy

Sergio Romero-Tapiador, Ruben Tolosana, Aythami Morales, Julian Fierrez, Ruben Vera-Rodriguez, Isabel Espinosa-Salinas, Gala Freixer, Enrique Carrillo de Santa Pau, Ana Ram'irez de Molina, Javier Ortega-Garcia

Maintaining a healthy lifestyle has become increasingly challenging in today's sedentary society marked by poor eating habits. To address this issue, both national and international organisations have made numerous efforts to promote healthier diets and increased physical activity. However, implementing these recommendations in daily life can be difficult, as they are often generic and not tailored to individuals. This study presents the AI4Food-NutritionDB database, the first nutrition database that incorporates food images and a nutrition taxonomy based on recommendations by national and international health authorities. The database offers a multi-level categorisation, comprising 6 nutritional levels, 19 main categories (e.g., Meat), 73 subcategories (e.g., White Meat), and 893 specific food products (e.g., Chicken). The AI4Food-NutritionDB opens the doors to new food computing approaches in terms of food intake frequency, quality, and categorisation. Also, we present a standardised experimental protocol and benchmark including three tasks based on the nutrition taxonomy (i.e., category, subcategory, and final product recognition). These resources are available to the research community, including our deep learning models trained on AI4Food-NutritionDB, which can serve as pre-trained models, achieving accurate recognition results for challenging food image databases.

4/22/2024

Food Portion Estimation via 3D Object Scaling

Gautham Vinod, Jiangpeng He, Zeman Shao, Fengqing Zhu

Image-based methods to analyze food images have alleviated the user burden and biases associated with traditional methods. However, accurate portion estimation remains a major challenge due to the loss of 3D information in the 2D representation of foods captured by smartphone cameras or wearable devices. In this paper, we propose a new framework to estimate both food volume and energy from 2D images by leveraging the power of 3D food models and physical reference in the eating scene. Our method estimates the pose of the camera and the food object in the input image and recreates the eating occasion by rendering an image of a 3D model of the food with the estimated poses. We also introduce a new dataset, SimpleFood45, which contains 2D images of 45 food items and associated annotations including food volume, weight, and energy. Our method achieves an average error of 31.10 kCal (17.67%) on this dataset, outperforming existing portion estimation methods.

4/19/2024