Don't forget to put the milk back! Dataset for Enabling Embodied Agents to Detect Anomalous Situations

Read original: arXiv:2404.08827 - Published 4/16/2024 by James F. Mullen Jr, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan

Don't forget to put the milk back! Dataset for Enabling Embodied Agents to Detect Anomalous Situations

Overview

This paper introduces a new dataset called "Don't forget to put the milk back!" to enable embodied agents to detect anomalous situations.
The dataset consists of videos of everyday household tasks, with some videos containing anomalous events like forgetting to put milk back in the fridge.
The goal is to develop AI systems that can detect these types of anomalies in real-world environments, which could have applications in assistive robotics and smart home technologies.

Plain English Explanation

The researchers behind this paper have created a new dataset to help teach AI systems how to identify unusual or abnormal situations that might occur in a home. The dataset contains videos of people doing everyday household tasks, like putting away groceries or making a meal.

In some of the videos, the person makes a small mistake, like forgetting to put the milk back in the fridge. The researchers want to use this dataset to train AI assistants and robots to recognize when things are not quite right, so they can remind the person or take action to fix the problem.

This could be really useful for things like smart home systems that can detect if you've left the oven on or forgotten to lock the door. It could also help assistive robots provide more useful support to elderly or disabled people by noticing when they've forgotten a step in their daily routine.

The key innovation here is creating a dataset that captures these types of minor but important anomalies, to give AI systems the training data they need to get better at perceiving and responding to the subtle details of everyday life.

Technical Explanation

The paper introduces the "Don't Forget to Put the Milk Back!" (DFMB) dataset, which consists of videos of people performing common household tasks like unloading groceries, preparing a meal, or tidying up a kitchen. Crucially, some of the videos contain anomalous events, such as the person forgetting to put the milk back in the fridge after use.

The dataset was collected by having participants act out various household scenarios, both with and without these types of anomalous events. The videos were annotated to indicate the timestamps where the anomalies occurred. In total, the DFMB dataset contains over 200 videos, with around 25% featuring an anomalous event.

The researchers envision this dataset being used to train embodied AI agents, such as robots or smart home assistants, to detect when something out of the ordinary has happened. By learning to recognize subtle cues that indicate an anomaly, these systems could then provide helpful reminders or take corrective action.

Compared to existing anomaly detection datasets, the DFMB dataset has a few key advantages. First, it focuses on the types of everyday, low-level anomalies that are most relevant for assistive technologies in home environments. Second, the videos feature a diverse set of participants and environments, improving the generalizability of the models trained on this data.

Critical Analysis

One potential limitation of the DFMB dataset is the relatively small overall size, with just over 200 videos. While this may be sufficient for initial model training and evaluation, larger datasets would be needed to develop more robust and generalizable anomaly detection capabilities.

Additionally, the dataset only covers a subset of household tasks and anomalies. There may be value in expanding the scope to include a wider range of everyday activities and potential anomalies, such as forgetting to turn off lights or appliances, misplacing keys or wallets, or spilling liquids.

The paper also does not provide much insight into the specific types of machine learning models or anomaly detection techniques that were explored. Further details on the experimental setup and results would be helpful for researchers looking to build upon this work.

Overall, the DFMB dataset represents a valuable contribution to the field of embodied AI and assistive technologies. By focusing on the types of subtle, context-dependent anomalies that are challenging for current systems to detect, this dataset opens up new avenues for developing more capable and helpful AI assistants.

Conclusion

The "Don't Forget to Put the Milk Back!" dataset introduced in this paper provides a new benchmark for training and evaluating AI systems that can detect anomalous situations in real-world, household environments.

By capturing a diverse set of everyday tasks and scenarios, including small but important mistakes like forgetting to put away groceries, this dataset aims to push the boundaries of what embodied AI agents can perceive and respond to.

The potential applications of this technology are wide-ranging, from smart home assistants that can help elderly or disabled individuals maintain their independence, to robotic helpers that can provide proactive support in daily living tasks. As the field of embodied AI continues to advance, datasets like DFMB will be crucial for developing systems that can truly understand and engage with the nuances of human behavior.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Don't forget to put the milk back! Dataset for Enabling Embodied Agents to Detect Anomalous Situations

James F. Mullen Jr, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadan

Home robots intend to make their users lives easier. Our work assists in this goal by enabling robots to inform their users of dangerous or unsanitary anomalies in their home. Some examples of these anomalies include the user leaving their milk out, forgetting to turn off the stove, or leaving poison accessible to children. To move towards enabling home robots with these abilities, we have created a new dataset, which we call SafetyDetect. The SafetyDetect dataset consists of 1000 anomalous home scenes, each of which contains unsafe or unsanitary situations for an agent to detect. Our approach utilizes large language models (LLMs) alongside both a graph representation of the scene and the relationships between the objects in the scene. Our key insight is that this connected scene graph and the object relationships it encodes enables the LLM to better reason about the scene -- especially as it relates to detecting dangerous or unsanitary situations. Our most promising approach utilizes GPT-4 and pursues a categorization technique where object relations from the scene graph are classified as normal, dangerous, unsanitary, or dangerous for children. This method is able to correctly identify over 90% of anomalous scenarios in the SafetyDetect Dataset. Additionally, we conduct real world experiments on a ClearPath TurtleBot where we generate a scene graph from visuals of the real world scene, and run our approach with no modification. This setup resulted in little performance loss. The SafetyDetect Dataset and code will be released to the public upon this papers publication.

4/16/2024

Make Your Home Safe: Time-aware Unsupervised User Behavior Anomaly Detection in Smart Homes via Loss-guided Mask

Jingyu Xiao, Zhiyao Xu, Qingsong Zou, Qing Li, Dan Zhao, Dong Fang, Ruoyu Li, Wenxin Tang, Kang Li, Xudong Zuo, Penghui Hu, Yong Jiang, Zixuan Weng, Michael R. Lyv

Smart homes, powered by the Internet of Things, offer great convenience but also pose security concerns due to abnormal behaviors, such as improper operations of users and potential attacks from malicious attackers. Several behavior modeling methods have been proposed to identify abnormal behaviors and mitigate potential risks. However, their performance often falls short because they do not effectively learn less frequent behaviors, consider temporal context, or account for the impact of noise in human behaviors. In this paper, we propose SmartGuard, an autoencoder-based unsupervised user behavior anomaly detection framework. First, we design a Loss-guided Dynamic Mask Strategy (LDMS) to encourage the model to learn less frequent behaviors, which are often overlooked during learning. Second, we propose a Three-level Time-aware Position Embedding (TTPE) to incorporate temporal information into positional embedding to detect temporal context anomaly. Third, we propose a Noise-aware Weighted Reconstruction Loss (NWRL) that assigns different weights for routine behaviors and noise behaviors to mitigate the interference of noise behaviors during inference. Comprehensive experiments on three datasets with ten types of anomaly behaviors demonstrates that SmartGuard consistently outperforms state-of-the-art baselines and also offers highly interpretable results.

6/19/2024

🏋️

Towards Robust Training Datasets for Machine Learning with Ontologies: A Case Study for Emergency Road Vehicle Detection

Lynn Vonderhaar, Timothy Elvira, Tyler Procko, Omar Ochoa

Countless domains rely on Machine Learning (ML) models, including safety-critical domains, such as autonomous driving, which this paper focuses on. While the black box nature of ML is simply a nuisance in some domains, in safety-critical domains, this makes ML models difficult to trust. To fully utilize ML models in safety-critical domains, it would be beneficial to have a method to improve trust in model robustness and accuracy without human experts checking each decision. This research proposes a method to increase trust in ML models used in safety-critical domains by ensuring the robustness and completeness of the model's training dataset. Because ML models embody what they are trained with, ensuring the completeness of training datasets can help to increase the trust in the training of ML models. To this end, this paper proposes the use of a domain ontology and an image quality characteristic ontology to validate the domain completeness and image quality robustness of a training dataset. This research also presents an experiment as a proof of concept for this method, where ontologies are built for the emergency road vehicle domain.

6/24/2024

SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety

Paul Rottger, Fabio Pernisi, Bertie Vidgen, Dirk Hovy

The last two years have seen a rapid growth in concerns around the safety of large language models (LLMs). Researchers and practitioners have met these concerns by introducing an abundance of new datasets for evaluating and improving LLM safety. However, much of this work has happened in parallel, and with very different goals in mind, ranging from the mitigation of near-term risks around bias and toxic content generation to the assessment of longer-term catastrophic risk potential. This makes it difficult for researchers and practitioners to find the most relevant datasets for a given use case, and to identify gaps in dataset coverage that future work may fill. To remedy these issues, we conduct a first systematic review of open datasets for evaluating and improving LLM safety. We review 102 datasets, which we identified through an iterative and community-driven process over the course of several months. We highlight patterns and trends, such as a a trend towards fully synthetic datasets, as well as gaps in dataset coverage, such as a clear lack of non-English datasets. We also examine how LLM safety datasets are used in practice -- in LLM release publications and popular LLM benchmarks -- finding that current evaluation practices are highly idiosyncratic and make use of only a small fraction of available datasets. Our contributions are based on SafetyPrompts.com, a living catalogue of open datasets for LLM safety, which we commit to updating continuously as the field of LLM safety develops.

4/9/2024