Naming the Pain in Machine Learning-Enabled Systems Engineering

2406.04359

Published 6/10/2024 by Marcos Kalinowski, Daniel Mendez, Gorkem Giray, Antonio Pedro Santos Alves, Kelly Azevedo, Tatiana Escovedo, Hugo Villamizar, Helio Lopes, Teresa Baldassarre, Stefan Wagner and 5 others

cs.SE cs.AI

📊

Abstract

Context: Machine learning (ML)-enabled systems are being increasingly adopted by companies aiming to enhance their products and operational processes. Objective: This paper aims to deliver a comprehensive overview of the current status quo of engineering ML-enabled systems and lay the foundation to steer practically relevant and problem-driven academic research. Method: We conducted an international survey to collect insights from practitioners on the current practices and problems in engineering ML-enabled systems. We received 188 complete responses from 25 countries. We conducted quantitative statistical analyses on contemporary practices using bootstrapping with confidence intervals and qualitative analyses on the reported problems using open and axial coding procedures. Results: Our survey results reinforce and extend existing empirical evidence on engineering ML-enabled systems, providing additional insights into typical ML-enabled systems project contexts, the perceived relevance and complexity of ML life cycle phases, and current practices related to problem understanding, model deployment, and model monitoring. Furthermore, the qualitative analysis provides a detailed map of the problems practitioners face within each ML life cycle phase and the problems causing overall project failure. Conclusions: The results contribute to a better understanding of the status quo and problems in practical environments. We advocate for the further adaptation and dissemination of software engineering practices to enhance the engineering of ML-enabled systems.

Create account to get full access

Overview

This paper presents the results of an international survey on the current practices and problems in engineering machine learning (ML)-enabled systems.
The survey collected insights from 188 practitioners across 25 countries, providing a comprehensive overview of the status quo in this field.
The study explores typical ML-enabled systems project contexts, the perceived relevance and complexity of ML life cycle phases, and current practices related to problem understanding, model deployment, and model monitoring.
Qualitative analysis also identifies the key problems practitioners face within each ML life cycle phase and the problems causing overall project failure.

Plain English Explanation

The paper explores the current state of engineering machine learning (ML)-enabled systems. Companies are increasingly adopting ML to enhance their products and operations, but there are still many challenges in actually building these systems.

The researchers conducted a survey of 188 practitioners from 25 countries to understand the typical contexts, practices, and problems they face. They looked at things like how relevant and complex the different stages of the ML development process are perceived to be, as well as current approaches to understanding the problem, deploying the model, and monitoring it after deployment.

The survey results provide a detailed picture of the current state of ML engineering, including the key issues that lead to overall project failure. This helps identify areas where more software engineering best practices need to be adapted and applied to improve the development of ML-powered systems.

Technical Explanation

The researchers conducted an international survey to collect insights from 188 practitioners across 25 countries on the current practices and problems in engineering ML-enabled systems. They used a combination of quantitative statistical analyses, using bootstrapping with confidence intervals, and qualitative analyses, using open and axial coding procedures, to explore the survey results.

The quantitative analysis looked at typical ML-enabled systems project contexts, the perceived relevance and complexity of different ML life cycle phases, and current practices related to problem understanding, model deployment, and model monitoring. The qualitative analysis then provided a detailed map of the specific problems practitioners face within each ML life cycle phase, as well as the overarching issues that lead to overall project failure.

The results reinforce and extend existing empirical evidence on engineering ML-enabled systems, offering new insights into the realities of practical ML deployment. The study highlights the need for better integration of software engineering practices to enhance the development and deployment of ML-powered applications.

Critical Analysis

The paper provides a comprehensive and empirically grounded overview of the current state of engineering ML-enabled systems, which is a valuable contribution to the field. However, the survey-based methodology has some inherent limitations:

The sample size, while relatively large, may not be fully representative of the entire ML engineering practitioner population.
The qualitative analysis, while thorough, relies on the researchers' interpretation of the open-ended survey responses, which could be subject to biases.
The study focuses on general trends and problems, but does not delve deeply into the nuances and contextual factors that may influence ML engineering practices in different industries or organizations.

Additionally, the paper does not address the potential ethical and societal implications of the widespread deployment of ML-enabled systems, which is an important consideration that warrants further research and discussion.

Conclusion

This study offers a detailed snapshot of the current practices and problems in engineering ML-enabled systems, based on insights from a large, international sample of practitioners. The findings reinforce the need for a stronger integration of software engineering best practices to enhance the development and deployment of ML-powered applications.

The results contribute to a better understanding of the status quo and practical challenges in this rapidly evolving field, providing a foundation for future research and the development of more effective ML engineering methodologies. As ML-enabled systems become increasingly ubiquitous, addressing the real-world problems identified in this study will be crucial for realizing the full potential of ML technologies while mitigating the risks and challenges of their deployment.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🗣️

A Systematic Literature Review on the Use of Machine Learning in Software Engineering

Nyaga Fred, I. O. Temkin

Software engineering (SE) is a dynamic field that involves multiple phases all of which are necessary to develop sustainable software systems. Machine learning (ML), a branch of artificial intelligence (AI), has drawn a lot of attention in recent years thanks to its ability to analyze massive volumes of data and extract useful patterns from data. Several studies have focused on examining, categorising, and assessing the application of ML in SE processes. We conducted a literature review on primary studies to address this gap. The study was carried out following the objective and the research questions to explore the current state of the art in applying machine learning techniques in software engineering processes. The review identifies the key areas within software engineering where ML has been applied, including software quality assurance, software maintenance, software comprehension, and software documentation. It also highlights the specific ML techniques that have been leveraged in these domains, such as supervised learning, unsupervised learning, and deep learning. Keywords: machine learning, deep learning, software engineering, natural language processing, source code

6/21/2024

cs.SE cs.LG

🏷️

Beyond development: Challenges in deploying machine learning models for structural engineering applications

Mohsen Zaker Esteghamati, Brennan Bean, Henry V. Burton, M. Z. Naser

Machine learning (ML)-based solutions are rapidly changing the landscape of many fields, including structural engineering. Despite their promising performance, these approaches are usually only demonstrated as proof-of-concept in structural engineering, and are rarely deployed for real-world applications. This paper aims to illustrate the challenges of developing ML models suitable for deployment through two illustrative examples. Among various pitfalls, the presented discussion focuses on model overfitting and underspecification, training data representativeness, variable omission bias, and cross-validation. The results highlight the importance of implementing rigorous model validation techniques through adaptive sampling, careful physics-informed feature selection, and considerations of both model complexity and generalizability.

4/22/2024

cs.LG cs.CE stat.ML

🏋️

Machine Learning-Enabled Software and System Architecture Frameworks

Armin Moin, Atta Badii, Stephan Gunnemann, Moharram Challenger

Various architecture frameworks for software, systems, and enterprises have been proposed in the literature. They identified several stakeholders and defined modeling perspectives, architecture viewpoints, and views to frame and address stakeholder concerns. However, the stakeholders with data science and Machine Learning (ML) related concerns, such as data scientists and data engineers, are yet to be included in existing architecture frameworks. Only this way can we envision a holistic system architecture description of an ML-enabled system. Note that the ML component behavior and functionalities are special and should be distinguished from traditional software system behavior and functionalities. The main reason is that the actual functionality should be inferred from data instead of being specified at design time. Additionally, the structural models of ML components, such as ML model architectures, are typically specified using different notations and formalisms from what the Software Engineering (SE) community uses for software structural models. Yet, these two aspects, namely ML and non-ML, are becoming so intertwined that it necessitates an extension of software architecture frameworks and modeling practices toward supporting ML-enabled system architectures. In this paper, we address this gap through an empirical study using an online survey instrument. We surveyed 61 subject matter experts from over 25 organizations in 10 countries.

6/28/2024

cs.SE cs.LG

Physics-Enhanced Machine Learning: a position paper for dynamical systems investigations

Alice Cicirello

This position paper takes a broad look at Physics-Enhanced Machine Learning (PEML) -- also known as Scientific Machine Learning -- with particular focus to those PEML strategies developed to tackle dynamical systems' challenges. The need to go beyond Machine Learning (ML) strategies is driven by: (i) limited volume of informative data, (ii) avoiding accurate-but-wrong predictions; (iii) dealing with uncertainties; (iv) providing Explainable and Interpretable inferences. A general definition of PEML is provided by considering four physics and domain knowledge biases, and three broad groups of PEML approaches are discussed: physics-guided, physics-encoded and physics-informed. The advantages and challenges in developing PEML strategies for guiding high-consequence decision making in engineering applications involving complex dynamical systems, are presented.

6/11/2024

cs.LG cs.CE