Beyond development: Challenges in deploying machine learning models for structural engineering applications

2404.12544

Published 4/22/2024 by Mohsen Zaker Esteghamati, Brennan Bean, Henry V. Burton, M. Z. Naser

🏷️

Abstract

Machine learning (ML)-based solutions are rapidly changing the landscape of many fields, including structural engineering. Despite their promising performance, these approaches are usually only demonstrated as proof-of-concept in structural engineering, and are rarely deployed for real-world applications. This paper aims to illustrate the challenges of developing ML models suitable for deployment through two illustrative examples. Among various pitfalls, the presented discussion focuses on model overfitting and underspecification, training data representativeness, variable omission bias, and cross-validation. The results highlight the importance of implementing rigorous model validation techniques through adaptive sampling, careful physics-informed feature selection, and considerations of both model complexity and generalizability.

Create account to get full access

Overview

This paper discusses the challenges of developing machine learning (ML) models that are suitable for real-world deployment in the field of structural engineering.
The authors highlight several common pitfalls, including model overfitting, underspecification, issues with training data representativeness, and cross-validation problems.
The paper emphasizes the importance of rigorous model validation techniques, such as adaptive sampling, careful feature selection, and considerations of model complexity and generalizability.

Plain English Explanation

Machine learning (ML) is a powerful tool that is transforming many industries, including structural engineering. However, despite the promising performance of ML-based solutions, they are often only demonstrated as proof-of-concept and rarely deployed for real-world applications.

This paper explores the challenges of developing ML models that can be effectively used in the field of structural engineering. The authors discuss several common issues that can prevent ML models from being successfully deployed, such as:

Overfitting: The ML model may perform well on the training data but fail to generalize to new, unseen data. This can happen when the model is too complex and "memorizes" the training data rather than learning the underlying patterns.
Underspecification: The ML model may be missing important factors or variables that are relevant to the problem, leading to inaccurate or unreliable predictions.
Training data representativeness: The data used to train the ML model may not be representative of the real-world situations the model will be used in, resulting in poor performance.
Cross-validation issues: The way the data is split and used for training, validation, and testing can significantly impact the model's performance and generalizability.

To address these challenges, the paper emphasizes the importance of implementing rigorous model validation techniques. This includes:

Adaptive sampling to ensure the training data is representative of the real-world scenarios.
Careful selection of physics-informed features to ensure the model is capturing the most relevant factors.
Considering both model complexity and generalizability to strike the right balance between performance and real-world applicability.

By addressing these challenges, the authors aim to help researchers and practitioners develop ML models that can be successfully deployed in the field of structural engineering, and potentially in other emerging domains as well.

Technical Explanation

The paper presents two illustrative examples to highlight the challenges of developing ML models suitable for real-world deployment in structural engineering. The authors focus on several key issues:

Overfitting and Underspecification: The authors demonstrate how an ML model can overfit to the training data, leading to poor performance on unseen test data. They also show how the omission of important variables can result in underspecified models that fail to capture the underlying relationships in the data.

Training Data Representativeness: The authors emphasize the importance of having training data that is representative of the real-world scenarios the ML model will be used in. They demonstrate how issues with data representativeness can lead to biased and unreliable model predictions.

Cross-Validation: The paper discusses how the choice of cross-validation strategy can significantly impact the performance and generalizability of the ML model. The authors highlight the potential pitfalls of using traditional cross-validation approaches in the context of structural engineering problems.

To address these challenges, the authors recommend a range of model validation techniques, including adaptive sampling, careful feature selection, and considerations of both model complexity and generalizability. The results suggest that implementing these rigorous validation methods is crucial for developing ML models that can be successfully deployed in real-world structural engineering applications.

Critical Analysis

The paper provides a valuable and timely discussion of the challenges faced in developing ML models for real-world deployment in structural engineering. The authors do an excellent job of highlighting several common pitfalls that are often overlooked or underestimated, such as model overfitting, underspecification, and issues with training data representativeness.

One strength of the paper is the use of illustrative examples to demonstrate these challenges in a clear and accessible way. This helps readers understand the practical implications of these issues and the importance of addressing them.

However, the paper could have delved deeper into some of the potential solutions or best practices for overcoming these challenges. While the authors mention techniques like adaptive sampling and careful feature selection, more details on the implementation and effectiveness of these approaches would have been useful.

Additionally, the paper does not discuss the potential trade-offs or challenges in balancing model complexity, performance, and generalizability. This is an important consideration that could have been explored further.

Overall, the paper provides a solid foundation for understanding the challenges of developing ML models for real-world structural engineering applications. The insights and recommendations presented can serve as a valuable starting point for researchers and practitioners looking to address these issues and successfully deploy ML-based solutions in the field.

Conclusion

This paper highlights the importance of rigorous model validation and careful consideration of common pitfalls when developing machine learning (ML) models for real-world deployment in structural engineering. The authors demonstrate that despite the promising performance of ML-based solutions, there are significant challenges that must be addressed to ensure these models can be effectively used in practice.

By focusing on issues like model overfitting, underspecification, training data representativeness, and cross-validation problems, the paper provides valuable insights for researchers and practitioners working to advance the use of ML in structural engineering and other emerging domains. The recommended validation techniques, such as adaptive sampling, feature selection, and considerations of model complexity and generalizability, offer a roadmap for developing ML models that can be successfully deployed in the field.

As ML continues to transform various industries, this paper underscores the importance of addressing the practical challenges that can hinder the real-world application of these powerful tools. By learning from the insights presented here, the structural engineering community and beyond can work towards bridging the gap between the promise of ML and its effective deployment in complex, high-stakes domains.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

📊

Naming the Pain in Machine Learning-Enabled Systems Engineering

Marcos Kalinowski, Daniel Mendez, Gorkem Giray, Antonio Pedro Santos Alves, Kelly Azevedo, Tatiana Escovedo, Hugo Villamizar, Helio Lopes, Teresa Baldassarre, Stefan Wagner, Stefan Biffl, Jurgen Musil, Michael Felderer, Niklas Lavesson, Tony Gorschek

Context: Machine learning (ML)-enabled systems are being increasingly adopted by companies aiming to enhance their products and operational processes. Objective: This paper aims to deliver a comprehensive overview of the current status quo of engineering ML-enabled systems and lay the foundation to steer practically relevant and problem-driven academic research. Method: We conducted an international survey to collect insights from practitioners on the current practices and problems in engineering ML-enabled systems. We received 188 complete responses from 25 countries. We conducted quantitative statistical analyses on contemporary practices using bootstrapping with confidence intervals and qualitative analyses on the reported problems using open and axial coding procedures. Results: Our survey results reinforce and extend existing empirical evidence on engineering ML-enabled systems, providing additional insights into typical ML-enabled systems project contexts, the perceived relevance and complexity of ML life cycle phases, and current practices related to problem understanding, model deployment, and model monitoring. Furthermore, the qualitative analysis provides a detailed map of the problems practitioners face within each ML life cycle phase and the problems causing overall project failure. Conclusions: The results contribute to a better understanding of the status quo and problems in practical environments. We advocate for the further adaptation and dissemination of software engineering practices to enhance the engineering of ML-enabled systems.

6/10/2024

cs.SE cs.AI

🏋️

Machine Learning-Enabled Software and System Architecture Frameworks

Armin Moin, Atta Badii, Stephan Gunnemann, Moharram Challenger

Various architecture frameworks for software, systems, and enterprises have been proposed in the literature. They identified several stakeholders and defined modeling perspectives, architecture viewpoints, and views to frame and address stakeholder concerns. However, the stakeholders with data science and Machine Learning (ML) related concerns, such as data scientists and data engineers, are yet to be included in existing architecture frameworks. Only this way can we envision a holistic system architecture description of an ML-enabled system. Note that the ML component behavior and functionalities are special and should be distinguished from traditional software system behavior and functionalities. The main reason is that the actual functionality should be inferred from data instead of being specified at design time. Additionally, the structural models of ML components, such as ML model architectures, are typically specified using different notations and formalisms from what the Software Engineering (SE) community uses for software structural models. Yet, these two aspects, namely ML and non-ML, are becoming so intertwined that it necessitates an extension of software architecture frameworks and modeling practices toward supporting ML-enabled system architectures. In this paper, we address this gap through an empirical study using an online survey instrument. We surveyed 61 subject matter experts from over 25 organizations in 10 countries.

6/28/2024

cs.SE cs.LG

🔮

Physics-Informed Machine Learning for Seismic Response Prediction OF Nonlinear Steel Moment Resisting Frame Structures

R. Bailey Bond, Pu Ren, Jerome F. Hajjar, Hao Sun

There is growing interest in using machine learning (ML) methods for structural metamodeling due to the substantial computational cost of traditional simulations. Purely data-driven strategies often face limitations in model robustness, interpretability, and dependency on extensive data. To address these challenges, this paper introduces a novel physics-informed machine learning (PiML) method that integrates scientific principles and physical laws into deep neural networks to model seismic responses of nonlinear structures. The approach constrains the ML model's solution space within known physical bounds through three main features: dimensionality reduction via combined model order reduction and wavelet analysis, long short-term memory (LSTM) networks, and Newton's second law. Dimensionality reduction addresses structural systems' redundancy and boosts efficiency while extracting essential features through wavelet analysis. LSTM networks capture temporal dependencies for accurate time-series predictions. Manipulating the equation of motion helps learn system nonlinearities and confines solutions within physically interpretable results. These attributes allow for model training with sparse data, enhancing accuracy, interpretability, and robustness. Furthermore, a dataset of archetype steel moment resistant frames under seismic loading, available in the DesignSafe-CI Database [1], is considered for evaluation. The resulting metamodel handles complex data better than existing physics-guided LSTM models and outperforms other non-physics data-driven networks.

4/30/2024

cs.LG

🔄

Unraveling overoptimism and publication bias in ML-driven science

Pouria Saidi, Gautam Dasarathy, Visar Berisha

Machine Learning (ML) is increasingly used across many disciplines with impressive reported results across many domain areas. However, recent studies suggest that the published performance of ML models are often overoptimistic. Validity concerns are underscored by findings of an inverse relationship between sample size and reported accuracy in published ML models, contrasting with the theory of learning curves where accuracy should improve or remain stable with increasing sample size. This paper investigates factors contributing to overoptimistic accuracy reports in ML-driven science, focusing on data leakage and publication bias. We introduce a novel stochastic model for observed accuracy, integrating parametric learning curves and the aforementioned biases. We then construct an estimator that corrects for these biases in observed data. Theoretical and empirical results show that our framework can estimate the underlying learning curve, providing realistic performance assessments from published results. Applying the model to meta-analyses in ML-driven science, including neuroimaging-based and speech-based classifications of neurological conditions, we find prevalent overoptimism and estimate the inherent limits of ML-based prediction in each domain.

6/12/2024

cs.LG cs.AI cs.CY