Explainable machine learning to enable high-throughput electrical conductivity optimization and discovery of doped conjugated polymers

Read original: arXiv:2308.04103 - Published 4/30/2024 by Ji Wei Yoon, Adithya Kumar, Pawan Kumar, Kedar Hippalgaonkar, J Senthilnath, Vijila Chellappan

Overview

  • This paper explores how combining high-throughput experimentation techniques and machine learning (ML) can accelerate the discovery of new materials with desirable properties.
  • The researchers focus on the challenge of measuring electrical conductivity in doped polymer materials, which requires meticulous process control and laborious measurements.
  • They propose an ML approach that uses readily measured absorbance spectra to predict electrical conductivity, reducing the time and effort required.
  • The classification model reached up to 100% accuracy, and the regression model for highly conductive samples reached a test R^2 of 0.984.
  • The proposed workflow improves the efficiency of conductivity measurements by 89% of the maximum achievable with the authors' experimental techniques.
  • The researchers also provide insights into the spectral features that influence conductivity, addressing the common challenge of model explainability in ML.

Plain English Explanation

The researchers in this study wanted to find a faster way to discover new materials with useful properties, like high electrical conductivity. Normally, measuring a material's conductivity requires a lot of careful experimentation and tedious measurements.

To speed things up, the researchers used machine learning (ML) to predict the conductivity of polymer materials based on their absorbance spectra - a measurement that's quicker and easier to take. Their ML models were able to accurately classify samples as having high or low conductivity, and could also predict conductivity values for the most conductive samples.

With this ML-assisted approach, the researchers report an efficiency improvement in the conductivity measurement process equal to 89% of the maximum achievable with their experimental techniques. Additionally, they were able to gain insights into which spectral features influence a material's conductivity, helping to explain how the ML models were making their predictions.

Overall, this research demonstrates how combining high-throughput experimentation and ML can significantly accelerate the discovery of new, high-performance materials. It also shows how ML can be used to provide valuable insights, not just make predictions, which can help advance the field of experimental science.

Technical Explanation

The researchers developed a machine learning (ML) approach to predict the electrical conductivity of doped polymer materials based on their absorbance spectra. Typically, measuring the conductivity of these materials requires meticulous process control, experimentation, and laborious measurements, which can be time-consuming and inefficient.

To address this challenge, the researchers first trained a classification model to distinguish samples whose conductivity exceeds a threshold (set between 25 and 100 S/cm) from the rest, achieving up to 100% accuracy. For the subset of highly conductive samples, they then used a regression model to predict conductivity values, which yielded a test R^2 of 0.984.
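The two-stage idea (classify first, then regress only on the samples flagged as highly conductive) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the data are synthetic, each spectrum is reduced to a single made-up scalar descriptor, and the decision stump and least-squares line stand in for whatever models and descriptors the authors actually used, which this summary does not specify.

```python
import random

THRESHOLD = 25.0  # S/cm; lower end of the 25-100 S/cm range mentioned above


def make_sample(rng):
    """Return (descriptor, conductivity) for one synthetic, made-up sample."""
    descriptor = rng.uniform(0.0, 1.0)
    # Assumed toy relationship: roughly linear in the descriptor, plus noise.
    conductivity = max(500.0 * descriptor + rng.gauss(0.0, 5.0), 0.0)
    return descriptor, conductivity


def fit_stump(xs, labels):
    """Stage 1: choose the descriptor cut that maximises training accuracy."""
    best_cut, best_acc = None, -1.0
    for cut in sorted(xs):
        acc = sum((x >= cut) == y for x, y in zip(xs, labels)) / len(xs)
        if acc > best_acc:
            best_cut, best_acc = cut, acc
    return best_cut


def fit_line(xs, ys):
    """Stage 2: ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def run_workflow(seed=0, n_train=200, n_test=100):
    rng = random.Random(seed)
    train = [make_sample(rng) for _ in range(n_train)]
    test = [make_sample(rng) for _ in range(n_test)]

    # Stage 1: learn a high/low conductivity classifier.
    cut = fit_stump([d for d, _ in train], [c > THRESHOLD for _, c in train])

    # Stage 2: fit the regressor on highly conductive training samples only.
    high = [(d, c) for d, c in train if c > THRESHOLD]
    slope, intercept = fit_line([d for d, _ in high], [c for _, c in high])

    # Evaluate: classification accuracy, then R^2 on test samples flagged as high.
    accuracy = sum((d >= cut) == (c > THRESHOLD) for d, c in test) / len(test)
    flagged = [(d, c) for d, c in test if d >= cut]
    r2 = r_squared([c for _, c in flagged],
                   [slope * d + intercept for d, _ in flagged])
    return accuracy, r2


if __name__ == "__main__":
    accuracy, r2 = run_workflow()
    print(f"test accuracy = {accuracy:.3f}, test R^2 = {r2:.3f}")
```

The practical appeal of the split is that samples classified as low conductivity can be skipped before any slow conductivity measurement is attempted, while the regressor only has to be accurate in the range of interest.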

The researchers further tested their models on the samples with the two highest measured conductivities (498 and 506 S/cm). The models correctly classified both samples and predicted their conductivities with what the authors describe as satisfactory levels of error, suggesting some ability to extrapolate.
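An extrapolation check of this kind can be sketched as follows: hold out the two samples with the highest measured values, fit on the rest, and compare predictions with measurements. The data, the single scalar descriptor, and the linear model are all hypothetical stand-ins chosen for illustration, not the authors' setup.

```python
import random


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def extrapolation_check(seed=0, n_samples=100):
    """Hold out the two most conductive samples and predict them."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        descriptor = rng.uniform(0.1, 1.0)
        # Assumed toy relationship between descriptor and conductivity (S/cm).
        samples.append((descriptor, 500.0 * descriptor + rng.gauss(0.0, 5.0)))

    # Sort by measured conductivity so the last two are the extremes.
    samples.sort(key=lambda s: s[1])
    train, held_out = samples[:-2], samples[-2:]

    slope, intercept = fit_line([d for d, _ in train], [c for _, c in train])

    results = []
    for descriptor, measured in held_out:
        predicted = slope * descriptor + intercept
        relative_error = abs(predicted - measured) / measured
        results.append((measured, predicted, relative_error))
    return results


if __name__ == "__main__":
    for measured, predicted, rel_err in extrapolation_check():
        print(f"measured={measured:.1f}  predicted={predicted:.1f}  "
              f"relative error={rel_err:.3f}")
```

Because the held-out targets lie above every training target, a low error here says more about extrapolation than a random train/test split would.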

With this ML-assisted workflow, the researchers report an improvement in the efficiency of conductivity measurements of 89% of the maximum achievable with their experimental techniques. They also addressed the common challenge of model explainability by exploiting the mathematical properties of the descriptors and the models, which gave them corroborated insights into the spectral features that influence conductivity.
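The summary does not say which mathematical properties the authors exploited. As one generic illustration of how a model's form can expose feature influence, the sketch below fits a linear model on standardized inputs, so that coefficient magnitudes are directly comparable. The three "bands", their true weights, and the gradient-descent fit are assumptions made for this example and are not taken from the paper.

```python
import random


def standardize(column):
    """Rescale a feature column to zero mean and unit variance."""
    n = len(column)
    mean = sum(column) / n
    std = (sum((v - mean) ** 2 for v in column) / n) ** 0.5
    return [(v - mean) / std for v in column]


def fit_linear(columns, target, lr=0.1, steps=500):
    """Batch gradient descent on mean squared error; returns one weight per column."""
    n, k = len(target), len(columns)
    weights = [0.0] * k
    for _ in range(steps):
        residuals = [
            sum(weights[j] * columns[j][i] for j in range(k)) - target[i]
            for i in range(n)
        ]
        for j in range(k):
            grad = sum(residuals[i] * columns[j][i] for i in range(n)) / n
            weights[j] -= lr * grad
    return weights


def band_influence(seed=0, n_samples=200):
    rng = random.Random(seed)
    # Three hypothetical absorbance bands; only the first two affect the target.
    bands = [[rng.gauss(0.0, 1.0) for _ in range(n_samples)] for _ in range(3)]
    target = [
        3.0 * bands[0][i] + 1.0 * bands[1][i] + rng.gauss(0.0, 0.1)
        for i in range(n_samples)
    ]
    mean_target = sum(target) / n_samples
    centered = [t - mean_target for t in target]  # no intercept term needed
    return fit_linear([standardize(b) for b in bands], centered)


if __name__ == "__main__":
    for name, weight in zip(["band_a", "band_b", "band_c"], band_influence()):
        print(f"{name}: standardized weight = {weight:+.3f}")
```

In this toy setting the recovered weights rank the bands in the same order as the true coefficients, which is the kind of sanity check that lets model-derived insights be corroborated against domain knowledge.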

Critical Analysis

The researchers have demonstrated a compelling approach to accelerating the discovery of new materials with desirable properties, such as high electrical conductivity, by combining high-throughput experimentation and machine learning. Their ability to accurately classify and predict the conductivity of highly conductive polymer samples is particularly impressive.

However, the paper does not address some potential limitations of their approach. For instance, the researchers only tested their models on a relatively small dataset, and it's unclear how well the models would perform on a more diverse set of polymer materials. Additionally, the researchers did not discuss the potential for bias in the data or the models, which is an important consideration when using machine learning in experimental science.

Further research could also explore the applicability of this approach to other material properties beyond electrical conductivity, as well as investigate ways to make the models more robust and generalizable. It would also be valuable to see the researchers delve deeper into the insights they gained about the spectral features influencing conductivity and how those insights could inform the design of new materials.

Conclusion

This study demonstrates the power of combining high-throughput experimentation and machine learning to accelerate the discovery of new materials with desirable properties. By developing an ML-based approach to predict the electrical conductivity of doped polymer materials, the researchers were able to significantly improve the efficiency of the measurement process while also gaining valuable insights into the underlying factors that influence conductivity.

The researchers' success in classifying and predicting the conductivity of highly conductive samples, including extrapolative cases, highlights the potential of this approach to advance the field of materials science. Additionally, their efforts to address the challenge of model explainability in ML demonstrate a thoughtful and rigorous approach to ensuring the insights gained from their work are corroborated and meaningful.

Overall, this study offers an exciting glimpse into how the strategic use of machine learning can revolutionize the way we discover and optimize new materials, ultimately driving innovation and progress in a wide range of industries and applications.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers

Explainable machine learning to enable high-throughput electrical conductivity optimization and discovery of doped conjugated polymers

Ji Wei Yoon, Adithya Kumar, Pawan Kumar, Kedar Hippalgaonkar, J Senthilnath, Vijila Chellappan

The combination of high-throughput experimentation techniques and machine learning (ML) has recently ushered in a new era of accelerated material discovery, enabling the identification of materials with cutting-edge properties. However, the measurement of certain physical quantities remains challenging to automate. Specifically, meticulous process control, experimentation and laborious measurements are required to achieve optimal electrical conductivity in doped polymer materials. We propose a ML approach, which relies on readily measured absorbance spectra, to accelerate the workflow associated with measuring electrical conductivity. The classification model accurately classifies samples with a conductivity > 25 to 100 S/cm, achieving a maximum of 100 % accuracy rate. For the subset of highly conductive samples, we employed a regression model to predict their conductivities, yielding an impressive test R^2 value of 0.984. We tested the models with samples of the two highest conductivities (498 and 506 S/cm) and showed that they were able to correctly classify and predict the two extrapolative conductivities at satisfactory levels of errors. The proposed ML-assisted workflow results in an improvement in the efficiency of the conductivity measurements by 89 % of the maximum achievable using our experimental techniques. Furthermore, our approach addressed the common challenge of the lack of explainability in ML models by exploiting bespoke mathematical properties of the descriptors and ML model, allowing us to gain corroborated insights into the spectral influences on conductivity. Through this study, we offer an accelerated pathway for optimizing the properties of doped polymer materials while showcasing the valuable insights that can be derived from purposeful utilization of ML in experimental science.

4/30/2024

Machine Learning Based Prediction of Proton Conductivity in Metal-Organic Frameworks

Seunghee Han, Byeong Gwan Lee, Dae Woon Lim, Jihan Kim

Recently, metal-organic frameworks (MOFs) have demonstrated their potential as solid-state electrolytes in proton exchange membrane fuel cells. However, the number of MOFs reported to exhibit proton conductivity remains limited, and the mechanisms underlying this phenomenon are not fully elucidated, complicating the design of proton-conductive MOFs. In response, we developed a comprehensive database of proton-conductive MOFs and applied machine learning techniques to predict their proton conductivity. Our approach included the construction of both descriptor-based and transformer-based models. Notably, the transformer-based transfer learning (Freeze) model performed the best with a mean absolute error (MAE) of 0.91, suggesting that the proton conductivity of MOFs can be estimated within one order of magnitude using this model. Additionally, we employed feature importance and principal component analysis to explore the factors influencing proton conductivity. The insights gained from our database and machine learning model are expected to facilitate the targeted design of proton-conductive MOFs.

7/18/2024

An Automated Machine Learning Approach to Inkjet Printed Component Analysis: A Step Toward Smart Additive Manufacturing

Abhishek Sahu, Peter H. Aaen, Praveen Damacharla

In this paper, we present a machine learning based architecture for microwave characterization of inkjet printed components on flexible substrates. Our proposed architecture uses several machine learning algorithms and automatically selects the best algorithm to extract the material parameters (ink conductivity and dielectric properties) from on-wafer measurements. Initially, the mutual dependence between material parameters of the inkjet printed coplanar waveguides (CPWs) and EM-simulated propagation constants is utilized to train the machine learning models. Next, these machine learning models along with measured propagation constants are used to extract the ink conductivity and dielectric properties of the test prototypes. To demonstrate the applicability of our proposed approach, we compare and contrast four heuristic based machine learning models. It is shown that eXtreme Gradient Boosted Trees Regressor (XGB) and Light Gradient Boosting (LGB) algorithms perform best for the characterization problem under study.

4/9/2024

Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing

Pranav Shetty, Aishat Adeboye, Sonakshi Gupta, Chao Zhang, Rampi Ramprasad

We present a simulation of various active learning strategies for the discovery of polymer solar cell donor/acceptor pairs using data extracted from the literature spanning ~20 years by a natural language processing pipeline. While data-driven methods have been well established to discover novel materials faster than Edisonian trial-and-error approaches, their benefits have not been quantified for material discovery problems that can take decades. Our approach demonstrates a potential reduction in discovery time by approximately 75 %, equivalent to a 15 year acceleration in material innovation. Our pipeline enables us to extract data from greater than 3300 papers which is ~5 times larger and therefore more diverse than similar data sets reported by others. We also trained machine learning models to predict the power conversion efficiency and used our model to identify promising donor-acceptor combinations that are as yet unreported. We thus demonstrate a pipeline that goes from published literature to extracted material property data which in turn is used to obtain data-driven insights. Our insights include active learning strategies that can be used to train strong predictive models of material properties or be robust to the initial material system used. This work provides a valuable framework for data-driven research in materials science.

6/26/2024