Electron-nucleus cross sections from transfer learning

Read original: arXiv:2408.09936 - Published 8/20/2024 by Krzysztof M. Graczyk, Beata E. Kowal, Artur M. Ankowski, Rwik Dharmapal Banerjee, Jose Luis Bonilla, Hemant Prasad, Jan T. Sobczyk

Overview

This paper applies transfer learning to the prediction of inclusive electron-nucleus scattering cross sections.
The researchers train deep neural networks on electron-carbon scattering data and then fine-tune them to predict cross sections for other nuclear targets.
After fine-tuning, the networks accurately predict cross sections for targets ranging from lithium to iron, even when the fine-tuning dataset is small.

Plain English Explanation

Transfer learning is a technique in machine learning where a model trained on one task is adapted to perform a different but related task, often one for which little data is available. In this paper, the researchers used transfer learning to predict electron-nucleus cross sections, which quantify the probability that an electron scatters off an atomic nucleus.

The researchers first trained deep neural networks on inclusive electron-carbon scattering data. They then fine-tuned those networks on data for other nuclei, so that what the networks learned about carbon carries over to related targets instead of being learned from scratch.

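After fine-tuning, the networks accurately predicted cross sections for nuclear targets ranging from lithium to iron. The method worked even when only a small dataset was available for fine-tuning, demonstrating the power of transfer learning in this domain.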

Technical Explanation

The researchers developed a deep learning model that uses transfer learning to predict electron-nucleus cross sections. The model architecture consists of a feature extraction module and a prediction module, where the feature extraction module is pre-trained on inclusive electron-carbon scattering data and then fine-tuned for the specific task of predicting cross sections for other nuclear targets.
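The pre-train-then-fine-tune workflow described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' setup: the network is a tiny one-input model, the "source" and "target" curves are synthetic sine functions standing in for two related processes (no real scattering data is used), and the names `train`, `features`, and `freeze_features` are hypothetical. Freezing the feature layer and updating only the prediction head is one common way to fine-tune; the paper may fine-tune differently.

```python
import math
import random

random.seed(0)
H = 8  # number of hidden units (arbitrary choice for this sketch)

def features(x, W1, b1):
    """Feature-extraction layer: tanh hidden activations."""
    return [math.tanh(w * x + b) for w, b in zip(W1, b1)]

def predict(x, params):
    """Prediction head: linear readout of the hidden features."""
    W1, b1, w2, b2 = params
    h = features(x, W1, b1)
    return sum(wj * hj for wj, hj in zip(w2, h)) + b2

def mse(data, params):
    """Mean squared error of the model on a list of (x, y) pairs."""
    return sum((predict(x, params) - y) ** 2 for x, y in data) / len(data)

def train(data, params, lr, epochs, freeze_features=False):
    """Full-batch gradient descent on the MSE.

    With freeze_features=True only the head (w2, b2) is updated.
    """
    W1, b1, w2, b2 = list(params[0]), list(params[1]), list(params[2]), params[3]
    n = len(data)
    for _ in range(epochs):
        gW1, gb1, gw2, gb2 = [0.0] * H, [0.0] * H, [0.0] * H, 0.0
        for x, y in data:
            h = features(x, W1, b1)
            g = 2.0 * (sum(wj * hj for wj, hj in zip(w2, h)) + b2 - y) / n
            gb2 += g
            for j in range(H):
                gw2[j] += g * h[j]
                if not freeze_features:
                    dpre = g * w2[j] * (1.0 - h[j] ** 2)  # backprop through tanh
                    gW1[j] += dpre * x
                    gb1[j] += dpre
        for j in range(H):
            w2[j] -= lr * gw2[j]
            W1[j] -= lr * gW1[j]  # gradient stays zero when features are frozen
            b1[j] -= lr * gb1[j]
        b2 -= lr * gb2
    return W1, b1, w2, b2

# Synthetic stand-ins for two related processes (NOT real cross-section data).
source = [(i / 20 - 1, math.sin(math.pi * (i / 20 - 1))) for i in range(41)]
target_small = [(x, 0.8 * math.sin(math.pi * x) + 0.1)
                for x in (-0.8, -0.3, 0.1, 0.5, 0.9)]

init = ([random.uniform(-1, 1) for _ in range(H)],
        [random.uniform(-1, 1) for _ in range(H)],
        [random.uniform(-0.5, 0.5) for _ in range(H)],
        0.0)

# Step 1: pre-train every parameter on the larger source dataset.
pretrained = train(source, init, lr=0.02, epochs=1500)
before = mse(target_small, pretrained)

# Step 2: fine-tune only the head on five target points.
finetuned = train(target_small, pretrained, lr=0.05, epochs=500,
                  freeze_features=True)
after = mse(target_small, finetuned)

print(f"target MSE before fine-tuning: {before:.4f}, after: {after:.4f}")
```

With the feature layer frozen, fine-tuning reduces to a small linear least-squares problem, which is part of why a handful of target points can be enough in this toy setting. The sketch only compares the error on the five fine-tuning points; a real study would evaluate on held-out measurements.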

The networks were trained on inclusive electron-carbon scattering data and then fine-tuned for other nuclei. After fine-tuning, they accurately predicted cross sections for electron interactions with nuclear targets ranging from lithium to iron, and the approach remained effective even when the fine-tuning dataset was small.

Critical Analysis

The researchers acknowledge several limitations of their approach, such as the need for a sufficiently large and diverse pre-training dataset to leverage transfer learning effectively. They also note that the performance of the model may depend on the specific target task and on the quality of the pre-trained feature extraction module.

Additional research could explore the use of more advanced transfer learning techniques, such as meta-learning or few-shot learning, to further improve the model's performance and its ability to adapt to new tasks and datasets. The researchers could also investigate the interpretability of the model's predictions to provide more insights into the underlying physics.

Conclusion

This research demonstrates the potential of transfer learning for predicting electron-nucleus cross sections. Deep neural networks trained on inclusive electron-carbon scattering data can be fine-tuned to accurately predict cross sections for nuclear targets ranging from lithium to iron, even when the fine-tuning dataset is small. More broadly, the authors propose transfer learning as a technique for physics: a network learns the physics of one process and, after fine-tuning, makes predictions for related processes.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Electron-nucleus cross sections from transfer learning

Krzysztof M. Graczyk, Beata E. Kowal, Artur M. Ankowski, Rwik Dharmapal Banerjee, Jose Luis Bonilla, Hemant Prasad, Jan T. Sobczyk

Transfer learning (TL) allows a deep neural network (DNN) trained on one type of data to be adapted for new problems with limited information. We propose to use the TL technique in physics. The DNN learns the physics of one process, and after fine-tuning, it makes predictions for related processes. We consider the DNNs, trained on inclusive electron-carbon scattering data, and show that after fine-tuning, they accurately predict cross sections for electron interactions with nuclear targets ranging from lithium to iron. The method works even when the DNN is fine-tuned on a small dataset.

8/20/2024

Probabilistic transfer learning methodology to expedite high fidelity simulation of reactive flows

Bruno S. Soriano, Ki Sung Jung, Tarek Echekki, Jacqueline H. Chen, Mohammad Khalil

Reduced order models based on the transport of a lower dimensional manifold representation of the thermochemical state, such as Principal Component (PC) transport and Machine Learning (ML) techniques, have been developed to reduce the computational cost associated with the Direct Numerical Simulations (DNS) of reactive flows. Both PC transport and ML normally require an abundance of data to exhibit sufficient predictive accuracy, which might not be available due to the prohibitive cost of DNS or experimental data acquisition. To alleviate such difficulties, similar data from an existing dataset or domain (source domain) can be used to train ML models, potentially resulting in adequate predictions in the domain of interest (target domain). This study presents a novel probabilistic transfer learning (TL) framework to enhance the trust in ML models in correctly predicting the thermochemical state in a lower dimensional manifold and a sparse data setting. The framework uses Bayesian neural networks, and autoencoders, to reduce the dimensionality of the state space and diffuse the knowledge from the source to the target domain. The new framework is applied to one-dimensional freely-propagating flame solutions under different data sparsity scenarios. The results reveal that there is an optimal amount of knowledge to be transferred, which depends on the amount of data available in the target domain and the similarity between the domains. TL can reduce the reconstruction error by one order of magnitude for cases with large sparsity. The new framework required 10 times less data for the target domain to reproduce the same error as in the abundant data scenario. Furthermore, comparisons with a state-of-the-art deterministic TL strategy show that the probabilistic method can require four times less data to achieve the same reconstruction error.

5/20/2024

Transferable Neural Wavefunctions for Solids

Leon Gerard, Michael Scherbela, Halvard Sutterud, Matthew Foulkes, Philipp Grohs

Deep-Learning-based Variational Monte Carlo (DL-VMC) has recently emerged as a highly accurate approach for finding approximate solutions to the many-electron Schrödinger equation. Despite its favorable scaling with the number of electrons, $\mathcal{O}(n_\text{el}^{4})$, the practical value of DL-VMC is limited by the high cost of optimizing the neural network weights for every system studied. To mitigate this problem, recent research has proposed optimizing a single neural network across multiple systems, reducing the cost per system. Here we extend this approach to solids, where similar but distinct calculations using different geometries, boundary conditions, and supercell sizes are often required. We show how to optimize a single ansatz across all of these variations, reducing the required number of optimization steps by an order of magnitude. Furthermore, we exploit the transfer capabilities of a pre-trained network. We successfully transfer a network, pre-trained on 2x2x2 supercells of LiH, to 3x3x3 supercells. This reduces the number of optimization steps required to simulate the large system by a factor of 50 compared to previous work.

5/14/2024

Data Quality Monitoring through Transfer Learning on Anomaly Detection for the Hadron Calorimeters

Mulugeta Weldezgina Asres, Christian Walter Omlin, Long Wang, Pavel Parygin, David Yu, Jay Dittmann, The CMS-HCAL Collaboration

The proliferation of sensors brings an immense volume of spatio-temporal (ST) data in many domains for various purposes, including monitoring, diagnostics, and prognostics applications. Data curation is a time-consuming process for a large volume of data, making it challenging and expensive to deploy data analytics platforms in new environments. Transfer learning (TL) mechanisms promise to mitigate data sparsity and model complexity by utilizing pre-trained models for a new task. Despite the triumph of TL in fields like computer vision and natural language processing, efforts on complex ST models for anomaly detection (AD) applications are limited. In this study, we present the potential of TL within the context of AD for the Hadron Calorimeter of the Compact Muon Solenoid experiment at CERN. We have transferred the ST AD models trained on data collected from one part of a calorimeter to another. We have investigated different configurations of TL on semi-supervised autoencoders of the ST AD models -- transferring convolutional, graph, and recurrent neural networks of both the encoder and decoder networks. The experiment results demonstrate that TL effectively enhances the model learning accuracy on a target subdetector. The TL achieves promising data reconstruction and AD performance while substantially reducing the trainable parameters of the AD models. It also improves robustness against anomaly contamination in the training data sets of the semi-supervised AD models.

8/30/2024