Convolutional neural network for Lyman break galaxies classification and redshift regression in DESI (Dark Energy Spectroscopic Instrument)

Read original: arXiv:2406.16730 - Published 6/26/2024 by Julien Taran

🧠

Overview

DESI is an international project to create a 3D map of the sky by observing over 40 million quasars and galaxies
The goal is to use this map to study various aspects of cosmology, from dark energy to neutrino mass
This paper focuses on using a convolutional neural network (CNN) to classify Lyman Break Galaxies (LBGs) and determine their redshift from their spectra

Plain English Explanation

The DESI project is an ambitious international effort to create a detailed 3D map of the universe by observing and cataloging over 40 million galaxies and quasars over a 5-year period. This 3D map will enable researchers to study a wide range of cosmological phenomena, from the mysterious "dark energy" that is driving the accelerating expansion of the universe to the tiny mass of neutrinos.

In this paper, the researchers are focusing on a specific type of galaxy observed by DESI called Lyman Break Galaxies (LBGs). LBGs appear in the distant universe due to the way their light is "redshifted" as it travels towards us over billions of years. By analyzing the spectra (light signatures) of these LBGs, the researchers aim to both confirm their identity and determine their exact distance from Earth using this redshift effect.

To do this, the researchers developed a convolutional neural network (CNN) model inspired by a previous system called QuasarNET. This CNN can simultaneously classify whether a galaxy is an LBG or not, and also predict its redshift (distance) value. To train the model, the researchers used data augmentation techniques like shifting the galaxy spectra and adding synthetic data to expand their training dataset from 3,019 examples to over 66,000.

After further refining the CNN architecture and optimizing its hyperparameters, the researchers were able to achieve significant improvements in the model's performance - up to a 26% gain on key evaluation metrics, particularly for LBGs at low and high redshifts. The best model ultimately reached an average score of 94%, a substantial improvement over the initial 75% baseline.

Technical Explanation

The researchers developed a convolutional neural network (CNN) inspired by QuasarNET to simultaneously classify Lyman Break Galaxies (LBGs) and predict their redshift from their spectra. This was done in the context of the larger DESI project, which aims to create a detailed 3D map of the observable universe by studying over 40 million galaxies and quasars.

Initially, the researchers had access to a dataset of 3,019 LBG spectra. To increase the size and diversity of the training data, they applied several data augmentation techniques, including shifting the spectra in wavelength, adding noise, and generating synthetic spectra. This expanded the dataset to over 66,000 examples.

In a second phase, the researchers made modifications to the QuasarNET architecture, incorporating transfer learning and using Bayesian optimization to tune the hyperparameters. These changes led to substantial performance gains, with improvements of up to 26% on the Purity/Efficiency curve - a key metric for evaluating model performance, particularly in regions of interest at low (around 2) and high (around 4) redshifts.

The best model ultimately achieved an average score of 94%, a significant improvement over the initial 75% baseline. This demonstrates the power of combining data augmentation, transfer learning, and hyperparameter optimization to enhance the performance of deep learning models for challenging astronomical tasks like classifying radio galaxies or analyzing galaxy images.

Critical Analysis

The researchers acknowledge several limitations and areas for further research in this work. For example, they note that the data augmentation techniques used, while effective, may not fully capture the true diversity of LBG spectra observed in the real universe. Additionally, the model was trained and evaluated on a relatively small dataset, and its performance on the full DESI dataset remains to be seen.

Furthermore, the researchers do not delve deeply into the potential sources of error or uncertainty in the redshift predictions made by the model. Accurate redshift estimation is crucial for properly mapping the 3D structure of the universe, so a more thorough analysis of the model's redshift prediction accuracy and its implications would be valuable.

Finally, while the researchers demonstrate significant performance gains compared to the initial QuasarNET model, it would be helpful to have a more detailed comparison to other state-of-the-art approaches for LBG classification and redshift estimation. This would provide a better understanding of how the proposed model compares to alternative techniques in the field.

Overall, this research represents an important step forward in leveraging deep learning to support the ambitious goals of the DESI project. However, further work is needed to fully understand the capabilities and limitations of the proposed approach, and to ensure its robust performance on the large-scale datasets that DESI will ultimately produce.

Conclusion

This paper presents a convolutional neural network (CNN) model that can simultaneously classify Lyman Break Galaxies (LBGs) and predict their redshift (distance) from their spectral data. This work is part of the larger DESI project, which aims to create a detailed 3D map of the observable universe by studying over 40 million galaxies and quasars.

The researchers used data augmentation techniques to expand their initial dataset of 3,019 LBG spectra to over 66,000 examples, and then made architectural modifications and performed hyperparameter tuning to improve the model's performance. They were able to achieve gains of up to 26% on key evaluation metrics, particularly for LBGs at low and high redshifts, with the best model reaching an average score of 94%.

While this research represents an important step forward, the researchers acknowledge several limitations and areas for further work, such as the need for a more thorough analysis of the model's redshift prediction accuracy and a more detailed comparison to other state-of-the-art approaches. Nonetheless, this work demonstrates the power of deep learning techniques to support the ambitious goals of large-scale astronomical projects like DESI, and lays the groundwork for continued advancements in this field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🧠

Convolutional neural network for Lyman break galaxies classification and redshift regression in DESI (Dark Energy Spectroscopic Instrument)

Julien Taran

DESI is a groundbreaking international project to observe more than 40 million quasars and galaxies over a 5-year period to create a 3D map of the sky. This map will enable us to probe multiple aspects of cosmology, from dark energy to neutrino mass. We are focusing here on one type of object observed by DESI, the Lyman Break Galaxies (LBGs). The aim is to use their spectra to determine whether they are indeed LBGs, and if so, to determine their distance from the Earth using a phenomenon called redshift. This will enable us to place these galaxies on the DESI 3D map. The aim is therefore to develop a convolutional neural network (CNN) inspired by QuasarNET (See arXiv:1808.09955), performing simultaneously a classification (LBG type or not) and a regression task (determine the redshift of the LBGs). Initially, data augmentation techniques such as shifting the spectra in wavelengths, adding noise to the spectra, or adding synthetic spectra were used to increase the model training dataset from 3,019 data to over 66,000. In a second phase, modifications to the QuasarNET architecture, notably through transfer learning and hyperparameter tuning with Bayesian optimization, boosted model performance. Gains of up to 26% were achieved on the Purity/Efficiency curve, which is used to evaluate model performance, particularly in areas with interesting redshifts, at low (around 2) and high (around 4) redshifts. The best model obtained an average score of 94%, compared with 75% for the initial model.

6/26/2024

Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies

Mehdi Rezaie, Ashley J. Ross, Hee-Jong Seo, Hui Kong, Anna Porredon, Lado Samushia, Edmond Chaussidon, Alex Krolewski, Arnaud de Mattia, Florian Beutler, Jessica Nicole Aguilar, Steven Ahlen, Shadab Alam, Santiago Avila, Benedict Bahr-Kalus, Jose Bermejo-Climent, David Brooks, Todd Claybaugh, Shaun Cole, Kyle Dawson, Axel de la Macorra, Peter Doel, Andreu Font-Ribera, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, Julien Guy, Klaus Honscheid, Dragan Huterer, Theodore Kisner, Martin Landriau, Michael Levi, Marc Manera, Aaron Meisner, Ramon Miquel, Eva-Maria Mueller, Adam Myers, Jeffrey A. Newman, Jundan Nie, Nathalie Palanque-Delabrouille, Will Percival, Claire Poppett, Graziano Rossi, Eusebio Sanchez, Michael Schubnell, Gregory Tarl'e, Benjamin Alan Weaver, Christophe Y`eche, Zhimin Zhou, Hu Zou

We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $fnl$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2< z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the primary sources of systematic error, and employ linear regression and artificial neural networks to alleviate non-cosmological excess clustering on large scales. Our methods are tested against simulations with and without $fnl$ and systematics, showing superior performance of the neural network treatment. The neural network with a set of nine imaging property maps passes our systematic null test criteria, and is chosen as the fiducial treatment. Assuming the universality relation, we find $fnl = 34^{+24(+50)}_{-44(-73)}$ at 68%(95%) confidence. We apply a series of robustness tests (e.g., cuts on imaging, declination, or scales used) that show consistency in the obtained constraints. We study how the regression method biases the measured angular power-spectrum and degrades the $fnl$ constraining power. The use of the nine maps more than doubles the uncertainty compared to using only the three primary maps in the regression. Our results thus motivate the development of more efficient methods that avoid over-correction, protect large-scale clustering information, and preserve constraining power. Additionally, our results encourage further studies of $fnl$ with DESI spectroscopic samples, where the inclusion of 3D clustering modes should help separate imaging systematics and lessen the degradation in the $fnl$ uncertainty.

6/26/2024

Model-independent cosmological inference post DESI DR1 BAO measurements

Purba Mukherjee, Anjan Ananda Sen

In this work, we implement Gaussian process regression to reconstruct the expansion history of the universe in a model-agnostic manner, using the Pantheon-Plus SN-Ia compilation in combination with two different BAO measurements (SDSS-IV and DESI DR1). In both the reconstructions, the $Lambda$CDM model is always included in the 95% confidence intervals. We find evidence that the DESI LRG data at $z_{text{eff}} = 0.51$ is not an outlier within our model-independent framework. We study the $mathcal{O}m$-diagnostics and the evolution of the total equation of state (EoS) of our universe, which hint towards the possibility of a quintessence-like dark energy scenario with a very slowly varying EoS, and a phantom-crossing in higher $z$. The entire exercise is later complemented by considering two more SN-Ia compilations - DES-5YR and Union3 - in combination with DESI BAO. Reconstruction with the DESI BAO + DES-5YR SN data sets predicts that the $Lambda$CDM model lies outside the 3$sigma$ confidence levels, whereas with DESI BAO + Union3 data, the $Lambda$CDM model is always included within 1$sigma$. We also report constraints on $H_0 r_d$ from our model-agnostic analysis, independent of the pre-recombination physics. Our results point towards an $approx$ 2$sigma$ discrepancy between the DESI + Pantheon-Plus and DESI + DES-5YR data sets, which calls for further investigation.

5/30/2024

A deep-learning algorithm to disentangle self-interacting dark matter and AGN feedback models

David Harvey

Different models of dark matter can alter the distribution of mass in galaxy clusters in a variety of ways. However, so can uncertain astrophysical feedback mechanisms. Here we present a Machine Learning method that ''learns'' how the impact of dark matter self-interactions differs from that of astrophysical feedback in order to break this degeneracy and make inferences on dark matter. We train a Convolutional Neural Network on images of galaxy clusters from hydro-dynamic simulations. In the idealised case our algorithm is 80% accurate at identifying if a galaxy cluster harbours collisionless dark matter, dark matter with ${sigma}_{rm DM}/m = 0.1$cm$^2/$g or with ${sigma}_{DM}/m = 1$cm$^2$/g. Whilst we find adding X-ray emissivity maps does not improve the performance in differentiating collisional dark matter, it does improve the ability to disentangle different models of astrophysical feedback. We include noise to resemble data expected from Euclid and Chandra and find our model has a statistical error of < 0.01cm$^2$/g and that our algorithm is insensitive to shape measurement bias and photometric redshift errors. This method represents a new way to analyse data from upcoming telescopes that is an order of magnitude more precise and many orders faster, enabling us to explore the dark matter parameter space like never before.

5/29/2024