Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models

2406.18175

Published 6/27/2024 by Lars Doorenbos, Eva Sextl, Kevin Heng, Stefano Cavuoti, Massimo Brescia, Olena Torbaniuk, Giuseppe Longo, Raphael Sznitman, Pablo M'arquez-Neila

cs.AI

Galaxy spectroscopy without spectra: Galaxy properties from photometric images with conditional diffusion models

Abstract

Modern spectroscopic surveys can only target a small fraction of the vast amount of photometrically cataloged sources in wide-field surveys. Here, we report the development of a generative AI method capable of predicting optical galaxy spectra from photometric broad-band images alone. This method draws from the latest advances in diffusion models in combination with contrastive networks. We pass multi-band galaxy images into the architecture to obtain optical spectra. From these, robust values for galaxy properties can be derived with any methods in the spectroscopic toolbox, such as standard population synthesis techniques and Lick indices. When trained and tested on 64x64-pixel images from the Sloan Digital Sky Survey, the global bimodality of star-forming and quiescent galaxies in photometric space is recovered, as well as a mass-metallicity relation of star-forming galaxies. The comparison between the observed and the artificially created spectra shows good agreement in overall metallicity, age, Dn4000, stellar velocity dispersion, and E(B-V) values. Photometric redshift estimates of our generative algorithm can compete with other current, specialized deep-learning techniques. Moreover, this work is the first attempt in the literature to infer velocity dispersion from photometric images. Additionally, we can predict the presence of an active galactic nucleus up to an accuracy of 82%. With our method, scientifically interesting galaxy properties, normally requiring spectroscopic inputs, can be obtained in future data sets from large-scale photometric surveys alone. The spectra prediction via AI can further assist in creating realistic mock catalogs.

Create account to get full access

Overview

This paper explores a novel approach to estimating galaxy properties from photometric images, without requiring galaxy spectra.
The researchers develop a conditional diffusion model that can predict various galaxy properties, such as stellar mass, star formation rate, and metallicity, directly from galaxy images.
The model is trained on a large dataset of galaxies with both photometric images and spectroscopic data, allowing it to learn the complex relationships between galaxy appearance and intrinsic properties.
The proposed method could significantly expand the types of galaxy surveys that can be conducted, as obtaining spectroscopic data is often time-consuming and expensive.

Plain English Explanation

Galaxies are massive collections of stars, gas, and dust that come in a variety of shapes and sizes. Traditionally, astronomers have studied the properties of galaxies, such as their mass, star formation rate, and chemical composition, by analyzing the light emitted by the galaxy and breaking it down into its component wavelengths, or spectrum. This process, known as spectroscopy, provides detailed information about the galaxy, but it can be a time-consuming and expensive endeavor.

In this paper, the researchers present a new approach that allows them to estimate galaxy properties without needing to obtain a full spectrum. They develop a machine learning model called a "conditional diffusion model" that can predict various galaxy properties directly from the galaxy's photometric image - that is, the overall brightness and color of the galaxy as seen through different filters. The model is trained on a large dataset of galaxies that have both photometric images and spectroscopic data, allowing it to learn the complex relationships between a galaxy's appearance and its underlying physical properties.

By using this photometric approach instead of spectroscopy, the researchers can potentially expand the types of galaxy surveys that can be conducted, as obtaining photometric data is generally faster and less resource-intensive than obtaining spectroscopic data. This could lead to new insights into the formation and evolution of galaxies across the universe.

Technical Explanation

The key innovation in this paper is the development of a conditional diffusion model that can predict various galaxy properties, such as stellar mass, star formation rate, and metallicity, directly from galaxy photometric images. Diffusion models are a type of generative machine learning model that have been successful in tasks like image generation and text-to-image synthesis.

In this case, the researchers train the diffusion model on a large dataset of galaxies with both photometric images and spectroscopic data. The model learns to capture the complex relationships between the visual appearance of a galaxy and its underlying physical properties. Once trained, the model can then be used to predict those properties for new galaxy images, without requiring the time-consuming process of obtaining a full galaxy spectrum.

The researchers evaluate their approach on data from the Sloan Digital Sky Survey, demonstrating that their conditional diffusion model can accurately estimate galaxy properties and outperform previous methods that relied on simpler Machine Learning models. They also show that the model is robust to factors like galaxy inclination angle and redshift, which can affect the appearance of galaxies in photometric images.

Critical Analysis

One potential limitation of the proposed approach is that it relies on having a large, high-quality dataset of galaxies with both photometric and spectroscopic data for training the model. Obtaining such a comprehensive dataset can be challenging, especially for certain types of galaxies or in regions of the sky that have not been well-surveyed. The researchers acknowledge this issue and suggest that their method could be further improved by incorporating additional data sources or using simulation-based techniques to augment the training data.

Additionally, while the conditional diffusion model can provide estimates of galaxy properties, it may not capture all the nuances and complexities of the underlying physics in the same way that a full spectroscopic analysis would. There may be some information loss or systematic biases introduced by the model, which could limit its applicability in certain scientific contexts where high precision is required.

Overall, the research presented in this paper represents a promising step forward in using machine learning to analyze and understand galaxy data. By leveraging the power of conditional diffusion models, the authors have demonstrated a new approach to extracting valuable information from galaxy photometric images, potentially opening up new avenues for large-scale galaxy surveys and expanding our knowledge of how galaxies form and evolve over cosmic time.

Conclusion

This paper introduces a novel approach to estimating galaxy properties from photometric images, without the need for time-consuming spectroscopic observations. By developing a conditional diffusion model trained on a large dataset of galaxies with both photometric and spectroscopic data, the researchers have shown that it is possible to accurately predict various galaxy properties, such as stellar mass, star formation rate, and metallicity, directly from the photometric images.

This photometric approach could significantly expand the types of galaxy surveys that can be conducted, as obtaining photometric data is generally faster and less resource-intensive than obtaining spectroscopic data. The insights gained from these surveys could lead to new understandings of galaxy formation and evolution across the universe.

While the proposed method has some limitations, such as the need for a comprehensive training dataset and the potential for information loss compared to full spectroscopic analysis, the research presented in this paper represents an important step forward in the use of machine learning techniques to analyze and understand galaxy data. As the field of astronomy continues to generate increasingly large and complex datasets, approaches like the one described in this paper will become increasingly valuable for extracting meaningful insights and advancing our knowledge of the cosmos.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

📈

AstroCLIP: A Cross-Modal Foundation Model for Galaxies

Liam Parker, Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation from both images and spectra, and (4) morphology classification. Our approach to implementing AstroCLIP consists of two parts. First, we embed galaxy images and spectra separately by pretraining separate transformer-based image and spectrum encoders in self-supervised settings. We then align the encoders using a contrastive loss. We apply our method to spectra from the Dark Energy Spectroscopic Instrument and images from its corresponding Legacy Imaging Survey. Overall, we find remarkable performance on all downstream tasks, even relative to supervised baselines. For example, for a task like photometric redshift prediction, we find similar performance to a specifically-trained ResNet18, and for additional tasks like physical property estimation (stellar mass, age, metallicity, and sSFR), we beat this supervised baseline by 19% in terms of $R^2$. We also compare our results to a state-of-the-art self-supervised single-modal model for galaxy images, and find that our approach outperforms this benchmark by roughly a factor of two on photometric redshift estimation and physical property prediction in terms of $R^2$, while remaining roughly in-line in terms of morphology classification. Ultimately, our approach represents the first cross-modal self-supervised model for galaxies, and the first self-supervised transformer-based architectures for galaxy images and spectra.

6/17/2024

cs.AI cs.LG

🔎

Machine learning for exoplanet detection in high-contrast spectroscopy Combining cross correlation maps and deep learning on medium-resolution integral-field spectra

Rakesh Nath-Ranga, Olivier Absil, Valentin Christiaens, Emily O. Garvin

The advent of high-contrast imaging instruments combined with medium-resolution spectrographs allows spectral and temporal dimensions to be combined with spatial dimensions to detect and potentially characterize exoplanets with higher sensitivity. We develop a new method to effectively leverage the spectral and spatial dimensions in integral-field spectroscopy (IFS) datasets using a supervised deep-learning algorithm to improve the detection sensitivity to high-contrast exoplanets. We begin by applying a data transform whereby the IFS datasets are replaced by cross-correlation coefficient tensors obtained by cross-correlating our data with young gas giant spectral template spectra. This transformed data is then used to train machine learning (ML) algorithms. We train a 2D CNN and 3D LSTM with our data. We compare the ML models with a non-ML algorithm, based on the STIM map of arXiv:1810.06895. We test our algorithms on simulated young gas giants in a dataset that contains no known exoplanet, and explore the sensitivity of algorithms to detect these exoplanets at contrasts ranging from 1e-3 to 1e-4 at different radial separations. We quantify the sensitivity using modified receiver operating characteristic curves (mROC). We discover that the ML algorithms produce fewer false positives and have a higher true positive rate than the STIM-based algorithm, and the true positive rate of ML algorithms is less impacted by changing radial separation. We discover that the velocity dimension is an important differentiating factor. Through this paper, we demonstrate that ML techniques have the potential to improve the detection limits and reduce false positives for directly imaged planets in IFS datasets, after transforming the spectral dimension into a radial velocity dimension through a cross-correlation operation.

5/24/2024

cs.LG

🤯

Field-level simulation-based inference with galaxy catalogs: the impact of systematic effects

Natal'i S. M. de Santi, Francisco Villaescusa-Navarro, L. Raul Abramo, Helen Shao, Lucia A. Perez, Tiago Castro, Yueying Ni, Christopher C. Lovell, Elena Hernandez-Martinez, Federico Marinacci, David N. Spergel, Klaus Dolag, Lars Hernquist, Mark Vogelsberger

It has been recently shown that a powerful way to constrain cosmological parameters from galaxy redshift surveys is to train graph neural networks to perform field-level likelihood-free inference without imposing cuts on scale. In particular, de Santi et al. (2023) developed models that could accurately infer the value of $Omega_{rm m}$ from catalogs that only contain the positions and radial velocities of galaxies that are robust to uncertainties in astrophysics and subgrid models. However, observations are affected by many effects, including 1) masking, 2) uncertainties in peculiar velocities and radial distances, and 3) different galaxy selections. Moreover, observations only allow us to measure redshift, intertwining galaxies' radial positions and velocities. In this paper we train and test our models on galaxy catalogs, created from thousands of state-of-the-art hydrodynamic simulations run with different codes from the CAMELS project, that incorporate these observational effects. We find that, although the presence of these effects degrades the precision and accuracy of the models, and increases the fraction of catalogs where the model breaks down, the fraction of galaxy catalogs where the model performs well is over 90 %, demonstrating the potential of these models to constrain cosmological parameters even when applied to real data.

5/13/2024

cs.LG

A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model

Mingxiang Fu, Yu Song, Jiameng Lv, Liang Cao, Peng Jia, Nan Li, Xiangru Li, Jifeng Liu, A-Li Luo, Bo Qiu, Shiyin Shen, Liangping Tu, Lili Wang, Shoulin Wei, Haifeng Yang, Zhenping Yi, Zhiqiang Zou

The exponential growth of astronomical datasets provides an unprecedented opportunity for humans to gain insight into the Universe. However, effectively analyzing this vast amount of data poses a significant challenge. Astronomers are turning to deep learning techniques to address this, but the methods are limited by their specific training sets, leading to considerable duplicate workloads too. Hence, as an example to present how to overcome the issue, we built a framework for general analysis of galaxy images, based on a large vision model (LVM) plus downstream tasks (DST), including galaxy morphological classification, image restoration, object detection, parameter extraction, and more. Considering the low signal-to-noise ratio of galaxy images and the imbalanced distribution of galaxy categories, we have incorporated a Human-in-the-loop (HITL) module into our large vision model, which leverages human knowledge to enhance the reliability and interpretability of processing galaxy images interactively. The proposed framework exhibits notable few-shot learning capabilities and versatile adaptability to all the abovementioned tasks on galaxy images in the DESI legacy imaging surveys. Expressly, for object detection, trained by 1000 data points, our DST upon the LVM achieves an accuracy of 96.7%, while ResNet50 plus Mask R-CNN gives an accuracy of 93.1%; for morphology classification, to obtain AUC ~0.9, LVM plus DST and HITL only requests 1/50 training sets compared to ResNet18. Expectedly, multimodal data can be integrated similarly, which opens up possibilities for conducting joint analyses with datasets spanning diverse domains in the era of multi-message astronomy.

5/20/2024

cs.AI