Machine learning for exoplanet detection in high-contrast spectroscopy Combining cross correlation maps and deep learning on medium-resolution integral-field spectra

2405.13468

Published 5/24/2024 by Rakesh Nath-Ranga, Olivier Absil, Valentin Christiaens, Emily O. Garvin

🔎

Abstract

The advent of high-contrast imaging instruments combined with medium-resolution spectrographs allows spectral and temporal dimensions to be combined with spatial dimensions to detect and potentially characterize exoplanets with higher sensitivity. We develop a new method to effectively leverage the spectral and spatial dimensions in integral-field spectroscopy (IFS) datasets using a supervised deep-learning algorithm to improve the detection sensitivity to high-contrast exoplanets. We begin by applying a data transform whereby the IFS datasets are replaced by cross-correlation coefficient tensors obtained by cross-correlating our data with young gas giant spectral template spectra. This transformed data is then used to train machine learning (ML) algorithms. We train a 2D CNN and 3D LSTM with our data. We compare the ML models with a non-ML algorithm, based on the STIM map of arXiv:1810.06895. We test our algorithms on simulated young gas giants in a dataset that contains no known exoplanet, and explore the sensitivity of algorithms to detect these exoplanets at contrasts ranging from 1e-3 to 1e-4 at different radial separations. We quantify the sensitivity using modified receiver operating characteristic curves (mROC). We discover that the ML algorithms produce fewer false positives and have a higher true positive rate than the STIM-based algorithm, and the true positive rate of ML algorithms is less impacted by changing radial separation. We discover that the velocity dimension is an important differentiating factor. Through this paper, we demonstrate that ML techniques have the potential to improve the detection limits and reduce false positives for directly imaged planets in IFS datasets, after transforming the spectral dimension into a radial velocity dimension through a cross-correlation operation.

Create account to get full access

Overview

Researchers developed a new deep learning-based method to improve the detection of exoplanets in integral-field spectroscopy (IFS) datasets.
The method involves transforming the IFS data into cross-correlation coefficient tensors, which are then used to train machine learning models.
The machine learning algorithms were able to detect simulated young gas giant exoplanets more effectively than a non-machine learning algorithm.

Plain English Explanation

Astronomers are constantly searching for planets orbiting other stars, known as exoplanets. Improving Earth-like Planet Detection with Radial Velocity and Machine Learning-based Identification of Gaia Astrometric Exoplanets are examples of machine learning techniques being applied to this challenge. In this research, the authors developed a new approach that combines the spatial, spectral, and temporal dimensions of observational data to detect exoplanets more effectively.

The key idea is to transform the IFS data, which contains information about the brightness and spectrum of light from different parts of the sky, into a format that is better suited for machine learning algorithms. Specifically, they calculate the cross-correlation between the data and known spectra of young gas giant planets. This creates a "cross-correlation coefficient tensor" that captures the similarity between the observed data and the template spectra.

The authors then trained two different machine learning models - a 2D convolutional neural network and a 3D long short-term memory network - to analyze these cross-correlation tensors and detect the presence of exoplanets. They found that these machine learning algorithms were able to detect simulated young gas giant exoplanets more effectively than a non-machine learning algorithm, with fewer false positives and a higher true positive rate.

The key insight is that the velocity dimension, which is encoded in the cross-correlation, is an important factor in distinguishing exoplanets from other features in the data. By leveraging this information, the machine learning models were able to improve the detection limits and reduce false positives compared to previous methods.

Technical Explanation

The researchers began by applying a data transformation to the IFS datasets, replacing them with cross-correlation coefficient tensors. These tensors were obtained by cross-correlating the IFS data with spectral templates of young gas giant planets. This transformed data was then used to train two machine learning models: a 2D convolutional neural network (CNN) and a 3D long short-term memory (LSTM) network.

The performance of these machine learning models was compared to a non-machine learning algorithm based on the STIM map from Deep Learning and LLM-based Methods Applied to Astronomical Data. The algorithms were tested on simulated datasets containing no known exoplanets, but with injected young gas giant exoplanets at contrasts ranging from 10^-3 to 10^-4 and varying radial separations.

The sensitivity of the algorithms was quantified using modified receiver operating characteristic (mROC) curves. The results showed that the machine learning models produced fewer false positives and had a higher true positive rate compared to the STIM-based algorithm. Additionally, the true positive rate of the machine learning algorithms was less impacted by changes in radial separation.

The researchers discovered that the velocity dimension, encoded in the cross-correlation operation, was an important differentiating factor for the machine learning models. This suggests that the spectral and temporal dimensions, when combined with the spatial information, can provide valuable cues for exoplanet detection.

Critical Analysis

The paper presents a promising approach for improving the detection of directly imaged exoplanets in IFS datasets using machine learning techniques. The use of cross-correlation coefficient tensors to transform the data into a format more suitable for machine learning is a novel and interesting idea.

However, the authors acknowledge that the simulated datasets used in their experiments may not fully capture the complexity of real-world observational data. Flare Up Your Data: Diffusion-based Augmentation and Detecting Moving Objects with Machine Learning discuss the challenges of working with real-world astronomical data. Further testing on actual observational data would be necessary to validate the effectiveness of this approach in practical settings.

Additionally, the paper does not provide much detail on the specific architectures and hyperparameters of the machine learning models used. More information on the model design choices and their impact on performance would be helpful for researchers interested in replicating or building upon this work.

Overall, the paper demonstrates the potential of machine learning techniques to enhance exoplanet detection in IFS datasets, particularly by leveraging the spectral and temporal dimensions of the data. However, more research is needed to address the limitations and ensure the robustness of the approach in real-world astronomical applications.

Conclusion

This research presents a novel deep learning-based method for improving the detection of exoplanets in integral-field spectroscopy (IFS) datasets. By transforming the IFS data into cross-correlation coefficient tensors and using these as input to machine learning models, the researchers were able to achieve better detection sensitivity and fewer false positives compared to a non-machine learning algorithm.

The key insight is that the velocity dimension, encoded in the cross-correlation operation, is an important factor in distinguishing exoplanets from other features in the data. This suggests that combining spatial, spectral, and temporal information can be a powerful approach for exoplanet detection.

While the results are promising, further testing on real-world observational data is needed to validate the effectiveness of this method in practical astronomical applications. Additionally, more details on the machine learning model architectures and hyperparameters would be helpful for researchers interested in building upon this work.

Overall, this research demonstrates the potential of machine learning techniques to enhance exoplanet detection and characterization, and highlights the value of integrating multiple observational dimensions for this challenging task.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔎

Improving Earth-like planet detection in radial velocity using deep learning

Yinan Zhao, Xavier Dumusque, Michael Cretignier, Andrew Collier Cameron, David W. Latham, Mercedes L'opez-Morales, Michel Mayor, Alessandro Sozzetti, Rosario Cosentino, Isidro G'omez-Vargas, Francesco Pepe, Stephane Udry

Many novel methods have been proposed to mitigate stellar activity for exoplanet detection as the presence of stellar activity in radial velocity (RV) measurements is the current major limitation. Unlike traditional methods that model stellar activity in the RV domain, more methods are moving in the direction of disentangling stellar activity at the spectral level. The goal of this paper is to present a novel convolutional neural network-based algorithm that efficiently models stellar activity signals at the spectral level, enhancing the detection of Earth-like planets. We trained a convolutional neural network to build the correlation between the change in the spectral line profile and the corresponding RV, full width at half maximum (FWHM) and bisector span (BIS) values derived from the classical cross-correlation function. This algorithm has been tested on three intensively observed stars: Alpha Centauri B (HD128621), Tau ceti (HD10700), and the Sun. By injecting simulated planetary signals at the spectral level, we demonstrate that our machine learning algorithm can achieve, for HD128621 and HD10700, a detection threshold of 0.5 m/s in semi-amplitude for planets with periods ranging from 10 to 300 days. This threshold would correspond to the detection of a $sim$4$mathrm{M}_{oplus}$ in the habitable zone of those stars. On the HARPS-N solar dataset, our algorithm is even more efficient at mitigating stellar activity signals and can reach a threshold of 0.2 m/s, which would correspond to a 2.2$mathrm{M}_{oplus}$ planet on the orbit of the Earth. To the best of our knowledge, it is the first time that such low detection thresholds are reported for the Sun, but also for other stars, and therefore this highlights the efficiency of our convolutional neural network-based algorithm at mitigating stellar activity in RV measurements.

5/24/2024

cs.LG

Machine learning-based identification of Gaia astrometric exoplanet orbits

Johannes Sahlmann, Pablo G'omez

The third Gaia data release (DR3) contains $sim$170 000 astrometric orbit solutions of two-body systems located within $sim$500 pc of the Sun. Determining component masses in these systems, in particular of stars hosting exoplanets, usually hinges on incorporating complementary observations in addition to the astrometry, e.g. spectroscopy and radial velocities. Several DR3 two-body systems with exoplanet, brown-dwarf, stellar, and black-hole components have been confirmed in this way. We developed an alternative machine learning approach that uses only the DR3 orbital solutions with the aim of identifying the best candidates for exoplanets and brown-dwarf companions. Based on confirmed substellar companions in the literature, we use semi-supervised anomaly detection methods in combination with extreme gradient boosting and random forest classifiers to determine likely low-mass outliers in the population of non-single sources. We employ and study feature importance to investigate the method's plausibility and produced a list of 22 best candidates of which four are exoplanet candidates and another five are either very-massive brown dwarfs or very-low mass stars. Three candidates, including one initial exoplanet candidate, correspond to false-positive solutions where longer-period binary star motion was fitted with a biased shorter-period orbit. We highlight nine candidates with brown-dwarf companions for preferential follow-up. One candidate companion around the Sun-like star G 15-6 could be confirmed as a genuine brown dwarf using external radial-velocity data. This new approach is a powerful complement to the traditional identification methods for substellar companions among Gaia astrometric orbits. It is particularly relevant in the context of Gaia DR4 and its expected exoplanet discovery yield.

4/16/2024

cs.LG

NotPlaNET: Removing False Positives from Planet Hunters TESS with Machine Learning

Valentina Tardugno Poleo (NYU), Nora Eisner (CCA), David W. Hogg (NYU, CCA)

Differentiating between real transit events and false positive signals in photometric time series data is a bottleneck in the identification of transiting exoplanets, particularly long-period planets. This differentiation typically requires visual inspection of a large number of transit-like signals to rule out instrumental and astrophysical false positives that mimic planetary transit signals. We build a one-dimensional convolutional neural network (CNN) to separate eclipsing binaries and other false positives from potential planet candidates, reducing the number of light curves that require human vetting. Our CNN is trained using the TESS light curves that were identified by Planet Hunters citizen scientists as likely containing a transit. We also include the background flux and centroid information. The light curves are visually inspected and labeled by project scientists and are minimally pre-processed, with only normalization and data augmentation taking place before training. The median percentage of contaminants flagged across the test sectors is 18% with a maximum of 37% and a minimum of 10%. Our model keeps 100% of the planets for 16 of the 18 test sectors, while incorrectly flagging one planet candidate (0.3%) for one sector and two (0.6%) for the remaining sector. Our method shows potential to reduce the number of light curves requiring manual vetting by up to a third with minimal misclassification of planet candidates.

5/29/2024

cs.LG

Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification

Yu-Yang Li, Yu Bai, Cunshi Wang, Mengwei Qu, Ziteng Lu, Roberto Soria, Jifeng Liu

Light curves serve as a valuable source of information on stellar formation and evolution. With the rapid advancement of machine learning techniques, it can be effectively processed to extract astronomical patterns and information. In this study, we present a comprehensive evaluation of deep-learning and large language model (LLM) based models for the automatic classification of variable star light curves, based on large datasets from the Kepler and K2 missions. Special emphasis is placed on Cepheids, RR Lyrae, and eclipsing binaries, examining the influence of observational cadence and phase distribution on classification precision. Employing AutoDL optimization, we achieve striking performance with the 1D-Convolution+BiLSTM architecture and the Swin Transformer, hitting accuracies of 94% and 99% correspondingly, with the latter demonstrating a notable 83% accuracy in discerning the elusive Type II Cepheids-comprising merely 0.02% of the total dataset.We unveil StarWhisper LightCurve (LC), an innovative Series comprising three LLM-based models: LLM, multimodal large language model (MLLM), and Large Audio Language Model (LALM). Each model is fine-tuned with strategic prompt engineering and customized training methods to explore the emergent abilities of these models for astronomical data. Remarkably, StarWhisper LC Series exhibit high accuracies around 90%, significantly reducing the need for explicit feature engineering, thereby paving the way for streamlined parallel data processing and the progression of multifaceted multimodal models in astronomical applications. The study furnishes two detailed catalogs illustrating the impacts of phase and sampling intervals on deep learning classification accuracy, showing that a substantial decrease of up to 14% in observation duration and 21% in sampling points can be realized without compromising accuracy by more than 10%.

4/17/2024

cs.CL cs.LG