A1 Refereed original research article in a scientific journal

Using Multivariate Imputation by Chained Equations to Predict Redshifts of Active Galactic Nuclei




AuthorsGibson Spencer James, Narendra Aditya, Dainotti Maria Giovanna, Bogdan Malgorzata, Pollo Agnieszka, Poliszczuk Artem, Rinaldi Enrico, Liodakis Ioannis

PublisherFRONTIERS MEDIA SA

Publication year2022

JournalFrontiers in Astronomy and Space Sciences

Journal name in sourceFRONTIERS IN ASTRONOMY AND SPACE SCIENCES

Journal acronymFRONT ASTRON SPACE

Article number 836215

Volume9

Number of pages16

ISSN2296-987X

DOIhttps://doi.org/10.3389/fspas.2022.836215

Web address https://www.frontiersin.org/articles/10.3389/fspas.2022.836215/full

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/175075385


Abstract
Redshift measurement of active galactic nuclei (AGNs) remains a time-consuming and challenging task, as it requires follow up spectroscopic observations and detailed analysis. Hence, there exists an urgent requirement for alternative redshift estimation techniques. The use of machine learning (ML) for this purpose has been growing over the last few years, primarily due to the availability of large-scale galactic surveys. However, due to observational errors, a significant fraction of these data sets often have missing entries, rendering that fraction unusable for ML regression applications. In this study, we demonstrate the performance of an imputation technique called Multivariate Imputation by Chained Equations (MICE), which rectifies the issue of missing data entries by imputing them using the available information in the catalog. We use the Fermi-LAT Fourth Data Release Catalog (4LAC) and impute 24% of the catalog. Subsequently, we follow the methodology described in Dainotti et al. (ApJ, 2021, 920, 118) and create an ML model for estimating the redshift of 4LAC AGNs. We present results which highlight positive impact of MICE imputation technique on the machine learning models performance and obtained redshift estimation accuracy.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 20:52