A1 Refereed original research article in a scientific journal

Euclid preparation LXXV. Estimating galaxy physical properties using CatBoost chained regressors with attention




AuthorsHumphrey, A.; Cunha, P. A. C.; Bisigello, L.; Tortora, C.; Bolzonella, M.; Pozzetti, L.; Baes, M.; Granett, B. R.; Amara, A.; Andreon, S.; Auricchio, N.; Baccigalupi, C.; Baldi, M.; Bardelli, S.; Bodendorf, C.; Bonino, D.; Branchini, E.; Brescia, M.; Brinchmann, J.; Camera, S.; Capobianco, V.; Carbone, C.; Carretero, J.; Casas, S.; Castellano, M.; Castignani, G.; Cavuoti, S.; Cimatti, A.; Colodro-Conde, C.; Congedo, G.; Conselice, C. J.; Conversi, L.; Copin, Y.; Courbin, F.; Courtois, H. M.; Da Silva, A.; Degaudenzi, H.; De Lucia, G.; Dinis, J.; Dubath, F.; Dupac, X.; Dusini, S.; Farina, M.; Farrens, S.; Ferriol, S.; Frailis, M.; Franceschi, E.; Galeotta, S.; George, K.; Gillis, B.; Giocoli, C.; Grazian, A.; Grupp, F.; Guzzo, L.; Haugan, S. V. H.; Holmes, W.; Hook, I.; Hormuth, F.; Hornstrup, A.; Jahnke, K.; Joachimi, B.; Keihänen, E.; Kermiche, S.; Kiessling, A.; Kilbinger, M.; Kubik, B.; Kümmel, M.; Kunz, M.; Kurki-Suonio, H.; Ligori, S.; Lilje, P. B.; Lindholm, V.; Lloro, I.; Mainetti, G.; Maino, D.; Maiorano, E.; Mansutti, O.; Marggraf, O.; Markovic, K.; Martinelli, M.; Martinet, N.; Marulli, F.; Massey, R.; McCracken, H. J.; Medinaceli, E.; Mei, S.; Melchior, M.; Mellier, Y.; Meneghetti, M.; Merlin, E.; Meylan, G.; Moresco, M.; Moscardini, L.; Munari, E.; Nakajima, R.; Niemi, S.-M.; Nightingale, J. W.; Padilla, C.; Paltani, S.; Pasian, F.; Pedersen, K.; Pettorino, V.; Pires, S.; Polenta, G.; Poncet, M.; Popa, L. A.; Raison, F.; Rebolo, R.; Renzi, A.; Rhodes, J.; Riccio, G.; Romelli, E.; Roncarelli, M.; Rossetti, E.; Saglia, R.; Sakr, Z.; Sánchez, A. G.; Sapone, D.; Scaramella, R.; Schneider, P.; Schrabback, T.; Scodeggio, M.; Secroun, A.; Sefusatti, E.; Seidel, G.; Serrano, S.; Sirignano, C.; Stanco, L.; Steinwagner, J.; Tallada-Crespí, P.; Taylor, A. N.; Tereno, I.; Toledo-Moreo, R.; Torradeflot, F.; Tutusaus, I.; Valenziano, L.; Vassallo, T.; Veropalumbo, A.; Wang, Y.; Weller, J.; Zamorani, G.; Zoubian, J.; Zucca, E.; Biviano, A.; Boucaud, A.; Bozzo, E.; Burigana, C.; Calabrese, M.; Farinelli, R.; Mauri, N.; Scottez, V.; Tenti, M.; Viel, M.; Wiesmann, M.; Akrami, Y.; Allevato, V.; Anselmi, S.; Ballardini, M.; Blanchard, A.; Borgani, S.; Bruton, S.; Cabanac, R.; Calabro, A.; Cañas-Herrera, G.; Cappi, A.; Carvalho, C. S.; Castro, T.; Chambers, K. C.; Contarini, S.; Cooray, A. R.; Coupon, J.; Cucciati, O.; Desprez, G.; Díaz-Sánchez, A.; Di Domizio, S.; Escartin Vigo, J. A.; Escoffier, S.; Ferrari, A. G.; Ferreira, P. G.; Ferrero, I.; Fornari, F.; Gabarra, L.; Ganga, K.; García-Bellido, J.; Gaztanaga, E.; Giacomini, F.; Gozaliasl, G.; Gregorio, A.; Hall, A.; Hildebrandt, H.; Hjorth, J.; Kajava, J. J. E.; Kansal, V.; Karagiannis, D.; Kirkpatrick, C. C.; Legrand, L.; Libet G.; Loureiro, A.; Maggio, G.; Magliocchetti, M.; Mannucci, F.; Maoli, R.; Martins, C. J. A. P.; Matthew, S.; Maurin, L.; Metcalf, R. B.; Monaco, P.; Moretti, C.; Morgante, G.; Walton, Nicholas A.; Odier, J.; Patrizii, L.; Pöntinen, M.; Popa, V.; Porciani, C.; Potter, D.; Risso, I.; Rocci, P.-F.; Sahlén, M.; Schneider, A.; Sereno, M.; Simon, P.; Spurio Mancini, A.; Tao, C.; Testera, G.; Teyssier, R.; Toft, S.; Tosi, S.; Troja, A.; Tucci, M.; Valieri, C.; Valiviita, J.; Vergani, D.; Verza, G.; Euclid Collaboration

PublisherEDP Sciences

Publication year2025

Journal: Astronomy and Astrophysics

Article numberA74

Volume702

ISSN0004-6361

eISSN1432-0746

DOIhttps://doi.org/10.1051/0004-6361/202452468

Publication's open availability at the time of reportingOpen Access

Publication channel's open availability Open Access publication channel

Web address https://doi.org/10.1051/0004-6361/202452468

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/505588811


Abstract

The Euclid Space Telescope will image about 14 000 deg2 of the extragalactic sky at visible and near-infrared wavelengths, providing a dataset of unprecedented size and richness that will facilitate a multitude of studies into the evolution of galaxies. Although spectroscopy will also be available for some of the galaxies, in the vast majority of cases the main source of information will come from broadband images and data products thereof (i.e. photometry). Therefore, there is a pressing need to identify or develop scalable yet reliable methodologies to estimate the redshift and physical properties of galaxies using broadband photometry from Euclid. Optionally, such methods could also include ground-based optical photometry. To address this need, we present a novel method developed as part of a ‘data challenge’ within the Euclid Collaboration to estimate the redshift, stellar mass, star-formation rate, specific star-formation rate, E(B − V), and age of galaxies using mock Euclid and ground-based photometry. The main novelty of our property-estimation pipeline is its use of the CatBoost implementation of gradient-boosted regression-trees together with chained regression and an intelligent, automatic optimisation of the training data. The pipeline also includes a computationally efficient method to estimate prediction uncertainties, and, in the absence of ground-truth labels, it provides accurate predictions for metrics of model performance up to z ~ 2. We applied our pipeline to several datasets consisting of mock Euclid broadband photometry and mock ground-based ugriz photometry, with the objective of evaluating the performance of our methodology for estimating the redshift and physical properties of galaxies detected in the Euclid Wide Survey. The statistical metrics of prediction residuals vary depending on which mock catalogue and filters are tested. Nonetheless, the quality of our photometric redshift and physical property estimates are highly competitive overall, validating our modelling approach. However, at z ≳ 3.5, the relative sparsity of galaxies resulted in unreliable redshift and physical property estimates, which we argue could be mitigated by building catalogues with better sampling of z ≳ 3.5 galaxies or by switching to the use of spectral energy distribution fitting in this regime. We also find that the inclusion of ground-based optical photometry significantly improves the quality of the property estimation, highlighting the importance of combining Euclid data with ancillary ground-based data from such surveys as the Vera C. Rubin Observatory Legacy Survey of Space and Time and UNIONS.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
This work was supported by Fundação para a Ciência e a Tecnologia (FCT) through grants UID/FIS/04434/2019, UIDB/04434/2020, UIDP/04434/2020, and PTDC/FIS-AST/29245/2017, and an FCT-CAPES Transnational Coöperation Project. AH acknowledges support from the NVIDIA Academic Hardware Grant Program. PACC acknowledges financial support from the FCT through grant 2022.11477.


Last updated on 2025-26-11 at 12:11