A1 Refereed original research article in a scientific journal
Machine learning for transient recognition in difference imaging with minimum sampling effort
Authors: Mong YL, Ackley K, Galloway DK, Killestein T, Lyman J, Steeghs D, Dhillon V, O'Brien PT, Ramsay G, Poshyachinda S, Kotak R, Nuttall L, Palle E, Pollacco D, Thrane E, Dyer MJ, Ulaczyk K, Cutter R, McCormac J, Chote P, Levan AJ, Marsh T, Stanway E, Gompertz B, Wiersema K, Chrimes A, Obradovic A, Mullaney J, Daw E, Littlefair S, Maund J, Makrygianni L, Burhanudin U, Starling RLC, Eyles-Ferris RAJ, Tooke S, Duffy C, Aukkaravittayapun S, Sawangwit U, Awiphan S, Mkrtichian D, Irawati P, Mattila S, Heikkila T, Breton R, Kennedy M, Sanchez DM, Rol E
Publisher: OXFORD UNIV PRESS
Publication year: 2020
Journal: Monthly Notices of the Royal Astronomical Society
Journal name in source: MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Journal acronym: MON NOT R ASTRON SOC
Volume: 499
Issue: 4
First page : 6009
Last page: 6017
Number of pages: 9
ISSN: 0035-8711
eISSN: 1365-2966
DOI: https://doi.org/10.1093/mnras/staa3096
Self-archived copy’s web address: https://arxiv.org/abs/2008.10178
The amount of observational data produced by time-domain astronomy is exponentially increasing. Human inspection alone is not an effective way to identify genuine transients from the data. An automatic real-bogus classifier is needed and machine learning techniques are commonly used to achieve this goal. Building a training set with a sufficiently large number of verified transients is challenging, due to the requirement of human verification. We present an approach for creating a training set by using all detections in the science images to be the sample of real detections and all detections in the difference images, which are generated by the process of difference imaging to detect transients, to be the samples of bogus detections. This strategy effectively minimizes the labour involved in the data labelling for supervised machine learning methods. We demonstrate the utility of the training set by using it to train several classifiers utilizing as the feature representation the normalized pixel values in 21 x 21 pixel stamps centred at the detection position, observed with the Gravitational-wave Optical Transient Observer (GOTO) prototype. The real-bogus classifier trained with this strategy can provide up to 95 per cent prediction accuracy on the real detections at a false alarm rate of 1 per cent.
Downloadable publication This is an electronic reprint of the original article. |