D4 Published development or research report or study
Hyperparameter-free NN algorithm for large-scale regression problems
Authors: Napsu Karmitsa, Sona Taheri, Kaisa Joki, Pauliina Mäkinen, Adil M. Bagirov, Marko M. Mäkelä
Publisher: Turku Centre for Computer Science
Publishing place: Turku
Publication year: 2020
Series title: TUCS Technical Reports
Number in series: 1213
ISBN: 978-952-12-4005-8
ISSN: 1239-1891
Web address : http://oldtucs.abo.fi/publications/view/?pub_id=tKaTaJoMxBaMx20a
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/50375902
In this paper, a new nonsmooth optimization based algorithm for solving large-scale regression problems is introduced. The regression problem is modeled using fullyconnected feedforward neural networks with one hidden layer, the piecewise linear activation, and the L1-loss functions. A novel constructive approach is developed for an automated determination of the proper number of hidden nodes. The limited memory bundle method [Haarala et.al., 2004, 2007] is applied to minimize the nonsmooth objective of the new regression problem. The proposed algorithm is evaluated using real-world data sets with both large number of input features and large number of samples. It is also compared with the well-known backpropagation neural network for regression using TensorFlow. The results demonstrate the superiority of the proposed algorithm as a predictive tool in most data sets used in our numerical experiments.
Downloadable publication This is an electronic reprint of the original article. |