D4 Julkaistu kehittämis- tai tutkimusraportti tai -selvitys

Hyperparameter-free NN algorithm for large-scale regression problems




TekijätNapsu Karmitsa, Sona Taheri, Kaisa Joki, Pauliina Mäkinen, Adil M. Bagirov, Marko M. Mäkelä

KustantajaTurku Centre for Computer Science

KustannuspaikkaTurku

Julkaisuvuosi2020

Sarjan nimiTUCS Technical Reports

Numero sarjassa1213

ISBN978-952-12-4005-8

ISSN1239-1891

Verkko-osoitehttp://oldtucs.abo.fi/publications/view/?pub_id=tKaTaJoMxBaMx20a

Rinnakkaistallenteen osoitehttps://research.utu.fi/converis/portal/detail/Publication/50375902


Tiivistelmä

In this paper, a new nonsmooth optimization based algorithm for solving large-scale regression problems is introduced. The regression problem is modeled using fullyconnected feedforward neural networks with one hidden layer, the piecewise linear activation, and the L1-loss functions. A novel constructive approach is developed for an automated determination of the proper number of hidden nodes. The limited memory bundle method [Haarala et.al., 2004, 2007] is applied to minimize the nonsmooth objective of the new regression problem. The proposed algorithm is evaluated using real-world data sets with both large number of input features and large number of samples. It is also compared with the well-known backpropagation neural network for regression using TensorFlow. The results demonstrate the superiority of the proposed algorithm as a predictive tool in most data sets used in our numerical experiments.


Ladattava julkaisu

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 20:51