D4 Published development or research report or study

Hyperparameter-free NN algorithm for large-scale regression problems




AuthorsNapsu Karmitsa, Sona Taheri, Kaisa Joki, Pauliina Mäkinen, Adil M. Bagirov, Marko M. Mäkelä

PublisherTurku Centre for Computer Science

Publishing placeTurku

Publication year2020

Series titleTUCS Technical Reports

Number in series1213

ISBN978-952-12-4005-8

ISSN1239-1891

Web address http://oldtucs.abo.fi/publications/view/?pub_id=tKaTaJoMxBaMx20a

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/50375902


Abstract

In this paper, a new nonsmooth optimization based algorithm for solving large-scale regression problems is introduced. The regression problem is modeled using fullyconnected feedforward neural networks with one hidden layer, the piecewise linear activation, and the L1-loss functions. A novel constructive approach is developed for an automated determination of the proper number of hidden nodes. The limited memory bundle method [Haarala et.al., 2004, 2007] is applied to minimize the nonsmooth objective of the new regression problem. The proposed algorithm is evaluated using real-world data sets with both large number of input features and large number of samples. It is also compared with the well-known backpropagation neural network for regression using TensorFlow. The results demonstrate the superiority of the proposed algorithm as a predictive tool in most data sets used in our numerical experiments.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 20:51