A1 Refereed original research article in a scientific journal
Super Level Sets and Exponential Decay: A Synergistic Approach to Stable Neural Network Training
Authors: Chaudary, Jatin; Nidhi, Dipak; Heikkonen, Jukka; Merisaari, Harri; Kanth, Rajiv
Publisher: AI Access Foundation
Publication year: 2025
Journal: Journal of Artificial Intelligence Research
Journal name in source: Journal of Artificial Intelligence Research
Article number: 21
Volume: 83
ISSN: 1076-9757
eISSN: 1943-5037
DOI: https://doi.org/10.1613/jair.1.17272
Web address: https://doi.org/10.1613/jair.1.17272
Self-archived copy's web address: https://research.utu.fi/converis/portal/detail/Publication/499842072
This paper presents a theoretically grounded optimization framework for neural network training that integrates an Exponentially Decaying Learning Rate with Lyapunov-based stability analysis. We develop a dynamic learning rate algorithm and prove that it induces connected and stable descent paths through the loss landscape by maintaining the connectivity of super-level sets S_c = {θ ∈ ℝ^n : L(θ) ≥ c}. Under the condition that the Lyapunov function V(θ) = L(θ) satisfies ∇V(θ) · ∇L(θ) ≥ 0, we establish that these super-level sets are not only connected but also equiconnected across epochs, providing uniform topological stability. We further derive convergence guarantees using a second-order Taylor expansion and demonstrate that our exponentially scheduled learning rate with gradient-based modulation leads to a monotonic decrease in loss. The proposed algorithm incorporates this schedule into a stability-aware update mechanism that adapts step sizes based on both curvature and energy-level geometry. This work formalizes the role of topological structure in convergence dynamics and introduces a provably stable optimization algorithm for high-dimensional, non-convex neural networks.
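The schedule described in the abstract, an exponentially decaying learning rate modulated by gradient information, can be sketched as follows. This is a minimal illustrative sketch, not the paper's exact algorithm: the decay constants, the modulation term, and the toy quadratic loss are all assumptions introduced here for demonstration.

```python
import numpy as np

def lr_schedule(eta0, decay, epoch, grad_norm, mod=0.1):
    """Exponentially decayed step size eta0 * exp(-decay * epoch),
    further shrunk when the gradient is large (a stability-aware
    modulation). All constants here are illustrative assumptions."""
    return eta0 * np.exp(-decay * epoch) / (1.0 + mod * grad_norm)

def train(theta, loss, loss_grad, eta0=0.5, decay=0.05, epochs=50):
    """Plain gradient descent using the scheduled learning rate;
    returns the final parameters and the per-epoch loss trace."""
    losses = [loss(theta)]
    for t in range(epochs):
        g = loss_grad(theta)
        eta = lr_schedule(eta0, decay, t, np.linalg.norm(g))
        theta = theta - eta * g
        losses.append(loss(theta))
    return theta, losses

# Toy convex loss L(theta) = ||theta||^2 / 2 with gradient theta;
# with eta always below 1 here, the loss decreases monotonically.
theta0 = np.array([3.0, -2.0])
theta, losses = train(theta0, lambda th: 0.5 * th @ th, lambda th: th)
```

On this toy quadratic the step size stays below the stability threshold, so the loss trace decreases at every epoch, mirroring (in a much simpler setting) the monotonic-decrease property the paper proves for its schedule.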
Downloadable publication: This is an electronic reprint of the original article.
Funding information in the publication:
Jatin Chaudhary would like to acknowledge the University of Turku Graduate School's grant for conducting this work.