A4 Refereed article in a conference publication

Raman spectroscopy and machine learning can quantitatively asses clindamycin in liquid samples




AuthorsMilea, Eduard C.; Alecu, Andreia; Stoica, Alice; Necula, Marian; Petre, Ion; Litescu, Simona; Paun, Mihaela

EditorsFlearmoy, Jonathan

Conference nameInternational Conference on Knowledge-Based and Intelligent Information & Engineering Systems

Publication year2025

Journal: Procedia Computer Science

Book title 29th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2025)

Volume270

First page 3518

Last page3527

eISSN1877-0509

DOIhttps://doi.org/10.1016/j.procs.2025.09.477

Publication's open availability at the time of reportingOpen Access

Publication channel's open availability Open Access publication channel

Web address https://doi.org/10.1016/j.procs.2025.09.477

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/505713152


Abstract

Raman spectroscopy offers a powerful, non-destructive tool for pharmaceutical quantification, particularly in environments where traditional techniques like HPLC are limited by throughput and sample preparation demands. However, the quantification of low-concentration compounds remains challenging due to weak Raman scattering and high background interference. This study evaluates the use of portable Raman instrumentation coupled with Support Vector Regression (SVR) to quantify clindamycin across various concentrations. Spectral preprocessing steps included Savitzky–Golay smoothing, Standard Normal Variate (SNV) normalisation, and blank subtraction (ΔSNV) to enhance analyte-specific signal fidelity. Three SVR-based models were developed using full spectra, chemically meaningful fingerprint bands, and coefficient-filtered features. Models were evaluated through grouped cross-validation, bootstrapping, and external testing on formulations prepared in different solvent matrix and derived from distinct clindamycin sources (commercial tablet vs. analytical-grade standard). The top performing model achieved R² values exceeding 0.98 with root mean squared errors below 2.85 mg/mL. Blind sample predictions, made on fully unseen data, fell within 95% confidence intervals of the true concentrations, demonstrating strong model robustness.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
This work was supported in part by the MRID, project PNRR-I8 no 842027778, contract no 760096 and by the
Core Program within the National Research, Development and Innovation Plan 2022-2027, carried out with the support of MRID, project no. 23020101(SIA-PRO), contract no 7N/2022.


Last updated on 2025-03-12 at 14:24