In silico methods for mutagenicity prediction

Thumbnail Image
Date
2025-12-04
Journal Title
Journal ISSN
Volume Title
Publisher
Plovdiv University Press "Paisii Hilendarski"
Abstract
This research presents a thorough exploratory data analysis to develop an in silico model for mutagenicity prediction, contributing to Safe-by-Design strategies. Using a publicly available dataset, chemical structures were encoded via a range of molecular fingerprints and descriptors. Multiple machine learning algorithms—including k-nearest neighbors, support vector machines, and random forest—were assessed. Performance was validated through 10- fold cross-validation and further tested on an external dataset. Random Forest emerged as the most effective method, achieving a cross-validation MCC of 0.68. The in-house models showed competitive performance relative to existing publicly available tools.
Description
Keywords
ames mutagenicity, machine learning, QSAR, in silico
Citation