Random forest classification for predicting lifespan-extending chemical compounds release_ogxewa2hdzgxxek4ogp7ssojuu

by Sofia Kapsiani, Brendan J. Howlin

Released as a post by Research Square.

2021  

Abstract

<jats:title>Abstract</jats:title> Ageing is a major risk factor for many conditions including cancer, cardiovascular and neurodegenerative diseases. Pharmaceutical interventions that slow down ageing and delay the onset of age-related diseases are a growing research area. The aim of this study was to build a machine learning model based on the data of the DrugAge database to predict whether a chemical compound will extend the lifespan of <jats:italic>Caenorhabditis elegans</jats:italic>. Five predictive models were built using the random forest algorithm with molecular fingerprints and/or molecular descriptors as features. The best performing classifier, built using molecular descriptors, achieved an area under the curve (AUC) score of 0.815 for classifying the compounds in the test set. The features of the model were ranked using the Gini importance measure of the random forest algorithm. The top 30 features included descriptors related to atom and bond counts, topological and partial charge properties. The model was applied to predict the class of compounds in an external database, consisting of 1,738 small-molecules. The chemical compounds of the screening database with a predictive probability of ≥ 0.80 for increasing the lifespan of <jats:italic>Caenorhabditis elegans</jats:italic> were broadly separated into (i) flavonoids, (ii) fatty acids and conjugates, and (iii) organooxygen compounds.
In application/xml+jats format

Archived Files and Locations

application/pdf   536.2 kB
file_sulimdzbh5hlvooftfz5i7sefi
assets.researchsquare.com (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  post
Stage   unknown
Date   2021-03-09
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 201c9370-cb87-4286-86d9-ee87a95806e8
API URL: JSON