MASSA Algorithm: an automated rational sampling of training and test subsets for QSAR modeling

Abstract QSAR models capable of predicting biological, toxicity, and pharmacokinetic properties were widely used to search lead bioactive molecules in chemical databases. The dataset’s preparation to build these models has a strong influence on the quality of the generated models, and sampling requi...
Ausführliche Beschreibung

Gespeichert in:
Autor*in:

Veríssimo, Gabriel Corrêa [verfasserIn]

Pantaleão, Simone Queiroz

Fernandes, Philipe de Olveira

Gertrudes, Jadson Castro

Kronenberger, Thales

Honorio, Kathia Maria

Maltarollo, Vinícius Gonçalves

Format:

E-Artikel

Sprache:

Englisch

Erschienen:

2023

Schlagwörter:

Clustering

Hierarchical clustering analysis (HCA)

K-modes

Training and test sampling

QSAR

Computer-aided drug design

Python

Anmerkung:

© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Übergeordnetes Werk:

Enthalten in: Journal of computer aided molecular design - Dordrecht [u.a.] : Springer Science + Business Media B.V, 1987, 37(2023), 12 vom: 07. Okt., Seite 735-754

Übergeordnetes Werk:

volume:37 ; year:2023 ; number:12 ; day:07 ; month:10 ; pages:735-754

Links:

Volltext

DOI / URN:

10.1007/s10822-023-00536-y

Katalog-ID:

SPR053587243

Nicht das Richtige dabei?

Schreiben Sie uns!