Tautomer Enumeration and Stability Prediction for Virtual Screening on Large Chemical Databases

STORCHI, LORIANO;
2009-01-01

Abstract

Tautomeric rearrangements affect the results of cheminformatics applications that depend on knowledge of the 2D or 3D structure of a compound, such as tools for database searching, fingerprint generation, virtual screening, and physicochemical property prediction. In this paper we present TauThor, a tool that enumerates tautomers and predicts tautomer stability in aqueous medium. The enumeration is a recursive process that generates tautomers according to the general scheme HX-Y=Z ⇋ X=Y-ZH. The stability of a tautomer is calculated using a library of 145 fragments associated with experimental tautomeric percentages in water, together with a pKa-based method that uses pKa values predicted by MoKa. Tautomeric ratios predicted from the pKa calculations were benchmarked against literature data for a set of eleven compounds. The FDA-approved drugs database, the NCI database, and two vendor databases (Specs Screening Library and Asinex Gold Collection) were used to illustrate the impact of tautomerism on chemical libraries and to evaluate the relative occurrence of alternative tautomeric forms.
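The recursive scheme HX-Y=Z ⇋ X=Y-ZH mentioned in the abstract can be illustrated with a minimal Python sketch. This is not TauThor's implementation: the molecule representation (a tuple of element symbols, a tuple of hydrogen counts, and a set of bonds with fixed atom indices), the function names, and the `mobile` parameter that restricts which elements may donate or accept the hydrogen are all assumptions made for illustration. The sketch performs no valence, charge, or aromaticity checks, and symmetry-equivalent tautomers are counted separately.

```python
from typing import FrozenSet, Iterator, Optional, Set, Tuple

Bond = Tuple[int, int, int]                      # (i, j, order) with i < j
State = Tuple[Tuple[int, ...], FrozenSet[Bond]]  # (H count per atom, bonds)


def neighbors(bonds: FrozenSet[Bond], atom: int) -> Iterator[Tuple[int, int]]:
    """Yield (neighbor index, bond order) for every bond touching `atom`."""
    for i, j, order in bonds:
        if i == atom:
            yield j, order
        elif j == atom:
            yield i, order


def shift_hydrogen(state: State, x: int, y: int, z: int) -> State:
    """Apply HX-Y=Z -> X=Y-ZH: move one H from x to z and swap bond orders."""
    h_counts, bonds = state
    new_h = list(h_counts)
    new_h[x] -= 1
    new_h[z] += 1
    xy = (min(x, y), max(x, y))
    yz = (min(y, z), max(y, z))
    new_bonds = set(bonds)
    new_bonds.discard((*xy, 1))
    new_bonds.discard((*yz, 2))
    new_bonds.add((*xy, 2))
    new_bonds.add((*yz, 1))
    return tuple(new_h), frozenset(new_bonds)


def enumerate_tautomers(
    elements: Tuple[str, ...],
    state: State,
    mobile: FrozenSet[str] = frozenset({"N", "O", "S"}),  # assumed default
    seen: Optional[Set[State]] = None,
) -> Set[State]:
    """Recursively collect every state reachable through 1,3-hydrogen shifts."""
    seen = set() if seen is None else seen
    if state in seen:
        return seen
    seen.add(state)
    h_counts, bonds = state
    for x, h in enumerate(h_counts):
        if h == 0 or elements[x] not in mobile:
            continue  # x must carry a hydrogen and be an allowed donor
        for y, order_xy in neighbors(bonds, x):
            if order_xy != 1:
                continue  # X-Y must be a single bond
            for z, order_yz in neighbors(bonds, y):
                if z != x and order_yz == 2 and elements[z] in mobile:
                    shifted = shift_hydrogen(state, x, y, z)
                    enumerate_tautomers(elements, shifted, mobile, seen)
    return seen


# Acetamide, CH3-C(=O)-NH2: atoms 0..3 are C(H3), C, O, N(H2).
elements = ("C", "C", "O", "N")
amide: State = ((3, 0, 0, 2), frozenset({(0, 1, 1), (1, 2, 2), (1, 3, 1)}))

hetero_only = enumerate_tautomers(elements, amide)
with_carbon = enumerate_tautomers(elements, amide, mobile=frozenset({"C", "N", "O"}))
```

With the default heteroatom-only setting, the acetamide example yields the amide and the imidic acid form; allowing carbon as donor or acceptor adds the enol form. Tracking visited states in `seen` is what keeps the recursion from cycling between forms that interconvert.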
Files in this item:
File: ci800340j.pdf
Access: archive managers only
Type: publisher's PDF
Size: 508.21 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11564/225419
Citations
  • PubMed Central 18
  • Scopus 114
  • Web of Science (ISI) 112