ValidatorDB:CaseStudies

From WebChem Wiki
Revision as of 15:51, 1 November 2014 by Lukas (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

One interesting question is how the quality of the structures varies for different classes of molecules. We have thus designed and conducted several case studies to show how ValidatorDB can answer such questions. We selected the molecules according to a combination of features related to chemical structure, biological function, area of application, availability, etc. The following classes were defined as subsets of models from wwPDB CCD:

1. Polycyclic compounds: Contain 3 or more conjugated rings. The molecules containing metals were excluded, as their quality is influenced more by the presence of the metal than by their polycyclic structure.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

Compared to the PDB-wide statistics for all ligands and non-standard residues, polycyclic molecules have overall higher quality (higher percentage of molecules with complete structure and correct chirality). Nonetheless, they exhibit more errors in C chirality, probably due to their more complicated, carbon-based scaffolds.

2. Carbohydrates: Contain the pyran or furan ring. Molecules containing P (e.g., ATP) were excluded, as their quality is influenced by the occurrence of phosphate derivatives than by the sugar part.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

Carbohydrate molecules show similar trends as polycyclic molecules, since their structure is also ring based. However, they exhibit a higher rate of errors in C chirality, a consequence of the fact that they generally contain more chiral atoms.

3. Mannose derivatives: Subclass of carbohydrates.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

Mannose derivatives play an important role in cell-cell recognition, a biological function which relies heavily on chirality. Therefore they must have a characteristic structure (determined by chirality) and are also strongly predisposed to have C chirality errors. We found that the percentage of errors in C chirality is over 3 times higher for mannose derivatives than the PDB-wide evaluation for all ligands and non-standard residues.

4. Organometalls: Contain a metal atom.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

Organometals seem to have overall lower quality. Part of the errors is artifacts of our validation algorithm, as such molecules can have very complicated scaffolds (see algorithm limitations in the Supplementary Material). However, the majority of the reported errors are significant, proving that many challenges remain in the field of structure determination for organometals.

5. Experimental drugs: Described in DrugBank as experimental drugs, i.e., have been shown to bind specific proteins in mammals, bacteria, viruses, fungi, or parasites.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

On the other hand, the overall quality of the structure of experimental drugs is clearly much higher than the PDB-wide statistics for all ligands and non-standard residues.

6. Approved drugs: Described in DrugBank as approved drugs, i.e., have received approval in at least one country.

List of annotations (3-letter codes) of all molecules in this study
Access the results of this case study

For approved drugs, i.e., drugs already on the market, the situation is even better. About 95 % of these molecules are complete and have correct chirality, a consequence of their rather simpler structure, and that markedly more effort is expended in the determination of their structure in biomacromolecular complexes.