Frequent substructure mining of GPCR ligands

Horst, E van der; Bender, A; IJzerman, AP

doi:10.1186/1752-153X-3-S1-P69

Volume 3 Supplement 1

4th German Conference on Chemoinformatics: 22. CIC-Workshop

Poster presentation
Open access
Published: 05 June 2009

Frequent substructure mining of GPCR ligands

E van der Horst¹,
A Bender¹ &
AP IJzerman¹

Chemistry Central Journal volume 3, Article number: P69 (2009) Cite this article

1400 Accesses
Metrics details

In this study, we conducted frequent substructure mining to find the structural features that discriminate between ligands that either do or do not bind to G protein-coupled receptors (GPCRs). Finding which substructures are rare and which are common in GPCR ligands will help in the design of new ligands and for prioritizing compounds for screening. Besides the normal 2D structure notation, three other chemical representations were used. The first 'elaborate' representation used a special type for aromatic bonds, the second also added a special type for any aromatic atom, and the third representation used a special notation for planar, not necessarily aromatic, structures. In all but the normal representation, wildcards were used for halogens and aliphatic heteroatoms with an extra label indicating the atom type. A set of 16 k GPCR ligands was compared against a roughly equal number from a screening set of compounds (Chembridge). For analysis of the results, two decision trees were constructed, one for the most-common substructure for GPCR ligands and one for the most-common substructure in the screening set. The alkylamine substructures were most discriminating for GPCR ligands as compared to the Chembridge set. This reflects the presence of aminergic receptor ligands in the GPCR dataset. Carboxamide substructures were most common in the Chembridge dataset. This is probably due to particular reaction types used to construct the screening library. The 'normal' representation mode led to the most significant substructure for GPCR ligands; the aromatic bonds representation yielded the most significant substructure for the screening compounds. In conclusion, frequent substructure mining is a useful approach for characterizing heterogeneous ligand datasets.

Author information

Authors and Affiliations

Division of Medicinal Chemistry, Leiden/Amsterdam Center for Drug Research, Leiden University, Einsteinweg 55, 2333CC, Leiden, The Netherlands
E van der Horst, A Bender & AP IJzerman

Authors

E van der Horst
View author publications
You can also search for this author in PubMed Google Scholar
A Bender
View author publications
You can also search for this author in PubMed Google Scholar
AP IJzerman
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Horst, E.v.d., Bender, A. & IJzerman, A. Frequent substructure mining of GPCR ligands. Chemistry Central Journal 3 (Suppl 1), P69 (2009). https://doi.org/10.1186/1752-153X-3-S1-P69

Download citation

Published: 05 June 2009
DOI: https://doi.org/10.1186/1752-153X-3-S1-P69

4th German Conference on Chemoinformatics: 22. CIC-Workshop

Frequent substructure mining of GPCR ligands

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

BMC Chemistry

Contact us

4th German Conference on Chemoinformatics: 22. CIC-Workshop

Frequent substructure mining of GPCR ligands

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Chemistry

Contact us