A new approach to kernel based data analysis algorithms

Mussa, HY; Glen, RC

doi:10.1186/1752-153X-3-S1-O6

Volume 3 Supplement 1

4th German Conference on Chemoinformatics: 22. CIC-Workshop

Oral presentation
Open access
Published: 05 June 2009

Chemistry Central Journal volume 3, Article number: O6 (2009)

Kernel based methods (KBMs) [1, 2] are arguably the best data analysis techniques currently available [3, 4]. Unlike neural networks, whose training objective has several local minima besides the global minimum, a kernel based fitting or classification problem is a convex optimization problem with a single minimum. However, finding this minimum (and thereby obtaining the optimal parameters of a given observational model) requires in practice the manipulation, such as inversion, of large matrices. This has been challenging even when the number of data points is just over a few thousand [5, 6].
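To make the link between the convex problem and the large matrix concrete, the sketch below fits a kernel ridge regression model, one common kernel based method. The choice of kernel ridge regression, the Gaussian kernel, the synthetic data, and the names `gaussian_kernel`, `fit`, `predict`, `gamma` and `lam` are assumptions made for illustration only; the abstract does not say which model or kernel the authors use. In this setting the single minimum is obtained by solving one N × N linear system in the Gram matrix.

```python
import numpy as np

def gaussian_kernel(A, B, gamma=1.0):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    # Squared distances via ||a||^2 + ||b||^2 - 2 a.b, clipped at zero
    # to guard against small negative values from rounding.
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def fit(X, y, gamma=1.0, lam=1e-3):
    """Solve (K + lam * I) alpha = y, the minimiser of the ridge objective."""
    K = gaussian_kernel(X, X, gamma)            # N x N Gram matrix
    return np.linalg.solve(K + lam * np.eye(len(X)), y)

def predict(X_train, alpha, X_new, gamma=1.0):
    """Evaluate the fitted model at new points."""
    return gaussian_kernel(X_new, X_train, gamma) @ alpha

# Hypothetical toy data: noise-free samples of sin(x).
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0])

alpha = fit(X, y)
X_test = np.linspace(-2.5, 2.5, 50)[:, None]
max_err = float(np.max(np.abs(predict(X, alpha, X_test) - np.sin(X_test[:, 0]))))
print(f"largest absolute error on the test grid: {max_err:.2e}")
```

The Gram matrix `K` has one row and one column per data point, so both its storage and the cost of the solve grow quickly with the size of the data set.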

The well-established direct methods for updating or inverting huge matrices fail because of the large increase in core-memory storage and CPU time they require, even for moderately sized systems. The root of the problem is that direct methods require O(N²) core-memory storage and their CPU time scales as O(N³), where N is the dimension of the matrix (here, the number of data points). Despite advances in computing power, "conventional" computers can only solve relatively small problems (N ≈ 10⁴ to 10⁵).
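The scaling quoted above can be turned into rough numbers. The sketch below estimates the memory needed to hold a dense N × N matrix in double precision and the time for a Cholesky factorization, whose leading-order cost is N³/3 floating point operations. The function name `dense_solve_cost` and the sustained rate of 10¹¹ operations per second are assumptions for illustration, not figures from the abstract.

```python
def dense_solve_cost(n, bytes_per_entry=8, flops_per_second=1e11):
    """Rough memory (GB) and time (hours) for a dense direct solve of size n."""
    memory_gb = n * n * bytes_per_entry / 1e9   # O(N^2) storage
    flops = n**3 / 3                            # leading-order Cholesky cost
    hours = flops / flops_per_second / 3600     # assumed sustained rate
    return memory_gb, hours

for n in (10**3, 10**4, 10**5, 10**6):
    mem, hrs = dense_solve_cost(n)
    print(f"N = {n:>9,d}: {mem:10.3f} GB, {hrs:10.4f} h")
```

Under these assumptions, N = 10⁵ already needs 80 GB just to store the matrix, and each tenfold increase in N multiplies the memory by 100 and the run time by 1000.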

Another outstanding difficulty with KBMs is how to choose an appropriate kernel function for a given data set [4].
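For context, a common baseline for this choice is to compare candidate kernels on held-out data. The sketch below does this for kernel ridge regression. It is a generic hold-out comparison and not the systematic approach the authors propose, which the abstract does not describe; the candidate kernels, the synthetic data, and the names `kernel_matrix` and `holdout_error` are all hypothetical.

```python
import numpy as np

def kernel_matrix(A, B, name, param):
    """Kernel matrix between rows of A and B for a named kernel."""
    if name == "gaussian":        # param is the width parameter gamma
        sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
        return np.exp(-param * np.maximum(sq, 0.0))
    if name == "polynomial":      # param is the polynomial degree
        return (1.0 + A @ B.T) ** param
    raise ValueError(f"unknown kernel: {name}")

def holdout_error(X_tr, y_tr, X_va, y_va, name, param, lam=1e-3):
    """Fit kernel ridge regression on the training split, score on validation."""
    K = kernel_matrix(X_tr, X_tr, name, param)
    alpha = np.linalg.solve(K + lam * np.eye(len(X_tr)), y_tr)
    pred = kernel_matrix(X_va, X_tr, name, param) @ alpha
    return float(np.mean((pred - y_va) ** 2))

# Hypothetical toy data: a noisy oscillating target.
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(150, 1))
y = np.sin(2 * X[:, 0]) + 0.05 * rng.standard_normal(150)
X_tr, y_tr, X_va, y_va = X[:100], y[:100], X[100:], y[100:]

candidates = [("gaussian", 0.1), ("gaussian", 1.0), ("gaussian", 10.0),
              ("polynomial", 1), ("polynomial", 3)]
scores = {c: holdout_error(X_tr, y_tr, X_va, y_va, *c) for c in candidates}
best = min(scores, key=scores.get)
print("lowest validation error:", best)
```

Each candidate requires its own Gram matrix and linear solve, so a search of this kind multiplies the training cost by the number of candidates.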

In this paper we propose a computationally efficient training scheme for KBMs that obtains the global minimum. We also present a systematic approach to selecting appropriate kernel functions. Some preliminary results on chemical data sets are presented.

References

1. Nadaraya EA: Theory Prob Appl. 1964, 10: 186. doi:10.1137/1110024
2. Watson GS: Sankhya Ser A. 1964, 26: 359.
3. Vapnik V: The Nature of Statistical Learning Theory. 1995, Springer-Verlag, New York.
4. Shawe-Taylor J, Cristianini N: Kernel Methods for Pattern Analysis. 2004, Cambridge University Press.
5. Chua KS: Pattern Recognition Letters. 2003, 24: 75. doi:10.1016/S0167-8655(02)00190-3
6. Mangasarian OL, Musicant DR: J Mach Learn Res. 2001, 1: 161. doi:10.1162/15324430152748218

Author information

Authors and Affiliations

Unilever Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
HY Mussa & RC Glen

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Mussa, H., Glen, R. A new approach to kernel based data analysis algorithms. Chemistry Central Journal 3 (Suppl 1), O6 (2009). https://doi.org/10.1186/1752-153X-3-S1-O6
