Machine learning with kernel methods

Jean-Philippe Vert, Mines ParisTech

MSc Mathematics, Vision, Learning (MVA) (ENS Cachan), Spring 2013

MSc Mathematics for Life Sciences (MathSV) (University Paris South, Ecole Polytechnique, ENS Cachan), Spring 2013

Outline

Many problems in real-world applications of machine learning can be formalized as classical statistical problems, e.g., pattern recognition, regression or dimension reduction, with the caveat that the data are often not vectors of numbers. For example, protein sequences and structures in computational biology, text and XML documents in web mining, segmented pictures in image processing, or time series in speech recognition and finance, have particular structures which contain relevant information for the statistical problem but can hardly be encoded into finite-dimensional vector representations.

Kernel methods are a class of algorithms well suited for such problems. Indeed, they extend the applicability of many statistical methods initially designed for vectors to virtually any type of data, without the need for explicit vectorization of the data. The price to pay for this extension to non-vectors is the need to define a so-called positive definite kernel function between the objects, formally equivalent to an implicit vectorization of the data. The "art" of kernel design for various objects has witnessed important advances in recent years, resulting in many state-of-the-art algorithms and successful applications in many domains.
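As a small illustration of the statement that a positive definite kernel is formally equivalent to an implicit vectorization, the sketch below compares a degree-2 homogeneous polynomial kernel on 2-D inputs with the inner product of an explicit feature map. The kernel choice, the function names (poly2_kernel, poly2_features, dot) and the sample points are assumptions made for this example only; they are not taken from the course material.

```python
import math

def poly2_kernel(x, y):
    """Homogeneous polynomial kernel of degree 2 on 2-D inputs: k(x, y) = <x, y>^2."""
    return (x[0] * y[0] + x[1] * y[1]) ** 2

def poly2_features(x):
    """Explicit feature map phi such that <phi(x), phi(y)> = k(x, y)."""
    return (x[0] ** 2, math.sqrt(2.0) * x[0] * x[1], x[1] ** 2)

def dot(u, v):
    """Standard Euclidean inner product of two equal-length tuples."""
    return sum(a * b for a, b in zip(u, v))

# Illustrative points (arbitrary values chosen for this sketch).
x, y = (1.0, 2.0), (3.0, -1.0)

k_implicit = poly2_kernel(x, y)                          # kernel evaluated directly
k_explicit = dot(poly2_features(x), poly2_features(y))   # via the explicit feature map

# The two values agree up to floating-point rounding.
print(k_implicit, k_explicit)
```

The kernel is evaluated without ever building the feature vector, which is what makes the approach usable when the feature space is very large or infinite-dimensional.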

The goal of this course is to present the mathematical foundations of kernel methods, as well as the main approaches that have emerged so far in kernel design. We will start with a presentation of the theory of positive definite kernels and reproducing kernel Hilbert spaces, which will allow us to introduce several kernel methods including kernel principal component analysis and support vector machines. Then we will come back to the problem of defining the kernel. We will present the main results about Mercer kernels and semigroup kernels, as well as a few examples of kernels for strings and graphs, taken from applications in computational biology, text processing and image analysis.
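To give a flavour of what a kernel for strings can look like, here is a minimal sketch of a k-spectrum kernel, which compares two strings through the counts of their length-k substrings. It follows the usual textbook definition (sum over all k-mers of the product of their counts in each string) and is not necessarily the formulation used in the lectures; the function names kmer_counts and spectrum_kernel and the default k=3 are assumptions for this example.

```python
from collections import Counter

def kmer_counts(s, k):
    """Count every contiguous substring of length k in s."""
    return Counter(s[i:i + k] for i in range(len(s) - k + 1))

def spectrum_kernel(s, t, k=3):
    """k-spectrum kernel: sum over k-mers u of count_u(s) * count_u(t)."""
    counts_s, counts_t = kmer_counts(s, k), kmer_counts(t, k)
    # A Counter returns 0 for k-mers absent from t, so only shared k-mers contribute.
    return sum(c * counts_t[u] for u, c in counts_s.items())

# Illustrative strings (arbitrary, not course data).
print(spectrum_kernel("ABABA", "BABAB", k=2))
```

This kernel is the inner product of the two k-mer count vectors, so it is positive definite by construction, yet the count vectors over all possible k-mers never need to be stored in full.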

References

N. Aronszajn, "Theory of reproducing kernels", Transactions of the American Mathematical Society, 68:337-404, 1950.
C. Berg, J.P.R. Christensen and P. Ressel, "Harmonic analysis on semi-groups", Springer, 1994.
N. Cristianini and J. Shawe-Taylor, "Kernel Methods for Pattern Analysis", Cambridge University Press, 2004.
B. Schölkopf and A. Smola, "Learning with kernels", MIT Press, 2002.
B. Schölkopf, K. Tsuda and J.-P. Vert, "Kernel methods in computational biology", MIT Press, 2004.
V. Vapnik, "Statistical Learning Theory", Wiley, 1998.

Schedule and Homework

Lectures usually take place in room C103 (ENS Cachan, Cournot building).

Homework is due at the beginning of the following lecture, by hard copy or (better) by e-mail to Jean-Philippe.Vert@mines.org. Implementations can be done in the programming language of your choice, e.g., the free R language, or Matlab and its free clone Octave.

Date	Topic	Slides	Homework	Data
Jan 23, 1pm-4pm	Positive definite kernels, RKHS, Aronszajn's theorem	1-32	Homework 1
Jan 30, 1pm-4pm	Kernel trick, representer theorem	33-59	Homework 2
Feb 6, 1pm-4pm	Kernel PCA, kernel ridge regression, pattern recognition	60-99	Homework 3	xtrain.txt, ytrain.txt, xtest.txt, ytest.txt
Feb 13, 1pm-4pm	Optimization, SVM, RKHS and Green functions	100-174	Homework 4
Feb 20, 1pm-4pm	Translation invariant and semigroup kernels. Spectrum kernel for strings	175-227	Homework 5
Feb 27, 1pm-4pm

Jobs / internships / PhD proposals

Contact me for internship/PhD opportunities in machine learning and bioinformatics.

Results

The final grade will be the average of the homework grades.

