30.12.2014 Views

kernlab: Machine Learning with kernels in R - VideoLectures

kernlab: Machine Learning with kernels in R - VideoLectures

kernlab: Machine Learning with kernels in R - VideoLectures

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>kernlab</strong> Introduction<br />

Demo<br />

<strong>kernlab</strong>:<br />

<strong>Mach<strong>in</strong>e</strong> <strong>Learn<strong>in</strong>g</strong> <strong>with</strong> <strong>kernels</strong> <strong>in</strong> R<br />

Alexandros Karatzoglou 1<br />

jo<strong>in</strong>t work <strong>with</strong> Alex Smola and Kurt Hornik<br />

1 INSA de Rouen<br />

LITIS<br />

France<br />

December 12, 2008


<strong>kernlab</strong> Introduction<br />

Demo<br />

Outl<strong>in</strong>e<br />

1 <strong>kernlab</strong> Introduction<br />

2 Demo


<strong>kernlab</strong> Introduction<br />

Demo<br />

<strong>kernlab</strong><br />

<strong>kernlab</strong> R package:<br />

Conta<strong>in</strong>s a wide range of kernel-based <strong>Mach<strong>in</strong>e</strong> <strong>Learn<strong>in</strong>g</strong><br />

methods<br />

Uses modern Open Source R environment<br />

Extensible by exploit<strong>in</strong>g the <strong>in</strong>herent modularity of kernel<br />

methods<br />

GPL 2


<strong>kernlab</strong> Introduction<br />

Demo<br />

R


<strong>kernlab</strong> Introduction<br />

Demo<br />

R<br />

Environment for statistical data analysis, <strong>in</strong>ference and<br />

visualization.<br />

Ports for Unix, W<strong>in</strong>dows and MacOSX<br />

Highly extensible through user-def<strong>in</strong>ed functions<br />

Generic functions and conventions for standard operations<br />

like plot, predict etc.<br />

∼ 1200 add-on packages contributed by developers from<br />

all over the world<br />

e.g. Multivariate Statistics, <strong>Mach<strong>in</strong>e</strong> <strong>Learn<strong>in</strong>g</strong>, Natural<br />

Language Process<strong>in</strong>g, Bio<strong>in</strong>formatics (Bioconductor).<br />

Interfaces to C, C++, Fortran, Java


<strong>kernlab</strong> Introduction<br />

Demo<br />

<strong>kernlab</strong> Package Content<br />

User<br />

Methods<br />

Utility Functions<br />

Kernels<br />

R<br />

9 kernel functions, 4 kernel expression functions<br />

18 kernel-based ML methods:<br />

Regression, Quantile regression, Classification and Novelty<br />

detection (SVM, RVM, Gaussian Processes)<br />

Cluster<strong>in</strong>g (kernel k-means, Spectral Cluster<strong>in</strong>g)<br />

Rank<strong>in</strong>g and dimensionality reduction (kernel PCA, kernel<br />

CCA, etc.) kernel-based two sample test (KMMD)<br />

Utility Functions:<br />

Quadratic Problem solver<br />

Incomplete Cholesky Decomposition


<strong>kernlab</strong> Introduction<br />

Demo<br />

Some kernel functions <strong>in</strong> <strong>kernlab</strong><br />

L<strong>in</strong>ear kernel<br />

Gaussian radial basis kernel<br />

Hyperbolic tangent kernel<br />

Str<strong>in</strong>g kernel<br />

Polynomial kernel<br />

...


<strong>kernlab</strong> Introduction<br />

Demo<br />

Runtime<br />

Inputs<br />

<strong>kernlab</strong> methods take kernel function and data or kernel<br />

matrix as <strong>in</strong>put<br />

Function parameters are given as options<br />

Outputs<br />

Return objects that conta<strong>in</strong> model parameters.<br />

Objects can be used <strong>with</strong> generic functions such as<br />

predict, plot etc.


<strong>kernlab</strong> Introduction<br />

Demo<br />

Performance<br />

Nystrom Method for eigendecomposition based methods<br />

(e.g. spectral cluster<strong>in</strong>g). Eigenvectors can be calculated<br />

us<strong>in</strong>g only part of the kernel matrix yield<strong>in</strong>g speedups and<br />

better scal<strong>in</strong>g O(n 3 ) + O(nN).<br />

Build <strong>in</strong> kernel function computations are vectorized<br />

C++ on performance bottlenecks<br />

Use of scalable algorithms<br />

. . .


<strong>kernlab</strong> Introduction<br />

Demo<br />

Features<br />

Most function implement additional features such as e.g.<br />

Cross-validation (svm, GP’s)<br />

Model-selection <strong>in</strong> particular hyper-parameter estimation<br />

Data Scal<strong>in</strong>g<br />

Extensive documentation of every function and options


<strong>kernlab</strong> Introduction<br />

Demo<br />

Demo

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!