2019 | Theory of Statistical Mathematics

Home
> School of Computing
> Graduate major in Mathematical and Computing Science
> Theory of Statistical Mathematics

Undergraduate
Graduate

2019　Theory of Statistical Mathematics

Font size S M L

Academic unit or major: Graduate major in Mathematical and Computing Science

Instructor(s): Kanamori Takafumi

Class Format: Lecture

Media-enhanced courses

Day/Period(Room No.): Tue7-8(W834) Fri7-8(W834)

Group: -

Course number: MCS.T507

Credits: 2

Academic year: 2019

Offered quarter: 1Q

Syllabus updated: 2019/3/18

Lecture notes updated: 2019/4/26

Language used: English

Access Index

Course description and aims

Some advanced topics and theories related to statistics and machine learning are taught. More specifically, a nonparametric method called kernel method, statistical properties of training and prediction errors, generalization error bound using Rademacher complexity, and recent deep learning algorithms are taught.

Student learning outcomes

[Objectives] Statistical science and machine learning are disciplines in which useful information is extracted from data to aid human decision making. Students will learn methodology not simply as knowledge, but also learning the background theory including the validity of those methods to promote understanding the essence. Students will broadly apply all kinds of techniques to a variety of problems, learning to construct new techniques on one's own.
[Topics] Students in this course will learn several of statistical science's more advanced techniques, based on their connection to various application fields. We will focus in particular on the connection with machine learning, introducing central topics from both statistical science and machine learning.

Keywords

machine learning, statistics, kernel methods, prediction error, Rademacher complexity, deep learning

Competencies that will be developed

✔ Specialist skills

Intercultural skills

Communication skills

✔ Critical thinking skills

Practical and/or problem-solving skills

Class flow

Lectures are given using black board mainly.

Course schedule/Required learning

	Course schedule	Required learning
Class 1	Regression analysis and kernel methods	Understand statistical modeling with kernel functions in regression analysis.
Class 2	Kernel methods I: positive definite kernels	Learn some properties of kernel functions.
Class 3	Kernel methods II: reproducing kernel Hilbert space	Learn a kernel method that is a nonparametric method on reproducing kernel Hilbert space.
Class 4	Spline smoothing and kernel methods	Learn the relationship between spline smoothing methods and kernel methods.
Class 5	Classification analysis and kernel methods: support vector machine	Learn kernel-based support vector machine for classification problems.
Class 6	Multi-class classification	Understand learning algorithms for multi-class classification problems such as error correcting output coding method.
Class 7	Kernel embedding methods	Learn kernel embedding methods and its applications to various statistical inferences.
Class 8	Problem setup of statistical learning theory	Understand the problem setup of statistical learning theory. Learn definitions of training errors, prediction errors, Bayes errors and Bayes rules.
Class 9	Inequalities in Probability Theory	Review the probability theory and understand some probabilistic inequalities.
Class 10	Prediction Error Bound for Finite hypothesis class	Learn some properties understand the relationship between the complexity of the model and the prediction error. f kernel functions
Class 11	Rademacher Complexity	Learn the Rademacher complexity of the model.
Class 12	Generalization Error bound using Rademacher Complexity	Understand how to calculate prediction errors using Rademacher complexity.
Class 13	Theory of surrogate losses	Learn the relationship between surrogate losses and prediction errors in classification problems.
Class 14	Computation algorithm for support vector machines.	Learn an efficient computation algorithm for support vector machines.
Class 15	Generative Adversarial Networks(GAN)	Learn the algorithm and statistical properties of Generative Adversarial Networks(GAN) that is used as the generative model for images.

Textbook(s)

Unspecified.

Reference books, course materials, etc.

Shai Shalev-Shwartz and Shai Ben-David, Understanding Machine Learning: From Theory to Algorithms, Cambridge University Press, 2014.

Assessment criteria and methods

Evaluated by report submission.

Related courses

MCS.T223 ： Mathematical Statistics
MCS.T402 ： Mathematical Optimization: Theory and Algorithms
MCS.T403 ： Statistical Learning Theory

Prerequisites (i.e., required knowledge, skills, courses, etc.)

It is preferred that students know the basics of statistics and probability theory.

Contact information (e-mail and phone) Notice : Please replace from "[at]" to "@"(half-width character).

kanamori[at]c.titech.ac.jp

Office hours

Contact by e-mail in advance.

TOKYO INSTITUTE OF TECHNOLOGY