In the current society, it is essential in all fields to appropriately exploit "big data" for finding rules and/or making predictions/decisions. This course gives fundamental knowledges and basic skills for handling large-scale data sets with the aid of computers.
Students will be able to apply basic knowledges on statistics for analyzing data and evaluating the obtained results mathematically.
classification, clustering, principal component analysis, dimension reduction, training/generalization errors, cross validation
|Intercultural skills||Communication skills||✔ Specialist skills||Critical thinking skills||✔ Practical and/or problem-solving skills|
All classes are given in both Ookayama and Suzukakedai campuses with the use of video conference systems.
|Course schedule||Required learning|
|Class 1||Class guidance||Guidance for class flow, computing environment, and used programming language (Python)|
|Class 2||Fundamentas of data analysis||Learn basic knowledge about statistics and data science|
|Class 3||Classification and model evaluation||Learn methods for extracting discrimination rules from labeled data. Learn about difference between training error and generalization error, and methods of model evaluation.|
|Class 4||Clustering||Learn methods for categorizing unlabeled data into several categories|
|Class 5||Principal component analysis||Learn principal component analysis together with mathematical issues related to it|
|Class 6||Dimension reduction||Learn methods for dimension reduction such as multidimensional scaling and canonical correlation analysis|
|Class 7||Advanced topics||Learn methods for ensemble learning|
|Class 8||General discussion||Discuss possible applications of data analysis in various fields|
Distributed via OCW-i.
Based on quizzes in class/reports.
Preferred to have basic knowledge about linear algebra, analysis, and mathematical statistics.