In the current society it is essential in almost all fields to make use of computation and data appropriately. This course gives fundamental knowledge and basic skills for using computers to analyze big data and making use of it.
This course aims to (1) teach foundations of statistics and computation for processing and analyzing a given data set, and (2) help students to understand and apply various computer software tools for data analysis to get new findings.
(1) Students will be able to apply basic statistical knowledge to analyzing data and evaluating the obtained results mathematically.
(2) Students will be able to understand the basis of data processing mechanisms and make use of various data mining software tools appropriately.
data mininig, computation (data processing), knowledge representation, data mining software, statistical test
✔ Specialist skills | Intercultural skills | Communication skills | Critical thinking skills | ✔ Practical and/or problem-solving skills |
In each week we give a lecture and an exercise session. All classes are given in both Ookayama and Suzukakedai campuses (except for the first one). Classes are given in Japanese (in Ookayama campus) and in English (in Suzukakedai campus). The first class (June 11th (Mon)) will be given only at Ookayama campus, where class guidance and guidance for installing computer software enviroments will be given.
Course schedule | Required learning | |
---|---|---|
Class 1 | Class guidance and an instruction for installing computer softwares into your PC. | Prepare software enviroments for exercises in each student's PC. |
Class 2 | Brief introduction to - computation and programming. Introduction to - data mining, and - basic statistics and mathematics for data analysis | Understand the general idea on computation and data mining, and obtain basic knowledges for this course. |
Class 3 | Exercise: analysis of mushroom data | Understand the outline of data mining, and explain what is sample data set. |
Class 4 | Classification : simple rule and naiive basyesian rule | Understand the outline of classification, and explain simple rule and naiive baysesian rule derivations. |
Class 5 | Exercise: naiive bayesian rule | Understand simple classification rule generation mechanism and use the generator appropriately. |
Class 6 | Classification: decision tree | Explain the priciple idea of decision tree construction. |
Class 7 | Exercise: decsion rule | Undersdand the mechanism of decision tree construction algorithm and use the standard decsion tree constructor appropriately. |
Class 8 | Association rule | Explain a concept of association rule and its evaluation method. |
Class 9 | Exercise: association rule | Undersdand the mechanism of association rule generation and use the standard generator appropriately. |
Class 10 | Regression | Undersdand a concept of regression and various evaluation methods. |
Class 11 | Exercise: regression | Undersdand the principle idea of deriving regression rules and use the standard regression rule derivation tools appropriately. |
Class 12 | Clustering | Explain the idea of clustering and basic techniques for clustering. |
Class 13 | Exercise: clustering | Understand the mechanism of standard clustering techniques and use them for identifying clusters. |
Class 14 | Summary and advanced topics | Choose appropriate data mining methods and tools given in this course, and overview advanced topics. |
Class 15 | Final project | Choose an appropriate method for analyzing more practical (but still basic) data mining problems, and evaluate the obtained analysis. |
I.H. Witten, E. Frank, M.A. Hall, and C. Pal, Data Mining: Practical Machine Learning Tools and Techniques (The 4th Ed.),
Morgan Kaufmann Pub., 2013, ISBN-13: 978-0128042915
Weka Web Page, https://www.cs.waikato.ac.nz/ml/weka/
Assignments given at each exercise session (90%) and a final report project (10%).
Each student's own PC is used at each exercise session. We will help students to install necessary software environments right after the guidance of the first class held on June 11th (at Ookayama, only). To get this support, please get ready to connect your PC to the Tokyo tech wireless network and bring it to the class.
watanabe[at]c.titech.ac.jp (Osamu Watanabe)
Questions can be sent by email (at any time).
Those who will not be able to attend the first class (held on June 11th (Mon) from 9:00 at Ookayama campus) should contact the instructor (Osamu Watanabe, watanabe@c.titech.ac.jp) in advance.