2018 Introduction to Using Computation and Data

Font size  SML

Register update notification mail Add to favorite lecture list
Academic unit or major
School of Computing
Instructor(s)
Watanabe Osamu  Ono Isao  Murata Tsuyoshi 
Class Format
Lecture     
Media-enhanced courses
Day/Period(Room No.)
Mon1-2(W833, G114)  Thr1-2(W833, G114)  
Group
-
Course number
XCO.T479
Credits
2
Academic year
2018
Offered quarter
2Q
Syllabus updated
2018/3/20
Lecture notes updated
2018/7/30
Language used
English
Access Index

Course description and aims

In the current society it is essential in almost all fields to make use of computation and data appropriately. This course gives fundamental knowledge and basic skills for using computers to analyze big data and making use of it.
This course aims to (1) teach foundations of statistics and computation for processing and analyzing a given data set, and (2) help students to understand and apply various computer software tools for data analysis to get new findings.

Student learning outcomes

(1) Students will be able to apply basic statistical knowledge to analyzing data and evaluating the obtained results mathematically.
(2) Students will be able to understand the basis of data processing mechanisms and make use of various data mining software tools appropriately.

Keywords

data mininig, computation (data processing), knowledge representation, data mining software, statistical test

Competencies that will be developed

Specialist skills Intercultural skills Communication skills Critical thinking skills Practical and/or problem-solving skills

Class flow

In each week we give a lecture and an exercise session. All classes are given in both Ookayama and Suzukakedai campuses (except for the first one). Classes are given in Japanese (in Ookayama campus) and in English (in Suzukakedai campus). The first class (June 11th (Mon)) will be given only at Ookayama campus, where class guidance and guidance for installing computer software enviroments will be given.

Course schedule/Required learning

  Course schedule Required learning
Class 1 Class guidance and an instruction for installing computer softwares into your PC. Prepare software enviroments for exercises in each student's PC.
Class 2 Brief introduction to - computation and programming. Introduction to - data mining, and - basic statistics and mathematics for data analysis Understand the general idea on computation and data mining, and obtain basic knowledges for this course.
Class 3 Exercise: analysis of mushroom data Understand the outline of data mining, and explain what is sample data set.
Class 4 Classification : simple rule and naiive basyesian rule Understand the outline of classification, and explain simple rule and naiive baysesian rule derivations.
Class 5 Exercise: naiive bayesian rule Understand simple classification rule generation mechanism and use the generator appropriately.
Class 6 Classification: decision tree Explain the priciple idea of decision tree construction.
Class 7 Exercise: decsion rule Undersdand the mechanism of decision tree construction algorithm and use the standard decsion tree constructor appropriately.
Class 8 Association rule Explain a concept of association rule and its evaluation method.
Class 9 Exercise: association rule Undersdand the mechanism of association rule generation and use the standard generator appropriately.
Class 10 Regression Undersdand a concept of regression and various evaluation methods.
Class 11 Exercise: regression Undersdand the principle idea of deriving regression rules and use the standard regression rule derivation tools appropriately.
Class 12 Clustering Explain the idea of clustering and basic techniques for clustering.
Class 13 Exercise: clustering Understand the mechanism of standard clustering techniques and use them for identifying clusters.
Class 14 Summary and advanced topics Choose appropriate data mining methods and tools given in this course, and overview advanced topics.
Class 15 Final project Choose an appropriate method for analyzing more practical (but still basic) data mining problems, and evaluate the obtained analysis.

Textbook(s)

I.H. Witten, E. Frank, M.A. Hall, and C. Pal, Data Mining: Practical Machine Learning Tools and Techniques (The 4th Ed.),
Morgan Kaufmann Pub., 2013, ISBN-13: 978-0128042915

Reference books, course materials, etc.

Weka Web Page, https://www.cs.waikato.ac.nz/ml/weka/

Assessment criteria and methods

Assignments given at each exercise session (90%) and a final report project (10%).

Related courses

  • ART.T458 : Machine Learning
  • LAS.I121 : Computer Science I
  • LAS.I122 : Computer Science II

Prerequisites (i.e., required knowledge, skills, courses, etc.)

Each student's own PC is used at each exercise session. We will help students to install necessary software environments right after the guidance of the first class held on June 11th (at Ookayama, only). To get this support, please get ready to connect your PC to the Tokyo tech wireless network and bring it to the class.

Contact information (e-mail and phone)    Notice : Please replace from "[at]" to "@"(half-width character).

watanabe[at]c.titech.ac.jp (Osamu Watanabe)

Office hours

Questions can be sent by email (at any time).

Other

Those who will not be able to attend the first class (held on June 11th (Mon) from 9:00 at Ookayama campus) should contact the instructor (Osamu Watanabe, watanabe@c.titech.ac.jp) in advance.

Page Top