In the current society, it is essential in all fields to appropriately exploit "big data" for finding rules and/or making predictions/decisions. This course aims to help students to manipulate computer software tools for data analysis to get new findings.
Students will be able to understand the basis of data processing mechanisms and make use of various data analysis software tools appropriately.
classification, clustering, principal component analysis, dimension reduction, training/generalization errors, cross validation
✔ Specialist skills | Intercultural skills | Communication skills | Critical thinking skills | ✔ Practical and/or problem-solving skills |
In class, students are required to solve exercise problems that are linked with the contents of taught course ``XCO.T487 Fundamentals of data science".
Course schedule | Required learning | |
---|---|---|
Class 1 | Class guidance and introduction to Python programming | Variables, Control statements, Functions, etc. |
Class 2 | Descriptive and inferential statistics | Fundamental of data analysis such as descriptive and inferential statistics using pandas, a library of Python |
Class 3 | Classification | Do exersises on methods for extracting discrimination rules from labeled data |
Class 4 | Clustering | Do exersises on methods for categorizing unlabeled data into several categories |
Class 5 | Principal component analysis | Do exersises on principal component analysis with mathematical issues related to it |
Class 6 | Dimension reduction | Do exersises on methods for dimension reduction such as multidimensional scaling and canonical correlation analysis |
Class 7 | Advanced topics | Do exersises on methods for ensemble learning |
To enhance effective learning, students are encouraged to spend approximately 100 minutes preparing for class and another 100 minutes reviewing class content afterwards (including assignments) for each class.
They should do so by referring to textbooks and other course material.
Not specified
Distributed via OCW-i
Based on reports for given assignments.
When you apply this exercise, take "XCO.T487 Fundamentals of data science'' , "XCO.T489 Fundamentals of artificial Intelligence" and "T490 Exercises in fundamentals in artificial intelligence" of the same quarter of the same year in parallel. If there are many applicants, a lottery may be held. In the case of students of Tokyo Tech Academy for Convergence of Materials and Informatics, take “TCM.A404 Materials Informatics” instead of “XCO.T487 Fundamentals of data science” and “XCO.T488 Exercises in fundamentals of data science."
Students of the doctor course are required to register XCO.T678 "Exercises in fundamentals of advanced data science" instead of XCO.T488"Exercises in fundamentals of data science."
Exercises are carried out using Google Colaboratory. Students are required to get Google accounts and to get ready for using functions of "file upload/download" in Google Drive.