2023 | 3D Computer Vision - TOKYO TECH OCW

Home
> School of Computing
> Graduate major in Artificial Intelligence
> 3D Computer Vision

Undergraduate
Graduate

2023　3D Computer Vision

Font size S M L

Academic unit or major: Graduate major in Artificial Intelligence

Instructor(s): Kanezaki Asako

Class Format: Lecture (HyFlex)

Media-enhanced courses

Day/Period(Room No.): Tue3-4(S4-201(S421)) Fri3-4(S4-201(S421))

Group: -

Course number: ART.T466

Credits: 2

Academic year: 2023

Offered quarter: 4Q

Syllabus updated: 2023/3/20

Lecture notes updated: -

Language used: English

Access Index

Syllabus

Course description and aims

This course teaches how to process 3D data to extract higher-level information from the data. It covers the basics of data processing, geometric transformations, and linear algebra for machine learning, as well as recent cutting-edge research on deep neural networks, with hands-on exercises using programming languages such as Python.

Student learning outcomes

At the end of this course, students should be able to
1) acquire the basics of 3D data processing, and
2) implement their desired processing, including deep learning with, e.g., Python.

Keywords

3D Data Processing, Geometric Transformations, Object Recognition, Deep Learning, Neural Networks

Competencies that will be developed

✔ Specialist skills

Intercultural skills

Communication skills

Critical thinking skills

Practical and/or problem-solving skills

Class flow

The lecture will cover the basics, applications, and practices of 3D data processing using lecture materials (slides) and programming exercises.

Course schedule/Required learning

	Course schedule	Required learning
Class 1	Guidance, Introduction of 3D data processing	Understand 3D sensors and data acquisition methods.
Class 2	I/O and Rendering	Understand file input/output and data rendering methods.
Class 3	Rotation, Translation, Scale, and Sampling	Understand geometric transformations and sampling methods for 3D data.
Class 4	Normal estimation, Keypoints, and Features	Understand normal vector estimation methods, 3D keypoint detection methods, and 3D features.
Class 5	Point Cloud Registration (Nearest neighbor search, ICP)	Understand k-d tree data structures and nearest neighbor search, RANSAC, and ICP algorithms.
Class 6	Point Cloud Registration (Implementation)	Implement point cloud registration and apply it to 3D data.
Class 7	Object classification, Pose estimation	Learn about various object recognition tasks and understand the methods.
Class 8	Primitive detection, Segmentation	Learn about various object recognition tasks and understand the methods.
Class 9	Fundamentals of Deep Learning	Learn methods of image classification using deep learning
Class 10	PointNet, Point cloud convolution	Learn and implement point cloud processing methods using deep learning.
Class 11	Latest research trends, General object pose estimation	Learn about the latest research trends in 3D data processing using deep learning and understand general object pose estimation.
Class 12	RGBD, Voxel data, Mesh, Multi-view images, and Implicit functions	Learn about data formats other than 3D point clouds and understand the latest research trends using implicit functions.
Class 13	Student Presentation (1) - 2-3 min. talk for each student	Each student will give an oral presentation, ask questions, and discuss others' presentations.
Class 14	Student Presentation (2) - 2-3 min. talk for each student	Each student will give an oral presentation, ask questions, and discuss others' presentations.

Out-of-Class Study Time (Preparation and Review)

To improve the effectiveness of learning, students are expected to prepare for and review (including assignments) the contents of each class for approximately 60 minutes each, referring to the appropriate sections of the textbook and handouts.

Textbook(s)

None

Reference books, course materials, etc.

Materials translated into English from the above-mentioned Japanese reference book will be distributed.

Assessment criteria and methods

Evaluation of the student's understanding of the lecture content. Grades will be based on attendance and the final presentation.

Related courses

ART.T467 ： Fundamentals of Computer Vision
ART.T552 ： Advanced Topics in Computer Vision
ART.T551 ： Image and Video Recognition

Prerequisites (i.e., required knowledge, skills, courses, etc.)

There are no prerequisites for this course.

TOKYO INSTITUTE OF TECHNOLOGY