2016 Dependable Computing

Font size  SML

Register update notification mail Add to favorite lecture list
Academic unit or major
Graduate major in Computer Science
Instructor(s)
Kaneko Haruhiko  Watanabe Takuo 
Class Format
Lecture     
Media-enhanced courses
Day/Period(Room No.)
Mon3-4(W834)  Thr3-4(W834)  
Group
-
Course number
CSC.T524
Credits
2
Academic year
2016
Offered quarter
3Q
Syllabus updated
2016/4/27
Lecture notes updated
-
Language used
English
Access Index

Course description and aims

Design methodologies of dependable computer systems are becoming more important because recent computer systems are highly complex, and thus designing dependable systems is becoming more challenging. In this course, students will learn basic concepts of dependable computing, such as fault avoidance, fault tolerance, static and dynamic masks of faults, and dependability calculation methods. As elemental technologies of dependable computer system construction, students will also learn circuit testing methods to detect faults in electronic circuit, error control codes to detect and correct errors in data, dependable techniques for distributed storage systems and networks, security, and cryptography.
The aim of this course is to provide a systematized knowledge of dependable techniques for computer systems, and give elemental techniques for improving system dependability. Based on these, students will acquire abilities to design and construct dependable computer systems.

Student learning outcomes

At the end of this course, students will be able to
1) Have an understanding of basic concepts of dependable computing, such as fault, error, and failure.
2) Have an understanding of reliability metrics, such as failure rate, reliability, and mean-time-to-failure, and acquire the ability to solve basic problems of reliability calculation.
3) Have an understanding of basic techniques for fault-tolerant systems, namely, static/dynamic mask and fail safe.
4) Have an understanding of fault-tolerant algorithms for distributed systems.
5) Have an understanding of error control coding techniques, such as bit error control codes, BCH code, Reed-Solomon code, low-density parity-check code, product code, and interleaving.
6) Have an understanding of design methods of dependable electronic circuits, such as electronic circuit test and design for testability.

Keywords

dependable system, fault tolerance, circuit testing, distributed algorithm, error control code

Competencies that will be developed

Specialist skills Intercultural skills Communication skills Critical thinking skills Practical and/or problem-solving skills

Class flow

Lecture handout will be uploaded to OCWi before each class.
Towards the end of class, students are given exercise problems related to what is taught on that day to solve.
Answer to the exercise problems are given at the beginning of next class.

Course schedule/Required learning

  Course schedule Required learning
Class 1 Fault, error, failure, dependability, and reliability Definitions of fault, error, and failure. Calculation methods of failure rate, reliability, and MTTF.
Class 2 Dependable design by redundancy Static mask, TMR, and nMR.
Class 3 System reconstruction and recovery techniques, and fail-safe techniques Dynamic mask, check-pointing, and fail -safe.
Class 4 Circuit testing: test generation (D-algorithm) D-algorithm (circuit testing)
Class 5 Circuit testing: test generation (PODEM) PODEM (circuit testing)
Class 6 Circuit testing: testability, fault simulation, design for testability Testability, fault simulation, scan design, built-in self test
Class 7 Error control codes: Galois field, linear space, minimum distance prime field, linear space, minimum distance, generator matrix, parity-check matrix
Class 8 Error control codes for data bus systems cyclic code, generator polynomial, parity-check polynomial
Class 9 Error control codes for SRAM/DRAM/Flash memories Hamming code, odd-weight-column SEC-DED code, BCH code
Class 10 Error control codes for optical discs Reed-Solomon code, product code, concatenated code
Class 11 Error control codes for magnetic disks and disk arrays Fire code, interleaving, low-density parity-check code, EVENODD code
Class 12 Dependability of distributed systems: Byzantine general problem Algorithm for Byzantine general problem
Class 13 Dependability of distributed systems: dependable storages Techniques for dependable storage
Class 14 Dependability of distributed systems: dependable networks Techniques for dependable network
Class 15 Dependability of distributed systems: security and cryptography Symmetric key cryptography, public key cryptography, RSA, AES

Textbook(s)

Non required
(Course materials are provided via OCWi)

Reference books, course materials, etc.

Fujiwara, Eiji. Code Design for Dependable Systems. Wiley-InterScience, ISBN: 978-0471756187.
Yoneda, Tomohiro et al., Dependable systems, Tokyo: Kyoritsu-Syuppan, ISBN: 978-4320121522. (Japanese)

Assessment criteria and methods

Students will be assessed on their understanding of fundamental theories of dependable computing, circuit testing methods, error control codings, and dependability methods for distributed systems.
Student's course score are based papers (assigned around 4 times).

Related courses

  • CSC.T433 : Advanced Computer Architecture
  • MCS.T333 : Information Theory
  • ZUS.M302 : Algebraic Systems and Coding Theory

Prerequisites (i.e., required knowledge, skills, courses, etc.)

No prerequisites are necessary, but enrollment in the related courses is desirable.

Page Top