Recent Advances in Testing Techniques for AI Hardware Accelerators

Chaudhuri, Arjun; Chen, Ching-Yuan; Chakrabarty, Krishnendu

doi:10.1561/3500000011

Article navigation

Research Article| June 21 2023

Recent Advances in Testing Techniques for AI Hardware Accelerators

Arjun Chaudhuri;

Arjun Chaudhuri

Duke University

,

USA

Search for other works by this author on:

This Site

PubMed

Google Scholar

Ching-Yuan Chen;

Ching-Yuan Chen

Duke University

,

USA

Search for other works by this author on:

This Site

PubMed

Google Scholar

Krishnendu Chakrabarty

Arizona State University

,

USA

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Online ISSN: 2693-9355

Print ISSN: 2693-9347

2023

A. Chaudhuri et al.

Licensed re-use rights only

Foundations and Trends in Integrated Circuits and Systems (2023) 2 (4): 244–380.

https://doi.org/10.1561/3500000011

Emerging device technologies such as silicon photonics, nonvolatile memories, and heterogeneous monolithic 3D (M3D) integration are being explored as post-Moore’s law alternatives for achieving high-density integration of many-core AI accelerators. In addition to innovations at the device level, architectural optimizations are also being carried out to achieve high-performance processing of large AI workloads with custom accelerator hardware. Systolic array-based inferencing accelerators achieve higher throughput and improved energy efficiency compared to CPUs and GPUs because of the homogeneous and regular data flow in systolic arrays. However, the performance of such emerging AI accelerators can be adversely affected by faults due to process variations, manufacturing defects, and aging. In this monograph, we analyze the performance of several emerging AI accelerators in the presence of different uncertainties and present low-cost methods to assess the significance of faults and mitigate their effects. We show that across all technologies, the functional criticality of faults can vary significantly based on the fault type, fault location, and the application workload. The fault criticality assessment and mitigation techniques presented in this monograph are necessary for enabling low-cost test, diagnosis, and design of robust AI accelerators.

2023

A. Chaudhuri et al.

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

Recent Advances in Testing Techniques for AI Hardware Accelerators

Email Alerts

Cited By

Recent Advances in Testing Techniques for AI Hardware Accelerators

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable