Embedded self-test for heterogeneous SoC

 

Yield and Cost Modeling of a Spare-Enhanced Multi-core Chip

Resilient System Design, Task 1.2.3.4.

Saeed Shamshiri and Kwang-Ting (Tim) Cheng

University of California, Santa Barbara

 

Abstract

It becomes increasingly difficult to achieve a high manufacturing yield for multi-core chips due to larger chip sizes, higher device densities, and greater failure rates. By adding a limited number of spare cores to replace defective cores either before shipment or in the field, the effective yield of the chip and its overall cost can be significantly improved. Moreover, permanently faulty wires of the network on chip can be replaced by spare wires.

 In this project, we provide a yield and cost analysis framework to better understand the dependency of a multi-core chip’s cost on key parameters such as the number of spare cores and wires, yield, shape parameter, scale parameter, and defect coverage of manufacturing and in-field testing for each component of the chip. We apply our analysis on an exemplary 9-core processor and on an Intel 80-core processor, and show that a spare scheme can significantly improve the reliability, reduce the cost, and substitute for the burn-in process. We also illustrate that, with in-field recovery capability, the reliance on high quality manufacturing testing is significantly reduced.

 

 

Publications

Two papers have been published on this project so far. In the first paper, we have provided an analytical model for the yield and cost of a multi-core system with spare cores. The analytical results of this work show that the yield and cost of a multi-core system improves significantly by integrating spare cores into the system.  Furthermore, the burn-in process can be eliminated and a cheaper manufacturing testing can be employed for more reduction in the cost of such spare-enhanced system. In the second paper, we consider communication in addition to computation for an NoC-based SoC with mesh architecture. We model the yield and cost of a system with spare cores and spare wires and show that the integrated redundancy in computation and communication blocks makes the system more reliable and reduces the overall cost.

1.        Saeed Shamshiri, Peter Lisherness, Sung-Jui Pan, and Kwang-Ting Cheng, “A cost analysis framework for multi-core systems with spares,” in proc. of IEEE International Test Conference (ITC), Oct. 2008. [doc][pdf] [presentation]

2.        Saeed Shamshiri and Kwang-Ting Cheng, “Yield and cost analysis of a reliable NoC,” to be appeared in VLSI Test Symposium (VTS), May 2009 [doc][pdf][presentation]

 

Posters

The first poster corresponds to the first paper and the second poster corresponds to the second paper.

1.        Saeed Shamshiri, Peter Lisherness, Sung-Jui Pan, and Kwang-Ting Cheng, “A cost analysis framework for multi-core systems with spares,” GSRC Annual symposium, Sep. 29-30, 2008. [ppt]

2.        Saeed Shamshiri and Kwang-Ting Cheng, “Yield and Cost Modeling of a Spare-Enhanced NoC,” GSRC Quarterly Workshop, March 9-10, 2009. [ppt]

 

 

Technical Report

A complete set of models and equations is available in this technical report:

1.        S. Shamshiri and K.-T. Cheng, “Yield and cost analysis for spare-enhanced Network-on-Chips,” UCSB Technical Report, 2008. [pdf]

 

Demonstration Software

A software has been developed for this project. The software covers the complete set of models and equations and a Monte Carlo simulation engine. Through this software one can set the input parameters of the model and investigate the trends of yield and cost of a system on chip with integrated spare cores and spare wires.

Demo program will be posted soon.

 

Last update: April 7, 2009