Power-Aware Testing and Test Strategies for Low Power Devices
Patrick Girard, Nicola Nicolici, Xiaoqing Wen

To cite this version:

HAL Id: lirmm-00820650
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00820650
Submitted on 6 May 2013

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.
Full-day tutorial on
Power-Aware Testing and Test
Strategies for Low Power Devices

Patrick GIRARD
LIRMM / CNRS
France

Nicola NICOLICI
McMaster University
Canada

Xiaoqing WEN
Kyushu Institute of Technology
Japan

Power-Aware Testing and Test
Strategies for Low Power Devices

Girard, Patrick; Nicolici, Nicola; Wen, Xiaoqing (Eds.)
390 p., 222 illus., Hardcover
November 2009

Available at the Springer booth !!
Motivation and Objectives

Power Consumption Trends

- Exponential growth in transistor density
  - More functionality
- But linear reduction in supply voltage
  - Not adequate to prevent power density to increase

(Tirimurti et al., DATE, 2004)

Motivation and Objectives

Power Consumption Trends

- The new power-performance paradigm:

(courtesy: M. Hirech, Synopsys)
Motivation and Objectives

Power Impact on Test

• Power constraints have severe impact on test

• Implications to test and DfT engineers / test tool developers:
  • Reduce power consumption also in test mode
  • Dedicated test strategies for low-power devices are needed

• Objectives of this tutorial:
  • Learning more about the impact of power during test
  • How to alleviate test power issues
  • How low-power devices and power management structures can be tested safely

Outline

1. Introduction  
2. Power Issues During Test  
3. Low-Power Test Pattern Generation  
4. Power-Aware Design-for-Test  
5. Power-Aware BIST and Test Data Compression  
6. Power-Aware System-Level Test Planning  
7. Low-Power Design Techniques and Test Implications  
8. Test Strategies for Multi-Voltage Designs  
9. Test Strategies for Gated Clock Designs  
10. Test of Power Management Structures  
11. EDA Solutions for Power-Aware Design-for-Test  
Conclusion
1. Introduction

Part 1
Chapter 1
Introduction

Power dissipation in the test mode is much higher than during functional operations.
1. Introduction

Many industrial experiences reported in the literature

[ITC 2003] : Texas Inst. & Siemens AG
ASIC (arithmetic) with Scan, 1M gates, 300kbits SRAM

Toggle activity under functional mode : 15%-20%
Toggle activity under test mode : 35%-40%

[ITC 2008] : Freescale
Power under test mode up to 3.8X power during functional mode

And many other experiences …

1. Introduction

Many reasons (i)

• Switching activity may be higher during testing
  ▪ Structural testing is predominant and tends to produce more toggling than functional patterns because:
    ✓ Conflict of objectives between test and functional mode
    ✓ No correlation between consecutive test vectors
    ✓ DFT (e.g. scan) circuitry is intensively used
  ▪ Test compaction leads to higher switching activity due to parallel fault activation and propagation
  ▪ Multiple cores in a SoC are tested in parallel to reduce test time
1. Introduction

Many reasons (ii)

- Typical power management schemes are disabled during testing
  - Clock gating is turned off to improve observability of internal nodes
  - Dynamic frequency scaling is turned off because:
    - The system clock is bypassed, or
    - The PLL suffers from a relocking time overhead during which no meaningful test can be conducted
  - Dynamic voltage scaling is usually avoided due to time constant in stabilizing supply voltage

1. Introduction

Power availability and quality may be limited during testing

Main reasons

- Long connectors from Tester Power Supply (TPS) to probe-card often result in higher inductance on the power delivery path. This may lead to voltage drop during test power cycling
- During wafer sort test, all power pins may not be connected to the TPS, resulting in reduced power availability
- Current limiters placed on TPS to prevent burn-out due to short-circuit current may affect power availability and quality
1. Introduction

Straightforward solutions for reducing test power

- Test with lower clock frequency
- Partitioning and appropriate test planning
- Over sizing packages and use of cooling equipments
- Over sizing power distribution network (PDN)
  - Grid Sizing based on functional power requirements
    - all parts not active at a time
  - Grid Sizing for test purpose too expensive!!

Costly or longer test time

~ Test in the Past ~
High Fault Coverage
Short Test Time
Small Test Data Volume
Low Test Development Efforts
Low Area Overhead

~ Test from Now ~

Low Test Power
(low average and peak power)

Make test power dissipation comparable to functional power
2. Power Issues during Test

Chapter 2
Power Issues during Test

2.1 Power and Energy Basics

- Static Leakage Power
  - Power consumed when the circuit is idle
  - The static power dissipation of a circuit is given by
    \[ P_{\text{stat}} = \sum_{i=1}^{n} I_{\text{stat}}^i \cdot V_{DD} \]

where \( I_{\text{stat}}^i \) is the current that flows between the supply rails in the absence of switching activity and \( i \) is the index of a gate in a circuit consisting of \( n \) gates.
2. Power Issues during Test

2.1 Power and Energy Basics

- **Static Leakage Power**
  - Due to six components of leakage current:
    - $I_1$: reverse-bias pn junction leakage current
    - $I_2$: sub-threshold leakage current
    - $I_3$: gate oxide tunneling current
    - $I_4$: gate current due to hot carrier injection
    - $I_5$: gate induced drain leakage current
    - $I_6$: channel punch-through current
  - Mainly due to sub-threshold leakage current (in cut-off region), which depends on channel doping concentration, length, $V_t$ and $T^o$
  - \[ P_{stat} = \sum V_{DD} \cdot I_{SUB} \text{ where } I_{SUB} \propto \frac{V_{DD}}{V_t} \]

- **Dynamic Switching Power**
  - Due to charge/discharge of load capacitance during switching:
  \[ P_d = C_L V_{DD}^2 f_{0 \rightarrow 1} \]
  where $f_{0 \rightarrow 1}$ is the number of output rising transitions per second
2. Power Issues during Test

2.1 Power and Energy Basics

• Dynamic Short-Circuit Power
  - Due to direct current path from $V_{DD}$ to Gnd during output switching

\[ P_{sc} = V_{DD} \sum I_{sc}(\tau) d\tau / (t_1 - t_0) \]

- $P_{sc} = \sum V_{DD} \cdot I_{sc}$ with $I_{sc} \propto \text{input_slew} / C_{load}$

2. Power Issues during Test

2.1 Power and Energy Basics

• Total Power Consumption

\[ P_{total} = P_{stat} + P_d + P_{sc} \]

- Switching power is still the main contributor, but leakage power may account for 25% in SoCs and 40% in high performance logic *

• Energy Dissipation
  - $T$ is a given time period

\[ E_{total} = \int_T P_{stat} d\tau + \int_T P_d d\tau + \int_T P_{sc} d\tau \]

- All power components are input state dependent!

* ITRS 2007
2. Power Issues during Test

2.2 Manufacturing Test Flow

• A typical manufacturing test flow

![Manufacturing Test Flow Diagram]

(courtesy: Bernardi et al., ETS, 2009)
2. Power Issues during Test

2.3 Power Delivery Issues during Test

• Main packaging technologies
  • Wire-bonded packages
  • Flip-chip packages

• During wafer sort test
  • All power pads may not be contacted
  • Power delivery constraint arising out of this limitation is shown on the graph
  • A larger inductance on power delivery path is a second problem ...

![Diagram of wafer sort test](image)

(Kundu et al., ITC, 2004)

2. Power Issues during Test

2.3 Power Delivery Issues during Test

• Low-frequency power droop
  • Associated with inductance of off-chip power supply lines ($L \frac{di}{dt}$)
  • Due to current variations (during switching) through inductive connections (ex.: connections from the tester to wafer metal pads)

![Diagram of current variations](image)

- Depends on the level and change rate of current
- Can erroneously change the logic state of some circuit nodes or flip-flops, and cause test fail, thus leading to yield loss

(Kundu et al., ITC, 2004)
2. Power Issues during Test

2.3 Power Delivery Issues during Test

• **Mid-frequency power droop**
  - Associated with inductance at the package level \((L\frac{di}{dt})\) event
  - During execution of instructions, if power demand shifts from one area of the die to a different area, one area will experience a drop in voltage while the other will experience an increase voltage
  - These droops affect an entire region
  - Addressed at the functional level by introducing multiple sensors

  ● MF power droop may lead to manufacturing yield loss if not properly addressed during test

• **High-frequency power droop**
  - Associated with the PDN on the die
  - Occurs when multiple cells drawing current from the same power grid segment suddenly increase their current demand. If the current cannot be provided quickly enough from other parts of the chip, power starvation results in a voltage drop
  - Handled in functional mode by adding a *frequency guard band*
  - A similar guard band is necessary during test mode

  ● May occur due to excess switching during at-speed scan test
  ● HF power droop may lead to manufacturing yield loss
2. Power Issues during Test

2.3 Power Delivery Issues during Test

- IR Drop refers to the amount of decrease (increase) in the power (ground) rail voltage and is linked to the existence of a non-negligible resistance between the rail and each node in the circuit under test.

\[ V_{dd} - R \times I \]

- With high peak current demands, gate voltages are reduced and gate delays increase.
- IR Drop may lead to manufacturing yield loss.

2.4 Thermal Issues during Test

- Heat produced during the operation of a circuit is proportional to the dissipated power (Joule effect) and is responsible for die temperature increase.

\[ T_{die} = T_{air} + \theta \times P_{Average} \]

where \( \theta \) is the package thermal impedance.

Elevated Average Power

- Temperature Increase
- Excessive Heat Dissipation
- Limits at-speed testing and concurrent testing due to thermal hot-spots.
2. Power Issues during Test

2.4 Thermal Issues during Test

- Thermal hot-spot results from localized overheating due to non-uniform spatial on-die power distribution
- Thermal hot-spot are likely to increase during package testing since test power dissipation can be high
- Main impact on the carrier’s mobility
- Slow down the device in the thermal hot-spot affected region of the chip
- Increase gate delays  → yield loss
- May cause permanent damage !
  (Hot-Carrier-Induced Defects, Dielectric Breakdown)
- Arrange test patterns is a possible solution but it increases L(di/dt) !!

\[
\mu(T) = \mu(T_0) \left( \frac{T}{T_0} \right)^\lambda
\]

2.5 Test Throughput Problem

- Test throughput is defined as number of devices tested per test equipment over a given period
- Mainly due to:
  - Reduction in test frequency during package test (to avoid thermal or power delivery issues) or wafer sort test (due to limited power availability) that induces longer test time
  - Simultaneous testing of multiple cores placed in a SoC (to avoid thermal issues) is limited - longer test time
### 2. Power Issues during Test

#### 2.6 Manufacturing Yield Loss

[Diagram showing the flow of impacts: High Instantaneous Current → Elevated Peak Power → Power Supply Noise (IR-Drop, Ldi/dt) → Significant Delay Increase due to Excessive PSN → Erroneous Behavior Only During Testing (test fail) → Manufacturing Yield Loss (Over-Kill)]

*As huge designs can be manufactured today, most power-related test issues (during scan) are due to excessive peak power.*

#### 2.7 Test Power Metrics and Estimation

<table>
<thead>
<tr>
<th>Metric</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Energy</td>
<td>Total switching activity generated during test application</td>
</tr>
<tr>
<td>$E_{\text{Total}} = \frac{1}{2} \cdot c_0 \cdot V_{DD}^2 \cdot \sum_k \sum_i s_i(k) \cdot F_i$</td>
<td></td>
</tr>
<tr>
<td>$s_i(k)$</td>
<td>Number of transitions provoked by $V_k$ at node $i$</td>
</tr>
<tr>
<td>Average Power</td>
<td>Ratio between energy dissipated during test and test time</td>
</tr>
<tr>
<td>$P_{\text{Average}} = \frac{E_{\text{Total}}}{\text{Length}_{\text{Test}} \cdot T}$</td>
<td></td>
</tr>
<tr>
<td>Impact</td>
<td>Impact the thermal load of the device</td>
</tr>
</tbody>
</table>
2. Power Issues during Test

2.7 Test Power Metrics and Estimation

• Instantaneous Power:
  • Power consumed at any given instant during test

• Peak Power:
  • Highest value of instantaneous power measured during test
  • \( P_{\text{Peak}} = \max_k P_{\text{inst}}(V_k) = \max_k \left( \frac{E_{vk}}{t_{\text{small}}} \right) \)
  • Determines the thermal and electrical limits of components and the system packaging requirements

• Weighted Switching Activity (WSA):
  • \( WSA_i = F_i \cdot S_i \) where \( F_i \) is the fanout at node \( i \) and \( S_i \) is the switching activity factor (ave. # of transitions during a time interval)
  • Often used as a metric for test power evaluation

2. Power Issues during Test

2.7 Test Power Metrics and Estimation

• Impact of the various metrics:
  • The switching activity factor (s,) affects \( E_{\text{Total}} \), \( P_{\text{average}} \) and \( P_{\text{peak}} \)
  • Clock frequency during test affects \( P_{\text{average}} \) and \( P_{\text{peak}} \)
  • Test length affects \( E_{\text{Total}} \)

• Static power dissipation:
  • For a given technology, depends only on input patterns and \( T^o \)
  • Static power dissipation is not a concern, except during during \( I_{\text{DDQ}} \) testing (reduced sensitivity) and during Burn-In test (can result in thermal runaway condition and yield loss)
2. Power Issues during Test

2.7 Test Power Metrics and Estimation

- Test power estimation:
  - Required for sign-off to avoid destructive testing and power-aware test space exploration during DfT or ATPG
  - Conventional flow adopted to perform test power estimation is simulation-based and performed at gate level
  - Estimation is performed at various PVT corners
  - Challenges for multi-million gate SoCs
    - Time-consuming
    - Memory sizes can be very large
  - The weighted transition metric is quick but approximate

\[
\text{WeightedTransitions} = \sum \left( \text{ScanChainLength} - \text{TransitionPositionInVector} \right)
\]

3. Low-Power Test Pattern Generation

Part 2
Chapter 3
Low-Power Test Pattern Generation
3.1 Low-Power ATPG

**Content**

1. General Low-Power Test Generation
2. Low-Shift-Power Scan Test Generation
3. Low-Capture-Power Scan Test Generation
   - Capture-Safety Checking
   - LCP ATPG Technique: Reversible Backtracking

---

### 3.1 Low-Power ATPG

**General Low-Power Test Generation**

- **Goal**: To create a sequence of test vectors that cause a minimal number of transitions at inputs between any two consecutive cycles.
- **Method**: Modify PODEM with new cost functions.


For each line \( l \) in a circuit and each logic value \( s \), calculate:

- **Transition Controllability** \( TC_s(l) \) to reflect the minimum numbers of weighted transitions required to set \( l \) to \( s \).
  - Use TC to guide backtrace

- **Transition Observability** \( TO_s(l) \) to reflect the minimum numbers of weighted transitions required to propagate a fault effect (the corresponding fault-free value of which is \( s \)) from \( l \) to a primary output.
  - Use TO to guide D-frontier selection

- **Transition Test Generation Cost** \( TC_s(l) \) to reflect the transition level required to detect the stuck-at-\( \overline{s} \) fault at \( l \).
  - Use TC to guide target fault selection
3.1 Low-Power ATPG

Low-Shift-Power Scan Test Generation

Assign logic values to block the impact of FF transitions.

- Use modified PODEM to maximize the # of Xs.
- Assign logic values to minimize FF transitions.


3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

Capture Operation in At-Speed Scan Testing

- LOS (Launch-on-Shift): easy ATPG (high coverage) but hard physical design
- LOC (Launch-on-Capture): easy physical design but hard ATPG (low coverage)
- LSA (Launch Switching Activity): affect timing / a more serious problem
- CSA (Capture Switching Activity): may cause FF malfunction / but may be rare
3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

**LOS-Type LSA Reduction in At-Speed Scan Testing**

- LOS-type LSA for \(<v; PPI> = <b_1 b_2 \ldots b_{n-1} b_n>\) is dominated by \(\Sigma (b_{i-1} \oplus b_i)\) for \(i = 2, \ldots, n\).
- LOS-type LSA can be readily reduced by minimizing the difference between neighbouring bits in \(<v; PPI>\).
- 0-fill, 1-fill, etc. can be used for reducing LOS-type LSA.

**LOC-Type LSA Reduction in At-Speed Scan Testing**

- LOC-type LSA is dominated by \(\Sigma (v; PPI) \oplus (F(v); PPO)\).
- Test responses must be considered in order to reduce LOC-type LSA.
  - **LOC-type LSA is hard to reduce than LOS-type LSA**
3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

Overview of Capture-Safety Checking

- Not all test vectors generated by conventional ATPG, which does not try to minimize test power, suffer from excessive LSA.
- Capture-safety checking metrics are needed to judge if a test vector will cause any problem in capture mode due to excessive LSA.

<table>
<thead>
<tr>
<th>Non-Simulation-Based Metric</th>
<th>FLIS</th>
</tr>
</thead>
<tbody>
<tr>
<td>Simulation-Based Metrics</td>
<td>Toggle</td>
</tr>
<tr>
<td></td>
<td>WSA</td>
</tr>
<tr>
<td>Region-Based Metrics</td>
<td>Toggle-Based</td>
</tr>
<tr>
<td></td>
<td>WSA-Based</td>
</tr>
<tr>
<td>Timing-Based Metrics</td>
<td>SCAP</td>
</tr>
<tr>
<td></td>
<td>CCT</td>
</tr>
</tbody>
</table>

3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

Simulation-Based Capture-Safety Checking Metrics

**Toggle Count**

- Calculated as the number of transitions in the whole circuit or a region.
- Logic simulation can be conducted with a zero-delay, a unit-delay, or even a timing model, resulting in trade-off between accuracy and CPU time.

**WSA (Weighted Switching Activity)**

- Calculated as the number of weighted transitions in the whole circuit or a region.
- Ideally, the weight of a node should be set to reflect its output capacitance.
- In practice, the weight of a node is often set as the number of its fanouts or the number of its fanouts plus 1.

3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

Region-Based Capture-Safety Checking Metrics

- The impact of switching activity is local and depends on power grid design.
- A circuit is divided into regions, bounded by power straps or with intersections of power straps as midpoints.

(J. Lee et al., Proc. DATE, pp. 1172-1177, 2008)

3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

Region-Based Capture-Safety Checking Metrics

**Toggle-Based Metric**

- **GTC (Global Toggle Count)**
  - transitions in all regions throughout the test cycle

- **GITC (Global Instantaneous Toggle Count)**
  - transitions at any time instant in all regions

- **RITC (Regional Instantaneous Toggle Count)**
  - transitions at any time instant in a specific region

**WSA-Based Metric**

\[
WSA_i(v) = \sum_{k \in R(i,j)} (g_k \times (f_k \times f_{\neg k}))
\]

\(g_k\) is the weight of gate \(k\) in \(R(i,j)\), \(f_k\) is the number of its fanouts, \(f_{\neg k}\) is its fanout load weight, and \(f_k\) is 1 (0) if a transition occurs (does not occur) at gate \(k\).

(J. Lee et al., Proc. DATE, pp. 1172-1177, 2008)
3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

**LCP ATPG Technique: Reversible Backtracking**

[Basic Concept]

**D-Conflict**
Current input assignments are found to be unable to detect the target fault.

Conventional Backtrack

**Ci-Conflict**
Current input assignments cause different logic values between PPI and PPO.

Reversible Backtrack
(Backtrack decision is reversed if it cause a fault to become undetected.)

\[(X. Wen et al., Proc. VTS, pp. 58-63, 2006)\]

3.1 Low-Power ATPG

Low-Capture-Power Scan Test Generation

**LCP ATPG Technique: Reversible Backtracking**

[Collecting Restoration Information]

**C1-Conflict**

| a: 0 |
| b: 1 |
| c: 0 → c: 1 |
| d: 0 → d: 1 |

**D-conflict**

\[(X. Wen et al., Proc. VTS, pp. 58-63, 2006)\]
3.2 Low-Power Test Compaction

**Content**

- Low-Shift-Power Static Dynamic Compaction
- Low-Capture Static Compaction

- Transitions at different bit positions in a test cube have different degrees of impact on shift-in power.

\[
\text{Weighted Transitions} = \sum (\text{Scan Chain Length} - \text{Transition Position})
\]

3.2 Low-Power Test Compaction

Low-Power Static Compaction

Low-Capture-Power Static Compaction

- The goal is to evenly distribute the LSA of a test vector across the entire chip rather than allowing high LSA to occur in a small area.
- The chip is divided into regions, and the extended WSA metric is used to calculate the LSA profile for each region.

\[
V_1 = \langle 1 \ 0 \ X \ X \ 0 \rangle \\
V_m = \langle 1 \ 0 \ X \ X \ 0 \rangle \\
V_2 = \langle X \ 1 \ X \ 0 \rangle
\]

\[\text{Compatible}\]

\[\text{Capture-Safe}\]

- Test cubes v1 and v2 are compatible. Since there is no overlapping of risky regions of their LSA profiles, the resulting merged test cube vm is capture-safe.

(J. Lee et. al., Proc. VTS, pp. 227-232, 2008)

3.3 Low-Power X-Filling

Outline

<table>
<thead>
<tr>
<th>Low-Shift-Power X-Filling</th>
<th>Low-Capture-Power X-Filling</th>
<th>Low-Shift-and-Capture-Power X-Filling</th>
</tr>
</thead>
<tbody>
<tr>
<td>Shift-In Power Reduction</td>
<td>Shift-Out Power Reduction</td>
<td>Total Shift Power Reduction</td>
</tr>
<tr>
<td>0-fill</td>
<td>output-justification-based X-filling</td>
<td>MTR-fill</td>
</tr>
<tr>
<td>1-fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>MT-fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>adjacent fill / repeat fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>FF-Oriented</td>
<td>Node-Oriented</td>
<td>Critical-Area-Oriented</td>
</tr>
<tr>
<td>PMF-fill</td>
<td>PWT-fill</td>
<td>CCT-fill</td>
</tr>
<tr>
<td>LCP-fill</td>
<td>state-sensitive X-filling</td>
<td></td>
</tr>
<tr>
<td>preferred fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>JP-fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>CTX-fill</td>
<td></td>
<td></td>
</tr>
<tr>
<td>Low-Power X-Filling for Compressed Scan Testing</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0-fill</td>
<td>PHS-fill</td>
<td>CJP-fill</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>
3.3 Low-Power X-Filling

Content

1. Test Cube Preparation
   - Direct Generation
   - Test Relaxation

2. Low-Shift-Power X-Filling
   - Shift-In Power Reduction
   - Shift-Out Power Reduction
   - Total Shift Power Reduction

3. Low-Capture-Power X-Filling
   - FF-Oriented X-Filling

4. Low-Shift-and-Capture-Power X-Filling
   - Bounded Adjacent Fill

5. Low-Power X-Filling for Compressed Scan Testing
   - X-Filling in Broadcast-Based Test Compression
   - X-Filling for Linear Decompressor-Based Test Compression

Test Cube Generation

Direct Generation

Test Cube Generation for Primary Fault Detection

Dynamic Compaction

Fully-Specified Test Cube

Random-Fill

Partially-Specified Test Cube

- Random-fill increases the chances of accidental detection, resulting in smaller test vector count.
- Disabling random-fill may increase test vector count.

\[ For 7 \text{ industrial circuits (229K ~ 2.01M), test vector count increased by 144.8\% on average when random-fill was not conducted} \]

3.3 Low-Power X-Filling

Test Cube Generation

Test Relaxation
[Concept]

foreach test vector \(v_i\) in \(T\) {
    \(EF = \text{find\_essential\_fault}(v_i)\); /* Step-1 */
    \(\text{initial}_c = \text{create\_test\_cube}(EF)\); /* Step-2 */
    \(3\_\text{valued\_fault\_simulation}(\text{initial}_c)\); /* Step-3 */
}

foreach test vector \(t_i\) in \(T\) {
    \(UF = \text{find\_undetected\_fault}(v_i, c_i)\); /* Step-4 */
    \(\text{final}_c = \text{adjust\_test\_cube}(UF)\); /* Step-5 */
    \(3\_\text{valued\_fault\_simulation}(\text{final}_c)\); /* Step-6 */
}

"Property" is usually the fault coverage for various fault models.
Fault simulation, justification, and implication are used to find X-bits.
60%-80% of bits in a compact test set are usually identified as X-bits.

3.3 Low-Power X-Filling

Low-Shift-Power X-Filling

*Shift-In Power Reduction*

- 0-Fill: Fill all X-bits with 0.
- 1-Fill: Fill all X-bits with 1.
- MT-Fill: If the specified bits on both sides of an X-string have the same logic value, the X-string is filled with that logic value; if the specified bits on the two sides of an X-string have opposite logic values, the X-string is filled with an arbitrary logic value.
- Adjacent-Fill: Fill any X-bit with the nearest care-bit from the input side.

0-Fill: \[\begin{array}{cccc}
0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 1 & 0 & 0 \\
0 & 1 & 1 & 0
\end{array}\]

1-Fill: \[\begin{array}{cccc}
1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1
\end{array}\]

MT-Fill: \[\begin{array}{cccc}
0 & 0 & 0 & 0 \\
1 & 1 & 1 & 1 \\
0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0
\end{array}\]

Adjacent-Fill: \[\begin{array}{cccc}
0 & 0 & 0 & 0 \\
1 & 1 & 1 & 1 \\
1 & 1 & 1 & 1 \\
0 & 0 & 0 & 0
\end{array}\]

3.3 Low-Power X-Filling

Low-Shift-Power X-Filling

*Shift-Out Power Reduction*

- 0-controllability = 0 / 1-controllability = \(\infty\)
- 1-controllability = \(\infty\) / 1-controllability = 0
- \(X\): 0-controllability = 1 / 1-controllability = 10 (original value = 0)
- \(X\): 0-controllability = 10 / 1-controllability = 1 (original value = 1)

Test Vector v

Test Cube c

- Create a test cube from a test vector by bit-stripping.
- Use justification to change values for some bits in the test response so as to reduce shift-out power.

3.3 Low-Power X-Filling

**Low-Shift-Power X-Filling**

*Total Shift Power Reduction*

- Use a metric to estimate the total shift (shift-in and shift-output) switching activity.
- TWTM (*Total Weighted Transition Metric*)

\[
TWTM(t_i, r_i) = \sum_{j=1}^{l-1} (t_{i,j} \oplus t_{i,j+1}) \times j + \sum_{j=1}^{l-1} (r_{i,j} \oplus r_{i,j+1}) \times j
\]

- MTR-fill (*Minimum Transition Random X-filling*)
  - First, fill a test cube \( c_i \) with \( n \) X-bits by MT-fill to create an solution test vector \( t_i \).
  - Then, flip the \( n \) X-bit positions in \( c_i \), one by one, to see if it results in a test vector of a smaller TWTM. If so, replace \( t_i \) by the test vector.


3.3 Low-Power X-Filling

**Low-Capture-Power X-Filling**

*FF-Oriented X-Filling*  
[Basics]

- The basic idea is FF-silencing, i.e., trying to equalize the input and output of an FF.

- There are four types of PPI-PPO bit pairs.

<table>
<thead>
<tr>
<th>Bit ( p ) in ( &lt;v, PPI&gt; )</th>
<th>Bit ( q ) in ( &lt;f(v): PPO&gt; )</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 or 1</td>
<td>0 or 1</td>
</tr>
<tr>
<td>X</td>
<td>Type-A</td>
</tr>
<tr>
<td>X</td>
<td>Type-B</td>
</tr>
<tr>
<td></td>
<td>Type-C</td>
</tr>
<tr>
<td></td>
<td>Type-D</td>
</tr>
</tbody>
</table>

- Different FF-oriented X-filling methods handle Type-B, Type-C, and Type-D pairs differently.

3.3 Low-Power X-Filling

**Low-Capture-Power X-Filling**

*FF-Oriented X-Filling*

[Random FF-Silencing]

- PMF-Fill (Progressive Match Filling):
  - Use assignment for Type-B bit-pairs.
  - Use justification for Type-C bit-pairs.
  - Randomly select $n$ Type-D bit-pairs, and randomly assign logic values to corresponding PPI X-bits.

- A smaller $n$ leads to a more effective reduction but at the cost of a longer execution time.

(W. Li et. al., Proc. IEEE Annual Symp. on VLSI, pp. 151-161, 2005)

3.3 Low-Power X-Filling

**Low-Capture-Power X-Filling**

*FF-Oriented X-Filling*

[Justification-Based FF-Silencing]

- LCP-Fill (Low-Capture Power X-Filling):
  - Highly effective due to the use of deterministic ATPG techniques.
  - Relatively long CPU time due to multi-pass processing.

3.3 Low-Power X-Filling

Low-Capture-Power X-Filling

FF-Oriented X-Filling

[Probability-Based FF-Silencing]

- Preferred Fill:

  - (0-Prob, 1-Prob) → Preferred Values
  - (1.00, 0.00) → (0.73, 0.27)
  - (0.50, 0.50) → (0.49, 0.51)
  - (0.50, 0.50) → (0.93, 0.07)
  - (0.00, 1.00) → (0.50, 0.50)

  - Relatively short CPU time due to single-pass processing.
  - Effectiveness depends on the accuracy of probability calculation.


3.3 Low-Power X-Filling

Low-Capture-Power X-Filling

FF-Oriented X-Filling

[Justification-Probability-Based FF-Silencing]

- JP-Fill (Justification-Probability-Based X-Filling):

  - Use justification and multiple passes to improve its effectiveness.
  - Use probability-based parallel logic value determination to improve its scalability.

3.3 Low-Power X-Filling

Low-Shift and-Capture-Power X-Filling

Bounded Adjacent Fill

- The basic idea is to first set several X-bits in a test cube to 0 and then conduct adjacent fill.
- The occurrence of 0 in the resulting fully-specified test vector is increased, which helps reduce shift-out and capture power.
- At the same time, applying adjacent fill helps reduce shift-in power.


3.3 Low-Power X-Filling

Low-Power X-Filling for Compressed Scan Testing

X-Filling in Broadcast-Based Test Compression

[K. Miyase et. al., Proc. ICCAD, 2009]
3.3 Low-Power X-Filling

Low-Power X-Filling for Compressed Scan Testing

X-Filling in Broadcast-Based Test Compression

Extended Circuit Model

Any low-capture-power X-filling method can be applied.

(K. Miyase et. al., Proc. ICCAD, 2009)

3.3 Low-Power X-Filling

Low-Power X-Filling for Compressed Scan Testing

X-Filling for Linear Decompressor-Based Test Compression

- CJP-Fill (Compressible JP-Fill):

- X-classification is to separate implied X-bits (which must have certain logic values) from free X-bits (which can have any logic values, provided that they are filled one at a time).

- Compatible free bit set (CFBS) identification is to find a set of free X-bits that can be filled with any logic values simultaneously without affecting compressibility.

3.4 Low-Power Test Ordering

Content

1. Internal-Transition-Based Ordering
2. Inter-Vector-Hamming-Distance-Based Ordering

Internal-Transition-Based Ordering

- Guide test vector ordering by transitions at internal nodes (FFs and/or gates).
- A complete graph, called transition graph (TG), is used to represent transitions.

- A optimal test vector order is obtained by finding a node sequence with the smallest edge-weight sum.
  - \( \text{NP-complete traveling salesman problem} \)
- A good test vector order is obtained by finding a Hamiltonian path with the smallest edge-weight sum.
  - \( \text{Time complexity: } O(n \log n) \)

3.4 Low-Power Test Ordering

Inter-Vector-Hamming-Distance-Based Ordering

- Building a TG needs $n \times (n - 1)$ logic simulations for a circuit of $n$ nodes, which can be prohibitively time-consuming for a large circuit.
- Using the Hamming distance between $V_i$ and $V_j$ as the weight of the edge from $V_i$ to $V_j$ will greatly reduce the CPU time needed for building a TG.
- There is a good correlation between Hamming distance and circuit activity.


3.5 Low-Power Memory Test Generation

Content

1. Address Switching Activity Reduction

2. Precharge Restriction
3.5 Low-Power Memory Test Generation

Address Switching Activity Reduction

<table>
<thead>
<tr>
<th>Zero-One</th>
<th>Original Test Algorithm</th>
<th>Low-Power Test Algorithm</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>( (W_0); \uparrow (R_0); \uparrow (W_1); \uparrow (R_1); )</td>
<td>( \uparrow_z (W_0, R_0, W_1, R_1); )</td>
</tr>
</tbody>
</table>

| Checker Board    | \( (W(1_{odd}/0_{even})); \uparrow (R(1_{odd}/0_{even})); \) | \( \uparrow_z (W(1_{odd}/0_{even}), R(1_{odd}/0_{even}), \)  \(-W(0_{odd}/1_{even}), R(0_{odd}/1_{even}); \) |

- Reorder test patterns to minimize switching activity on address lines without any fault coverage loss.
- Original test algorithm’s addressing order: \( \uparrow \) (single bit change)
- Low-power test algorithm’s addressing order: \( \uparrow_z \) (single bit change)

Example: \( 00 \rightarrow 01 \rightarrow 11 \rightarrow 10 \)


3.5 Low-Power Memory Test Generation

Precharge Restriction

- Precharge circuitry in SRAM precharges and equalizes long and high capacitive bit lines for correct memory operations, and may take 70% of power dissipation.
- The test addressing sequence is fixed, and there is no need to precharge all columns.
- Add to precharge control logic (MUX + NAND) to precharge only the current column and the next column to be accessed.

(P. Dilillo et. al., Proc. DATE, pp. 1159-1165, 2006)
4. Power-Aware Design-for-Test

Chapter 4
Power-Aware Design-for-Test

4.1 Overview of Power-Aware DFT

• Low-power scan cells
• Scan path organization
• Partitioning for low power
4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

- Level sensitive scan design (LSSD) cell
  - Non-overlapping clocks
  - Robustness against shift-power violations

(Stojanovic and Oklobdzija., JSSCC, pp. 536-548, 1999.)

4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

Pseudo - Random Test Sequence

- Useless patterns do not detect any faults
  - Pattern suppression
  - Deactivate scan clocks when no fault coverage improvement

(P. Girard et al., Proc. VTS, pp. 407-412, 1999.)
4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

- Decoder used for pattern suppression
- Fault simulation
- Logic minimization

(S. Gerstendorfer and H.-J. Wunderlich, Proc. ITC, pp. 77-84, 1999)

### Implementing the pattern decoder

(S. Gerstendorfer and H.-J. Wunderlich, Proc. ITC, pp. 77-84, 1999)
4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

- DFT with parallel scan chains
- Clock gating per chain (for better granularity)
- Test hold register and local clock buffers

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)

4.2 Low-power scan cells / clock gating

- Partial scan
  - Disable clocks on non-scan flip-flops or latches

(C. Zoellin et al., Proc. ITC, paper 32.3, 2006)
4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

Different scan chains can be activated based on multiple sensitizations of the same fault

(C. Zoellin et al., Proc. ITC, paper 32.3, 2006)

- Toggle suppression during shift
- Master-slave structure of a mux-D flip-flop is modified
- Gate the data output during shift

4. Power-Aware Design-for-Test

4.2 Low-power scan cells / clock gating

A different implementation of toggle suppression during shift using a latch at the output of the mux-D flip-flop with scan


4.3 Scan path organization

Only one scan segment is activated at a time

4. Power-Aware Design-for-Test

4.3 Scan path organization

- Scan segmentation
- Partitioning scan cell in multiple scan chains such that only one of them is activated at a time
- Applicable also launch-off-shift and launch-off-capture

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)
4. Power-Aware Design-for-Test

4.3 Scan path organization

• Scan cell clustering using hyper-graph partitioning

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)

---

4. Power-Aware Design-for-Test

4.3 Scan path organization

| Test Pattern | 0 | 1 | 0 | 0 | 0 |
| Test Response | 1 | 0 | 1 | 0 |

• Scan cell order and its influence on the number of transitions

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)
4. Power-Aware Design-for-Test

4.3 Scan path organization

Using vectors and responses to compute edge weights before tackling scan cell ordering through the travelling salesman problem

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)

4. Power-Aware Design-for-Test

4.3 Scan path organization

Test Cube   $0 \times 1 \times 0$

Filled Pattern

Response captured in scan cells

Shifted In

Observed at Scan Out

Inserting logic into scan chains to modify the transition count during shift

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)
4. Power-Aware Design-for-Test

4.3 Scan path organization

Scan segment inversion by embedding a linear function in the scan path

(courtesy: H.-J. Wunderlich and C. Zoellin, Univ Stuttgart)

4.4 Partitioning for low-power

Isolating multiple cores of logic using boundary registers

5. Power-Aware BIST and Test Data Compression

Chapter 5
Power-Aware BIST and Test Data Compression

5.1 Overview of power-aware BIST and compression

- Coding-based compression methods
- LFSR-decompression-based compression methods
- Low-power BIST techniques
5. Power-Aware BIST and Test Data Compression

### 5.2 Coding-based compression

<table>
<thead>
<tr>
<th>Group</th>
<th>Run-Length</th>
<th>Group Prefix</th>
<th>Tail</th>
<th>Codeword</th>
</tr>
</thead>
<tbody>
<tr>
<td>A₁</td>
<td></td>
<td></td>
<td>00</td>
<td>000</td>
</tr>
<tr>
<td></td>
<td>0</td>
<td></td>
<td>01</td>
<td>001</td>
</tr>
<tr>
<td></td>
<td>1</td>
<td></td>
<td>01</td>
<td>010</td>
</tr>
<tr>
<td></td>
<td>2</td>
<td></td>
<td>11</td>
<td>011</td>
</tr>
<tr>
<td></td>
<td>3</td>
<td></td>
<td>00</td>
<td>1000</td>
</tr>
<tr>
<td>A₂</td>
<td>4</td>
<td>10</td>
<td>00</td>
<td>1001</td>
</tr>
<tr>
<td></td>
<td>5</td>
<td></td>
<td>10</td>
<td>1010</td>
</tr>
<tr>
<td></td>
<td>6</td>
<td></td>
<td>11</td>
<td>1011</td>
</tr>
<tr>
<td>A₃</td>
<td>8</td>
<td>110</td>
<td>00</td>
<td>11000</td>
</tr>
<tr>
<td></td>
<td>9</td>
<td></td>
<td>01</td>
<td>11001</td>
</tr>
<tr>
<td></td>
<td>10</td>
<td></td>
<td>10</td>
<td>11010</td>
</tr>
<tr>
<td></td>
<td>11</td>
<td></td>
<td>11</td>
<td>11011</td>
</tr>
</tbody>
</table>

Golomb code

(courtesy: K. Chakrabarty and S.K. Goel, Duke Univ)

---

5. Power-Aware BIST and Test Data Compression

### 5.2 Coding-based compression

<table>
<thead>
<tr>
<th>Group</th>
<th>Run-Length</th>
<th>Group Prefix</th>
<th>Tail</th>
<th>Codeword</th>
</tr>
</thead>
<tbody>
<tr>
<td>A₁</td>
<td>0</td>
<td>0</td>
<td>00</td>
<td>00</td>
</tr>
<tr>
<td></td>
<td>1</td>
<td></td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>A₂</td>
<td>2</td>
<td>10</td>
<td>00</td>
<td>1000</td>
</tr>
<tr>
<td></td>
<td>3</td>
<td></td>
<td>01</td>
<td>1001</td>
</tr>
<tr>
<td></td>
<td>4</td>
<td></td>
<td>10</td>
<td>1010</td>
</tr>
<tr>
<td></td>
<td>5</td>
<td></td>
<td>11</td>
<td>1011</td>
</tr>
<tr>
<td>A₃</td>
<td>6</td>
<td>110</td>
<td>000</td>
<td>110000</td>
</tr>
<tr>
<td></td>
<td>7</td>
<td></td>
<td>001</td>
<td>110001</td>
</tr>
<tr>
<td></td>
<td>8</td>
<td></td>
<td>010</td>
<td>110010</td>
</tr>
<tr>
<td></td>
<td>9</td>
<td></td>
<td>011</td>
<td>110011</td>
</tr>
<tr>
<td></td>
<td>10</td>
<td></td>
<td>100</td>
<td>110100</td>
</tr>
<tr>
<td></td>
<td>11</td>
<td></td>
<td>101</td>
<td>110101</td>
</tr>
<tr>
<td></td>
<td>12</td>
<td></td>
<td>110</td>
<td>110110</td>
</tr>
<tr>
<td></td>
<td>13</td>
<td></td>
<td>111</td>
<td>110111</td>
</tr>
</tbody>
</table>

Frequency-directed run-length (FDR) code

(courtesy: K. Chakrabarty and S.K. Goel, Duke Univ)
5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

<table>
<thead>
<tr>
<th>Group</th>
<th>a =0</th>
<th>a =1</th>
<th>Group Prefix</th>
<th>Tail</th>
<th>Codeword</th>
</tr>
</thead>
<tbody>
<tr>
<td>A_1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>00</td>
</tr>
<tr>
<td></td>
<td>1</td>
<td>1</td>
<td></td>
<td>1</td>
<td>01</td>
</tr>
<tr>
<td>A_2</td>
<td>2</td>
<td>2</td>
<td>10</td>
<td>00</td>
<td>1000</td>
</tr>
<tr>
<td></td>
<td>3</td>
<td>3</td>
<td></td>
<td>01</td>
<td>1001</td>
</tr>
<tr>
<td></td>
<td>4</td>
<td>4</td>
<td></td>
<td>10</td>
<td>1010</td>
</tr>
<tr>
<td></td>
<td>5</td>
<td>5</td>
<td></td>
<td>11</td>
<td>1011</td>
</tr>
<tr>
<td>A_3</td>
<td>6</td>
<td>6</td>
<td>110</td>
<td>000</td>
<td>110000</td>
</tr>
<tr>
<td></td>
<td>7</td>
<td>7</td>
<td></td>
<td>001</td>
<td>110001</td>
</tr>
<tr>
<td></td>
<td>8</td>
<td>8</td>
<td></td>
<td>010</td>
<td>110010</td>
</tr>
<tr>
<td></td>
<td>9</td>
<td>9</td>
<td></td>
<td>011</td>
<td>110011</td>
</tr>
<tr>
<td></td>
<td>10</td>
<td>10</td>
<td></td>
<td>100</td>
<td>110100</td>
</tr>
<tr>
<td></td>
<td>11</td>
<td>11</td>
<td></td>
<td>101</td>
<td>110101</td>
</tr>
<tr>
<td></td>
<td>12</td>
<td>12</td>
<td></td>
<td>110</td>
<td>110110</td>
</tr>
<tr>
<td></td>
<td>13</td>
<td>13</td>
<td></td>
<td>111</td>
<td>110111</td>
</tr>
</tbody>
</table>

Alternate run-length code
(courtesy: K. Chakrabarty and S.K. Goel, Duke Univ)

5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

Maximum scan chain length (m)

| T1: | x x x 1 | 0 x 0 0 x x 1 x x x |
| L1=4 | L2=7 | L3=6 |
| T2: | x 0 x x x x x x 1 x x 0 x x |
| L2=7 | L1=4 | L4=8 |
| T3: | 0 x x 1 x x 0 1 x x 1 x x |
| L3=6 | L5=1 | L5=9 |
| T4: | x 1 0 x 0 x 1 x x 1 x x |
| L7=6 | L6=9 |

Characters (Block Sizes) (L_i)

<table>
<thead>
<tr>
<th>Character</th>
<th>Occurrence Frequency (f)</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>4</td>
<td>3</td>
</tr>
<tr>
<td>5</td>
<td>1</td>
</tr>
<tr>
<td>6</td>
<td>1</td>
</tr>
<tr>
<td>7</td>
<td>2</td>
</tr>
<tr>
<td>8</td>
<td>1</td>
</tr>
<tr>
<td>9</td>
<td>2</td>
</tr>
</tbody>
</table>

Final Sequence (L_i[v]):


The application of run-length coding
(courtesy: K. Chakrabarty and S.K. Goel, Duke Univ)
5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

The application of Huffman coding
(courtesy: K. Chakraborty and S.K. Goel, Duke Univ)

5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

(N- bit buffer)

The application of Huffman coding
(courtesy: K. Chakraborty and S.K. Goel, Duke Univ)

5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

(N- bit buffer)

The application of Huffman coding
(courtesy: K. Chakraborty and S.K. Goel, Duke Univ)
5. Power-Aware BIST and Test Data Compression

5.2 Coding-based compression

<table>
<thead>
<tr>
<th>Slice</th>
<th>Slice code</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>XX00 010X</td>
<td>00 0101</td>
<td>Start a new slice, map X to 0, set bit 5 to 1</td>
</tr>
<tr>
<td>1110 0001</td>
<td>00 0111</td>
<td>Start a new slice, map X to 0, set bit 7 to 1</td>
</tr>
<tr>
<td></td>
<td>11 0000</td>
<td>Enter group-copy-mode, starting from bit 0 (i.e., group 0)</td>
</tr>
<tr>
<td></td>
<td>11 1110</td>
<td>The data is 1110</td>
</tr>
<tr>
<td>XXXX XX11</td>
<td>01 1000</td>
<td>Start a new slice, map X to 1, no bits are set to 0</td>
</tr>
</tbody>
</table>

Selective encoding - example


5.3 LFSR-based compression

Conventional EDT does random filling

Identify clusters in the test cube that can have all the scan slices mapped onto the same value

Push the decompressor in self-loop states during encoding for low-power fill

(F. Czyz et al., Proc. VTS, pp. 75-83, 2007)
5. Power-Aware BIST and Test Data Compression

5.4 Low-power BIST

Pattern suppression during BIST with re-seeding
(S. Manich et. al., JETTA, vol. 16, issue 3, 2000)

Dual-speed LFSR - concept
5. Power-Aware BIST and Test Data Compression

5.4 Low-power BIST

S | N | S | N
---|---|---|---
0001 | 00 | 01 | 01
1000 | 10 | 00 | 01 | 10
0100 | 01 | 00 | 01 | 11
0010 | 00 | 10 | 01 | 00
1001 | 10 | 01 | 10 | 01
1100 | 11 | 00 | 10 | 10
0110 | 01 | 10 | 10 | 11
1011 | 10 | 11 | 10 | 00
0101 | 01 | 11 | 11 | 01
1010 | 10 | 10 | 11 | 10
1110 | 11 | 01 | 11 | 11
1111 | 11 | 11 | 00 | 01
0111 | 01 | 11 | 00 | 10
0011 | 00 | 11 | 00 | 11
0000 | 00 | 00 | 00 | 00

Dual-speed LFSR - example


Low power test pattern generator with staggered clocks

(P. Girard et. al., Proc. VTS, pp. 306-311, 2001)
5. Power-Aware BIST and Test Data Compression

5.4 Low-power BIST

- Circuit partitioning for low-power BIST
  - Centralized BIST and controller
  - Distributed approach where BIST sessions are power-conscious

(courtesy: K. Chakrabarty and S.K. Goel, Duke Univ)

6. Power-Aware System-Level Test Planning

Chapter 6
Power-Aware System-Level Test Planning
6. Power-Aware System-Level Test Planning

6.1 Overview of power-aware system test

- Core-based test architecture and planning
- Power modeling and estimation
- Power-constrained test planning

6.2 Core-based test architecture

- Test planning using basing components
  - Test source and sink
  - Test access mechanism (TAM)
  - Test schedule

(courtesy: E. Larsson, Linkoping Univ)
6. Power-Aware System-Level Test Planning

6.3 Power modeling and estimation

A sample test schedule

(courtesy: E. Larsson, Linkoping Univ)
6. Power-Aware System-Level Test Planning

6.4 Power-constrained test planning

![Resource Allocation Graph]

![Test Incompatibility Graph]


![Power Profile Graph]

Power profile can be modified by pattern modification and/or test set reordering.
6. Power-Aware System-Level Test Planning

6.4 Power-constrained test planning

- Test power:
  - \( T_0, T_2 \) 5W
  - \( T_1, T_3 \) 3W
  - \( T_0, T_3 \) 12W
  - \( T_2, T_3 \) 2W

- Power constraint at 15 W:
  - Time: T0-T1-T2-T3

- Power constraint at 13 W:
  - Time: T0-T1-T3

---

6. Power-Aware System-Level Test Planning

6.4 Power-constrained test planning

- Temperature dependent test:
  - TS1 = (T0, T2) 130 C
  - TS2 = (T1, T3) 70 C
  - TS3 = (T0, T3) 80 C
  - TS4 = (T1) 60 C
  - TS5 = (T2) 80 C

- Temp constraint at 90 C:
  - Time: TS1-TS2

- Temp constraint at 90 C:
  - Time: TS3-TS4-TS5
6. Power-Aware System-Level Test Planning

6.4 Power-constrained test planning

Avoiding peak capture power violations by skewing the shift cycles

(courtesy: E. Larsson, Linkoping Univ)

7. Low-Power Design Techniques and Test Implications

Part 3
Chapter 7
Low-Power Design Techniques
and Test Implications
### 7. Low-Power Design Techniques and Test Implications

#### 7.1 Low Power Design Trends

<table>
<thead>
<tr>
<th>System &amp; Architecture</th>
<th>IC Design &amp; Implementation</th>
</tr>
</thead>
<tbody>
<tr>
<td>• Voltage / Frequency Scaling</td>
<td>• Clock Gating</td>
</tr>
<tr>
<td>• Architecture (parallel, well managed pipeline, etc.)</td>
<td>• Multiple Supply Voltage</td>
</tr>
<tr>
<td>• Others (H/S partitioning, instruction set, algorithms, etc.)</td>
<td>• Multiple Threshold Voltage</td>
</tr>
<tr>
<td>• Gate sizing to equalize paths</td>
<td>• Substrate-Bias</td>
</tr>
<tr>
<td>• Logic restructuring to avoid hazards</td>
<td>• Power Gating</td>
</tr>
<tr>
<td>• Memory Bit Cell and Compiler</td>
<td>• Others</td>
</tr>
<tr>
<td>• Others</td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Circuit (Logic) Design</th>
<th>Process Technology</th>
</tr>
</thead>
<tbody>
<tr>
<td>• Low Power Cell Library</td>
<td>• Reduce Vdd</td>
</tr>
<tr>
<td>• Clock gating and power gating</td>
<td>• Threshold Voltage Option</td>
</tr>
<tr>
<td>• Logic design</td>
<td>• Low Capacitance Dielectric</td>
</tr>
<tr>
<td>• Technology mapping</td>
<td>• New Gate Oxide Material</td>
</tr>
<tr>
<td>• Cell sizing and wire sizing</td>
<td>• Transistor Sizing</td>
</tr>
<tr>
<td>• Voltage scaling, multi-Vdd, multi-Vth</td>
<td>• Others</td>
</tr>
<tr>
<td>• Floor planning and placement</td>
<td></td>
</tr>
<tr>
<td>• …</td>
<td></td>
</tr>
</tbody>
</table>

#### Efficiency of LPD Techniques *

<table>
<thead>
<tr>
<th>Technique</th>
<th>Efficiency</th>
</tr>
</thead>
<tbody>
<tr>
<td>Micro architecture (pipelining, parallelism)</td>
<td>50%</td>
</tr>
<tr>
<td>Clock gating and power gating</td>
<td>40%</td>
</tr>
<tr>
<td>Logic design</td>
<td>20%</td>
</tr>
<tr>
<td>Technology mapping</td>
<td>30%</td>
</tr>
<tr>
<td>Cell sizing and wire sizing</td>
<td>30%</td>
</tr>
<tr>
<td>Voltage scaling, multi-Vdd, multi-Vth</td>
<td>75%</td>
</tr>
<tr>
<td>Floor planning and placement</td>
<td>25%</td>
</tr>
<tr>
<td>…</td>
<td></td>
</tr>
</tbody>
</table>

(K. Keutzer, UC Berkeley, SDC panel, 2005)
7. Low-Power Design Techniques and Test Implications

7.1 Low Power Design Trends

Leakage dominates Power equation beyond 70nm

* Today, leakage power amounts to almost 40% of the total power of a microprocessor

(source: INTEL)

7. Low-Power Design Techniques and Test Implications

7.1 Low Power Design Trends

Main LPD techniques

<table>
<thead>
<tr>
<th>Power reduction</th>
<th>Dynamic</th>
<th>Leakage</th>
</tr>
</thead>
<tbody>
<tr>
<td>Clock gating</td>
<td>✓</td>
<td></td>
</tr>
<tr>
<td>Power gating</td>
<td>✓</td>
<td>✓</td>
</tr>
<tr>
<td>Multi-Voltage domains</td>
<td></td>
<td>✓</td>
</tr>
<tr>
<td>Multi-Threshold cells</td>
<td></td>
<td>✓</td>
</tr>
</tbody>
</table>

These techniques are often combined together to achieve the maximum power optimization value

(source: INTEL)
7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - PM infrastructure

<table>
<thead>
<tr>
<th>PM Structure</th>
<th>Usage</th>
</tr>
</thead>
<tbody>
<tr>
<td>Gated Clocks</td>
<td>Gates clock going to a sequential block</td>
</tr>
<tr>
<td>Power Switches</td>
<td>(SLEEP=1) cuts off VDD from going to core</td>
</tr>
<tr>
<td>Isolation Cells</td>
<td>(Enable=1) sets output of domain to 0 when power is shut down</td>
</tr>
<tr>
<td>State Retention FF</td>
<td>(Save=1) ensures that data is saved when power domain is OFF.</td>
</tr>
<tr>
<td></td>
<td>(Restore=1) Restores the saved data when power domain wakes up</td>
</tr>
</tbody>
</table>

(courtesy: Srivaths Ravi, TI)

7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Clock Gating

- Reduces dynamic power by restricting clock distribution
  (clock switching in idle logic blocks is shut off)
- Low overhead technique
- Widely adopted and supported by existing EDA tools
7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Multiple Supply Voltage

- Vdd scaling results in quadratic power reduction ($P=kCV^2f$)
- Adopted techniques are:
  - Static voltage scaling
  - Dynamic voltage scaling
  - DVFS
- Creation of "power islands"
- Reduces dynamic power dissipation
- Requirement / implications:
  - Level shifters to let signals cross power domain boundaries
  - Lower margin for IR drops / increased noise susceptibility

(source: CADENCE, 2007)
7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Multiple Supply Voltage

• Dynamic voltage and frequency scaling (example)

![Chart showing processor utilization and energy used for conventional on/off power management and dynamic voltage scaling]

• Design flow is similar to MSV flow, but more complex control mechanism and more complicated timing analysis

(R. Aitken, ARM, SDC panel, 2005)

7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Multiple Threshold Voltage

• Threshold voltage scales down (with supply voltage) to deliver circuit performance, but leakage power increases exponentially with threshold voltage reduction

• Speed cost to decrease leakage !

![Diagram showing leakage reduction and speed improvement for low Vt cell (high speed), normal Vt cell, and high Vt cell (low leakage)]

Leakage Reduction

Speed Improvement

(source: CADENCE, 2007)
7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Multiple Threshold Voltage

- MTV designs use high-Vt cells to decrease leakage current where performance is not critical (transistors on non-critical paths)
- Leakage power reduction while meeting timing and no area overhead
- Well established and supported by existing EDA tools

7.1 LPD Trends - Power Gating

- Used to shut down blocks (power domains) that are not in use (idle mode) hence reducing leakage power and dynamic power
- Done by using Multiple-Threshold CMOS (MTCMOS) switches
7. Low-Power Design Techniques and Test Implications

7.1 LPD Trends - Power Gating

• Adopted in regular structures (data paths) where the gating transistor can be easily shared
• Power gating cells usually cost 10% area overhead and about 2% performance degradation
• Power-on and power-down sequences can be extremely complex to design and verify
• Careful sizing of gating transistors (wide enough to sustain worst-case switching condition)
• Impact on the design flow may be high

7.2 Power Specification Formats

• Need of specifying properties of a circuit block wrt power dissipation during design, verification and implementation
• Interpreted by designers and design automation tools
• To achieve a unified and efficient design flow, various EDA tools need to use a common language
  • Common Power Format (CPF) created by Cadence
  • Unified Power Format (UPF) supported Synopsys, Mentor and Magma and standardized by IEEE (P1801 std. working group)
• 90% same concepts, but different syntaxes
• Both formats are based on the Tool Control Language (TCL) embedded in most EDA tools
7. Low-Power Design Techniques and Test Implications

7.2 Power Specification Formats

- Used to describe the following specifications:
  - Voltage domains, Power domains, Isolation logic, Retention registers, Always-on cells, Power switches
  - The various power modes also need to be specified
  - Operating environment details (process, temperature, operating voltage data, and leakage calculation) are not part of UPF
  - UPF-based low power design flow →
  - Example: creation of power domains

\[
\text{create\_power\_domain pdTOP} \\
\text{create\_power\_domain pd1 \cdot \text{elements U1}} \\
\text{create\_power\_domain pd2 \cdot \text{elements U2}} \\
\text{create\_power\_domain pd3 \cdot \text{elements U3}}
\]

7.3 Implications to Test Requirement and Cost

- Power management techniques are attractive since they can achieve maximum performance under a power-temperature envelope
- But they can have undesirable consequences on Test !!!

Power during test is even more critical for low-power devices !!
7. Low-Power Design Techniques and Test Implications

7.3 Implications to Test Requirement and Cost

- Reduce (even more) test power by using the PM infrastructure and/or applying the previous dedicated solutions (chapters 3 to 6)
- Preserve the functionality of the test infrastructure
- Test the power management (PM) structures
- Consider the influence of LPD on test conditions
- Provide high quality delay fault coverage

And still target:

High fault coverage, short test application time, small test data volume, low area overhead, etc ... while making test power dissipation (dynamic and leakage) comparable to functional power

---

7. Low-Power Design Techniques and Test Implications

7.3 Implications use of PM infrastructure

- Existing solutions
  - ATPG and DfT for multi-voltage designs (addressed in Chapter 8)
  - ATPG and DfT for gated clock designs (addressed in Chapter 9)
  - Test planning for multi-power domains architectures (addressed in Chapter 11)
7. Low-Power Design Techniques and Test Implications

7.3 Implications □ Preserving Test Functionality

**Main Challenge**

- Test infrastructures like scan chain or TAM may cross several power domains and can be broken if some of these domains are temporarily powered-down for low-power constraints

Specific test control strategies and extra DfT insertion are required for preserving functionality of the test infrastructure!

- Similar challenges exist for gated clock designs (addressed in Chapter 9)

---

**Example: MSMV-aware scan chain assembly**

- Minimize the occurrence of chains that cross voltage domains

- Avoid unwanted power down during scan shift/capture
- Chain must not span domains that must be independently on/off
- Chains traversing through powered down domains must bypass them
- ...

(M. Hirech, Synopsys, DATE 2008)
7. Low-Power Design Techniques and Test Implications

7.3 Implications  Test the PM Structures

- Power management structures have to be tested and need dedicated test / DfT / ATPG methods for:
  - Level shifters
  - Isolation cells
  - State retention registers
  - Power controller

- Power switches (by far the most challenging task !!)

(Addressed in Chapter 10)

(courtesy: M. Hirech, Synopsys)

7. Low-Power Design Techniques and Test Implications

7.3 Implications  Adapt Test Conditions

- Combined impact of advanced PM techniques and process-induced uncertainty in device parameters (transistor length, width, oxide thickness, random dopant fluctuations, etc. that have a great impact, especially on the threshold voltage) imposes a large number of test conditions to ensure correct timing operation (ex. speed binning based on structural delay testing)
  - Testing at multiple Vdd (when DVS in used) (chapter 8)
  - Testing at multiple frequencies for a each Vdd (for DVFS)
  - Testing at multiple temperatures …
Different activity levels in different parts of a die causes $T^\circ$ variations. Circuit delay change non-linearly with voltage and temperature.

- Defining the worst-case timing conditions during test will be even more a challenge!!

[Graph showing normalized frequency versus temperature for different Vdd values]

- Traditional Methods of testing at only high temperature for Fmax will be insufficient for low power devices
  - Testing at additional temperatures will increase test time

(G. Singer, Intel, ITC, 2007)
7. Low-Power Design Techniques and Test Implications

7.3 Implications □ Delay Fault Coverage

• By using power optimization techniques such as gate downsizing or MTV, more paths become clustered in a narrow region around the cycle time, resulting in a large population of paths which are sensitive to small delay perturbations
  ▪ PDF selection more complex
  ▪ More test data are needed
  ▪ Sensitivity to variations

• PSN has a significant impact on the timing behavior
  ▪ Need to integrate PSN effects in delay test pattern generation

High quality at-speed (delay) test becomes more complex

Chapter 8
Test Strategies for Multi-Voltage Designs
8. Test Strategies for Multi-Voltage Designs

8.1 Test for Multi-Voltage (MV) Design

• Multi-Voltage design
  • Use dedicated multiple power supplies on chip, or adaptive voltage scaling circuitry consisting of DC-DC converters and voltage controlled oscillators.

• MV design and Test
  • Some manufacturing defects have Vdd-dependency
  • Impact on defect coverage
  • Testing at all Vdd settings is costly!!
  • Multi-Vdd Test determines the minimum number of voltage settings to ensure the highest level of defect coverage

8. Test Strategies for Multi-Voltage Designs

8.1 Test for MV Design □ Bridge Defects

• Bridge defects
  • Most of them are resistive
  • Hard-shorts ~ stuck-at faults → detection is independent of Vdd

• Resistive bridge defects
  • A finite number of discrete intervals can be used to deal with all bridge $R_{sh}$ values
8. Test Strategies for Multi-Voltage Designs

8.1 Test for MV Design □ Bridge Defects

• Resistive bridge behavior at MV settings
  • Voltage Vo does not scale linearly with the input threshold voltage of S1, S2 and S3 when changing Vdd
  • Resistance intervals differ from one voltage setting to another
  • Example: a bridge with $R_{sh} = R_{3B}$ will cause a logic fault at $V_{ddB}$ but not at $V_{ddA}$
  • A test pattern targeting a particular logic fault will detect different ranges of defects at different MV settings

8. Test Strategies for Multi-Voltage Designs

8.1 Test for MV Design □ Bridge Defects

• Resistive bridge behavior at MV settings
  • Case 1: a test set detecting LF1, LF2 and LF3 will achieve full fault coverage when applied at $V_{ddB}$
  • Case 2: testing at both Vdd settings is required!
8. Test Strategies for Multi-Voltage Designs

8.1 Test for MV Design □ Bridge Defects

- Goal of test generation for MV settings
  - Generate test patterns for MV settings to maximize coverage of Resistive Bridging Faults (RBF) while minimizing test sequence length
  - Identification of Vdd specific test patterns for a given fault f
  - Example*: for a given circuit, 621 test patterns are needed at three Vdd settings with a commercial tool (207 at each Vdd), while only 391 test patterns applied at two Vdd settings (0.8V and 1.2V) are needed with the proposed technique (to achieve 100% fault coverage)

(U. Ingelson et. al., ATS, 2007)

8.2 Test for MV Design □ Open Defects

- Open defects
  - Due to unconnected nodes
  - Full opens → logic failures tested using static faults
- Transmission gate opens
- Interconnect resistive opens
  - Resistive opens → time-dependent → tested using delay tests
  - Better detectability at elevated Vdd settings *

(* B. Kruseman et. al., DATE, 2006)
8. Test Strategies for Multi-Voltage Designs

8.3 DfT for Multi-Voltage Designs

Multi-Voltage Aware Scan Cell Ordering

• Multi-voltage aware scan chain assembly considers the voltage domains of scan cells during scan cell ordering so as to minimize the occurrence of chains that cross voltage domains

• Minimize number (area overhead) of level shifters (by 93%)

- Implemented in Synopsys Design Compiler

<table>
<thead>
<tr>
<th>Position</th>
<th>Logical</th>
<th>Physical</th>
<th>Multi-Voltage</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>A1</td>
<td>A2</td>
<td>A2</td>
</tr>
<tr>
<td>2</td>
<td>A2</td>
<td>A1</td>
<td>A1</td>
</tr>
<tr>
<td>3</td>
<td>A3</td>
<td>A3</td>
<td>A3</td>
</tr>
<tr>
<td>4</td>
<td>B1</td>
<td>B3</td>
<td>C1</td>
</tr>
<tr>
<td>5</td>
<td>B2</td>
<td>B1</td>
<td>C2</td>
</tr>
<tr>
<td>6</td>
<td>B3</td>
<td>B2</td>
<td>C3</td>
</tr>
<tr>
<td>7</td>
<td>C1</td>
<td>C1</td>
<td>B2</td>
</tr>
<tr>
<td>8</td>
<td>C2</td>
<td>C2</td>
<td>B3</td>
</tr>
<tr>
<td>9</td>
<td>C3</td>
<td>C3</td>
<td>B1</td>
</tr>
</tbody>
</table>

(A. De Colle et al., JOLPE Vol. 1, N°1, April 2005)

8. Test Strategies for Multi-Voltage Designs

8.3 DfT for Multi-Voltage Designs

Power-Aware Scan Chain Assembly for MV Designs

• Bypass multiplexers allow testing of specific power domains in MSMV environment (switched-off power domains are bypassed)

• Preserve test functionality!

- Implemented in Cadence Encounter™

(V. Chickermane et al., ITC 2008)
8. Test Strategies for Multi-Voltage Designs

8.3 DfT for Multi-Voltage Designs

Power-Managed Scan Using AVS

- Adaptive Supply Voltage (AVS) Scaling infrastructure for DVS is widely used in modern SoCs to reduce dynamic and leakage power
- Re-use of this infrastructure in the test mode to propose a scaled-voltage scan test scheme. The goal is to reduce dynamic and leakage power dissipation by using a lower supply voltage during scan shifting
- At-speed testing with a LOC or a LOS test scheme is assumed, as well as the fact that the scan shift speed is usually lower than the functional (capture) speed

- Example: functional supply voltage ($V_{max}$) = 1.1 V, functional frequency ($F_{max}$) = 500 MHz, threshold voltage of scan FFs ($V_t$) = 0.35 V, shift frequency ($F_{shift}$) = 125 MHz $\rightarrow V_{shift} = 0.635$ V

(V.R. Devanathan et. al., ITC 2007)

- Around 45% reduction of dynamic (average and peak) power and 90% reduction of leakage power, with negligible physical design impact and minimum area overhead
9. Test Strategies for Gated Clock Designs

Chapter 9
Test Strategies for Gated Clock Designs

Content

9.1 Functional Clock Gating
- Basic Concept
- Glitch-Free Clock Gating

9.2 DFT for Clock Gating Logic

9.3 Capture Power Reduction Through Clock Gating
- Static In-ATPG Technique
- Dynamic In-ATPG Technique
- Post-ATPG Technique
9.1 Functional Clock Gating

**Basic Concept**

- In a CMOS circuit, state changes at FFs triggered by clocks propagate to logic gates, resulting switching activity in the whole circuit.
- No all FFs need to be triggered in order to perform a function (e.g., in a mobile-phone SoC, camera control logic needs not to be active when a call is being made).
- “Stopping” the clock to functionally-noncontributing FFs reduce dynamic power dissipation, *not only in logic portions and also in clock trees (> 50%).*
- “Stopping” of clocks should be conditional.

---

**Glitch-Free Clock Gating**

- Simply gating a clock $Clk$ with a clock gating signal from the clock gating logic (not controlled by $Clk$) may create glitches.
9.2 DFT for Clock Gating Logic

Case of MUXed Scan

Guaranteeing Correct Scan Test Operations

- Disable clock gating in shift mode (SE = 1): Unconditionally-ON Clock
- Enable clock gating in capture mode (SE = 0): Conditionally-ON Clock

Shift (SE = 1)

Capture (SE = 0)

9.2 DFT for Clock Gating Logic

Case of MUXed Scan

Technique for Improving ATPG Efficiency

- If ATPG needs to easily generate a test to gate-off or enable a clock in capture mode, one can improve the 0-controllibility and 1-controllibility of the clock gating signal.
9.3 Capture Power Reduction Through Clock Gating

Static In-ATPG Technique

Basic Flow

- A default value is a value assignment to be made on a bit if ATPG does not require a different value on the bit.
  - A preset value to be used to fill an X-bit in a test cube.

- Flow for obtaining default values for clock gating:
  - Identify all clock gators.
  - For each clock gator, calculate a set of bits settings that can set the clock off. (This process only needs to be conducted once up front.)
  - The values in each bit setting are default values.

- Flow for using default values for clock gating:
  - Generate a test cube for fault detection.
  - Assign default values to the X-bits in the test cube. (If there are multiple choices of default values, use the one that can turn-off more FFs.)


9.3 Capture Power Reduction Through Clock Gating

Static In-ATPG Technique

Example

Test Cubes after ATPG

```
Test Cubes | s0 | s1 | s2 | s3 | s4 | s5
--- | --- | --- | --- | --- | --- | ---
T1 | X | 0 | X | 0 | X |
T2 | 0 | 1 | X | X | 0 |
T3 | X | X | 0 | X | 0 |
```

Test Cubes after Adding Default Values

```
Test Cubes | s0 | s1 | s2 | s3 | s4 | s5
--- | --- | --- | --- | --- | --- | ---
T1 | X | 0 | D | 0 | 1 | 0 |
T2 | 0 | 1 | X | X | 0 | False
T3 | X | X | 0 | 1 | 0 | X |
```

9.3 Capture Power Reduction Through Clock Gating

Static In-ATPG Technique

Experimental Results

- The number of active clocks are reduced by > 50%.
- Capture toggle activity is reduced by 35%.
- Fault coverage and test set size remain almost unchanged.

Circuit Statistics
89K FFs
6 Scan Clocks
2200 Clock Gators

9.3 Capture Power Reduction Through Clock Gating

Dynamic In-ATPG Technique

- Generate a test cube for primary fault detection.
- Conduct dynamic compaction to secondary fault detection.
- Identify the clocks that can be turned off without any impact of the fault detection capability of the test cube.
- Order the clocks by the decreasing number of FFs that are controlled by each clock.
- Justify 0 to clock gators of the clocks in the pre-determined order.

A Low-Capture-Power Test Cube for Fault Detection
9.3 Capture Power Reduction Through Clock Gating

**Post-ATPG Techniques**

**Goals of CTX**

CTX (Clock-Gating-Based Test Relaxation and X-Filling)

- To explore the collective LSA-reducing capability of clock-gating in at-speed scan testing.
- To further reduce LSA by equalizing the input and output values of FFs.
- To avoid ATPG change, test data inflation, and fault coverage loss.
- To reduce LSA with as few X-bits as possible.

---

(H. Furukawa et al., Proc. ATS, pp. 397-402, 2008)
9.3 Capture Power Reduction Through Clock Gating

Post-ATPG Techniques

* X-Filling for Disabling Clock Gators

<table>
<thead>
<tr>
<th>PI1</th>
<th>PI2</th>
<th>EN1</th>
<th>PO1</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>X</td>
<td>X</td>
</tr>
<tr>
<td>X</td>
<td>X</td>
<td>X</td>
<td>X</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>X</td>
<td>X</td>
</tr>
<tr>
<td>1</td>
<td>X</td>
<td>X</td>
<td>X</td>
</tr>
</tbody>
</table>

↓

Try to justify 0 on each neutral clock control signals.

(H. Furukawa et. al., Proc. ATS, pp. 397-402, 2008)

10. Test of Power Management Structures

Chapter 10

Test of Power Management Structures
10. Test of Power Management Structures

Content

10.1 Clock Gating Logic
   • Clock Gator
   • Testing of the Clock Gator and its Control Logic

10.2 Power Control Logic
   • Power Gating Basics
   • Power Gating Logic

10.3 Power Switches
   • Types of Power Switches
   • Techniques for Testing Power Switches
   • Discharge Circuitry

10.4 Low-Power Cells
   • State Retention Register and its Testing
   • Isolation Cell and its Testing
   • Level Shifter and its Testing

10.1 Clock Gating Logic

Clock Gator and Control Logic

• Clock Gator for enabling/disabling a clock in a glitch-free clock manner.
• Functional Control Logic for implementing clock gating conditions.

Determine CEN when TEST_EN = 0

Control Logic

Clock Gator

Enable GCLK when CEN = 1
Disable GCLK when CEN = 0
10.1 Clock Gating Logic

Testing of the Clock Gator and its Control Logic

\[ \text{TEST\_EN = 1 for Shift and TEST\_EN = 0 Capture} \]

- In shift mode, GCLK is always active, allowing the correct shift operation.
- In capture mode, the functional control logic controls the clock gator.
- Chances are high that faults in the clock gator and its functional control logic are detected.
- No need to add any observe point.
- Larger test vector count / lower capture power.

10.2 Power Control Logic

Power Gating Basics

- Power gating = conditional turning on or off the power supply of a logic block (power domain) in order to reduce both dynamic and static (leakage) power.

- Power gating is implemented by adding
  1. power switches
  2. low-power cells (state retention register, isolation cell, level shifter)
  3. power management unit (PMU)
10.2 Power Control Logic

**Power Gating Logic**

1. **Power Switch** for connecting or disconnecting a power supply or ground of a logic block (power domain).
2. **State Retention Register** for retaining its state when the corresponding power domain is powered-off.
3. **Isolation Cell** for providing a reliable voltage level and corresponding logic value into a power-on power domain when the source power domain is powered-off.
4. **Level Shifter** for connecting power domains that operate at different voltage levels and converting the voltage levels up or down as needed by the driven power domain.
5. **Power Management Unit (PMU)** for controlling power modes and provide safe transition sequences between power modes.

**Testing of Power Control Logic**

**Method-1:** Observe the logic controlled by the power control logic.

**Method-2:** Observe at the output of the power control logic.
10.2 Power Control Logic

Testing of Power Clock Control Logic

Observing the Logic Controlled by the Power Control Logic

s-a-1 ?

Fault-Free: ON / Faulty: OFF

- Observe faulty effects in the downstream logic.
- Advantages:
  - Minimizing additional test logic and thus area overhead.
  - Allowing at-speed testing of the power control logic.
- Disadvantage:
  - Causing difficulty in fault diagnosis.

10.2 Power Control Logic

Testing of Power Clock Control Logic

Observing at the Output of the Power Control Logic

- Add test points to make the outputs of the power control logic observable.
- Advantage:
  - Easy to observe faulty effects.
- Disadvantage:
  - Hard to conduct at-speed testing of the power control logic.
### 10.3 Power Switches

#### Types of Power Switches

- **Header Switch**
  - Also called sleep transistors.
  - Should be large enough to provide sufficient current to the circuit.

- **Footer Switch**
  - Individual transistors can be small.
  - Preferable in practice due to concerns about layout, design for manufacturability, and limiting inrush current when switching on a power domain.

- **Symmetric Switch**

- **Segmented Switch**

#### Techniques for Testing Power Switches

**Test for Header Switch with Comparator --- Basic Method**

- **Pattern 1:**  
  - \( TE = 1 / \text{standby}_1 = 1 \) (test for short)  
  - Turn-off the power switch. After sufficient discharge, \( V_{core} \) should be much lower than \( V_{DD} \).  
  - \( \text{Out (fault-free)} = 1 / \text{Out (faulty)} = 0 \)

- **Pattern 2:**  
  - \( TE = 1 / \text{standby}_1 = 0 \) (test for open)  
  - Turn-on the power switch. \( V_{core} \) should be close to \( V_{DD} \).  
  - \( \text{Out (fault-free)} = 1 / \text{Out (faulty)} = 0 \)

\( (\text{Goal et. al., Proc. ETS, pp. 145-150, 2006}) \)
10.3 Power Switches

Techniques for Testing Power Switches

Test for Header Switch with Comparator --- Improved Method

- Use a separate signal \( n \) to provide comparison data.
- Make it possible to test for both stuck-at-1 and stuck-at-0 of \( Out \).
- Two dedicated test signals, \( \text{standby}_t \) and \( n \), are needed.
  \( \Rightarrow \) Large number of additional test signals when there are many power switches.

Discharge Circuitry

The Design

- Discharge Control: power switch = OFF / discharge transistor = ON
- Need an additional control test signal \( \text{discharge}_c \).

(Peng et al., Technical Report, CADT-12/01/2008, 2008)
10.4 Low-Power Cells

State Retention Register

Basic Design

- A SRR cell is for keeping its state when the power supply is turned off.
- Normal Mode: $VDD1 = ON / VDD2 = ON / RET = 0$
- Retention Mode: $VDD1 = OFF / VDD2 = ON / RET = 1$

Design with Scan Function
10.4 Low-Power Cells

State Retention Register

**Test Method**

- Turn Power Domain 2 on ($SLEEP_2 = 0$).
- Shift in value $v$ to SRR.
- Enable retention ($RET_1 = 1$).
- Enable the isolation cell ($ISO_1 = 1$).
- Turn Power Domain 1 off ($SLEEP_1 = 1$).
- Turn Power Domain 1 on ($SLEEP_1 = 0$).
- Disable the isolation cell ($ISO_1 = 0$).
- Disable retention ($RET_1 = 0$).
- Shift out the value of SRR and see if its is value $v$.

Repeat for $V = 0$ and $V = 1$

---

10.4 Low-Power Cells

Isolation Cell

- Turn Power Domain 1 off ($SLEEP_1 = 1$).
- Turn Power Domain 2 on ($SLEEP_2 = 0$).
- Apply a test to detect the stuck-at-0 at the output of the isolation cell ($ISO_1 = 1$).
10.4 Low-Power Cells

Level Shifter

Test as a whole under all power supply voltage combinations of Power Domain 1 and Power Domain 2.

11. EDA Solutions for Power-Aware Design-for-Test

Chapter 11
EDA Solutions for Power-Aware Design-for-Test
11. EDA Solutions for Power-Aware Design-for-Test

11.1 Overview of EDA solutions

• Design flow for power management
• Test automation objectives
• Integration of power management techniques
• Test planning

11.2 Design flow for power management

UPF-based low-power design flow
(courtesy: M. Hirech, Synopsys)
11. EDA Solutions for Power-Aware Design-for-Test

11.3 Test automation objectives

(courtesy: M. Hirech, Synopsys)

11.4 Integration of power management techniques

Level shifter / isolation cell reuse after scan stitching

(courtesy: M. Hirech, Synopsys)
11. EDA Solutions for Power-Aware Design-for-Test

11.4 Integration of power management techniques

Voltage annotation and scan segment modeling

(courtesy: M. Hirech, Synopsys)
11. EDA Solutions for Power-Aware Design-for-Test

11.5 Test planning

Power dissipation vs test application time

(courtesy: M. Hirech, Synopsys)

Illustration of multi-mode DFT architecture

(courtesy: M. Hirech, Synopsys)
Summary and Conclusions

• Reliability, test throughput and manufacturing yield may be affected by excessive test power
• Therefore, lowering test power has been and is still a focus of intense research and development
• Following points have been surveyed:
  ▪ Test power parameters and contributors
  ▪ Problems induced by an increased test power
  ▪ Structural and algorithmic solutions for low power test along with their impacts on parameters such as fault coverage, test time, area overhead, circuit performance penalty, and design flow modification
  ▪ Test implications of low power design and emerging test strategies for low-power devices
Summary and Conclusions

• Past work explored primarily four directions:
  • ATPG-based low power test approaches are not intrusive; they are pattern dependent and may incur additional computational cost in the design flow.
  • DFT-based low power test approaches change the scan infrastructure; can be pattern independent and hence can guarantee power reduction in a certain range.
  • Low power test data compression techniques combines reduction of test data volume and test application time with power consumption during test
  • System-level low power test considerations can be combined with any of the above and improve test concurrency.

• Future work will be mainly on:
  • Test strategies for low power designs

THANK YOU!
And let us make testing “cool”.