FIRMWARE DEBUGGING INSTRUCTION TRACING

Have you run your firmware, found a repeatable bug, but been unable to reproduce it while single-stepping through the code with your debugger? Maybe your internal watchdog block times out, resetting the system, and you have no idea why.

Firmware programmers often must deal with difficult bugs late in a product cycle. Fixing simpler bugs can often reveal more complex bugs underneath that need to be fixed before the product ships.

Instruction tracing is a powerful tool that can speed up debugging, especially for bugs involving complex firmware interactions. Many ARM processors, such as those in the Cortex-M family, support instruction tracing over a small number of external pins when paired with a trace probe such as the SEGGER J-Trace. The J-Trace probe also includes a built-in license for the SEGGER Ozone debugger.

Advantages to Instruction Tracing

1. Ability to debug complex code interactions that single-step debugging masks. Trace captures the executed code sequence, including all interrupts and context switches. This visibility into timing and ordering can significantly reduce debug time.

2. Zero-overhead function profiling

3. Zero-overhead code coverage measurement

More Info: https://www.segger.com/products/development-tools/ozone-j-link-debugger/technology/trace-features/

Disadvantages to Instruction Tracing

1. Consumes pins (four data and one clock) that aren’t used in the final product

2. Cost of the J-Trace probe. This can be mitigated by sharing one probe among firmware engineers. https://shop-us.segger.com/TraceProbe_s/41.htm

3. Up-front cost of board space and design time. Trace signals require tight constraints on wire lengths and termination resistors. Fortunately, the required components can be surface mount and left unpopulated (no-loaded) to save cost, then post-loaded if needed to debug issues. More details on board design are in the following document: https://www.segger.com/downloads/jlink/UM08001

4. Off-the-shelf evaluation boards that include instruction trace often cost more

5. Instruction tracing, even over a few seconds, can generate gigabytes of instruction listings that are time consuming to parse. Licensed debugging software (such as Keil and IAR) often contains much better trace analysis tools than the free Ozone software.

Basic Debug Strategy with Instruction Trace

The most common strategy for debugging with instruction trace involves triggering a breakpoint during a failing condition, running the code up to that breakpoint, and using instruction trace to observe the sequence of code that caused it. Adjust your system parameters (such as minimizing timeouts or queue sizes) to reduce the time between the failing condition and the breakpoint, if possible.

The following list describes some example scenarios and ways to trigger the breakpoint:

Process taking too little or too much time:

Read a microsecond timer and calculate the duration or interval between loops. Trigger a breakpoint if the duration falls outside expected timing bounds.
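As a minimal sketch of this check, assuming a free-running 32-bit microsecond counter: the names `trace_trap`, `timing_bounds_t` and `check_duration` are hypothetical, and the trap hook only counts hits so the logic can be exercised on a host. On a target it would be replaced by a breakpoint instruction (such as the ARM BKPT instruction) or by a function with a debugger breakpoint set on it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical trap hook: stands in for a breakpoint on a real target. */
static unsigned trace_trap_hits;
static void trace_trap(void) { trace_trap_hits++; }

/* Expected bounds for one pass of the process, in microseconds (assumed). */
typedef struct {
    uint32_t min_us;
    uint32_t max_us;
} timing_bounds_t;

/* Returns true and fires the trap when the time between two timer readings
 * falls outside the bounds. Unsigned subtraction keeps the elapsed time
 * correct across a single wrap of the 32-bit counter. */
static bool check_duration(uint32_t start_us, uint32_t end_us,
                           const timing_bounds_t *bounds)
{
    assert(bounds != NULL && bounds->min_us <= bounds->max_us);
    uint32_t elapsed_us = end_us - start_us;
    if (elapsed_us < bounds->min_us || elapsed_us > bounds->max_us) {
        trace_trap();
        return true;
    }
    return false;
}
```

Calling `check_duration` at the end of each pass keeps the breakpoint close to the failing condition, so the instructions of interest are still near the end of the captured trace.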

Periodic processes getting the wrong number of events between runs:

Trigger a breakpoint when the number of events (such as interrupts) between runs is outside of expected bounds.
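One way to sketch the event-count check is shown here. It assumes a 32-bit MCU where reading a `uint32_t` counter is atomic; `on_event_irq`, `check_event_count` and `trace_trap` are hypothetical names, and the trap hook is a host stand-in for a breakpoint.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical trap hook: stands in for a breakpoint on a real target. */
static unsigned trace_trap_hits;
static void trace_trap(void) { trace_trap_hits++; }

/* Shared between interrupt and task context, hence volatile. */
static volatile uint32_t event_count;
static uint32_t last_event_count;

/* Body of the (hypothetical) interrupt handler being counted. */
static void on_event_irq(void) { event_count++; }

/* Call once per run of the periodic process. Returns true and fires the
 * trap when the number of events since the previous run is out of bounds. */
static bool check_event_count(uint32_t min_events, uint32_t max_events)
{
    assert(min_events <= max_events);
    uint32_t now = event_count;              /* single read of the counter */
    uint32_t delta = now - last_event_count; /* wrap-safe difference */
    last_event_count = now;
    if (delta < min_events || delta > max_events) {
        trace_trap();
        return true;
    }
    return false;
}
```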

Watchdog times out:

Set a breakpoint near the top of the reset handler, run the code that triggers the watchdog, and observe the sequence just before the reset. A long-running loop or a wait without a timeout condition is often the culprit.
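Once the trace points at an unbounded wait, a common remedy is to give the wait a budget. The sketch below is only an illustration: `wait_for_flag`, the status register and the mask are hypothetical, and the budget is a poll count rather than a calibrated time.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Polls a status register until any bit in `mask` is set or the poll
 * budget runs out. Returns true if the flag was seen, false on timeout,
 * so the caller can recover or report instead of starving the watchdog. */
static bool wait_for_flag(const volatile uint32_t *status_reg,
                          uint32_t mask, uint32_t max_polls)
{
    assert(status_reg != NULL);
    for (uint32_t polls = 0u; polls < max_polls; polls++) {
        if ((*status_reg & mask) != 0u) {
            return true;
        }
    }
    return false;
}
```

A timer-based budget would give a more predictable timeout than a poll count; the poll count is used here only to keep the example free of hardware dependencies.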

Memory fault:

Set a breakpoint in the hard-fault handler and observe the code that leads up to it.
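It can also help to record where the fault happened so the value is visible as soon as the breakpoint hits. On Cortex-M, exception entry pushes R0 to R3, R12, LR, PC and xPSR onto the active stack. The sketch below decodes that basic frame (no floating-point context); `record_fault` and `last_fault` are hypothetical names, and obtaining the stacked frame pointer inside the real handler needs a few lines of assembly that are not shown.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Word offsets in the basic Cortex-M exception stack frame. */
enum {
    FRAME_R0, FRAME_R1, FRAME_R2, FRAME_R3,
    FRAME_R12, FRAME_LR, FRAME_PC, FRAME_XPSR,
    FRAME_WORDS
};

/* Hypothetical record kept in RAM for inspection at the breakpoint. */
typedef struct {
    uint32_t fault_pc;   /* address of the instruction that faulted */
    uint32_t caller_lr;  /* return address of the faulting function */
} fault_record_t;

static fault_record_t last_fault;

/* `stacked_frame` points at the first word the hardware pushed (R0). */
static void record_fault(const uint32_t *stacked_frame)
{
    assert(stacked_frame != NULL);
    last_fault.fault_pc  = stacked_frame[FRAME_PC];
    last_fault.caller_lr = stacked_frame[FRAME_LR];
}
```

With the breakpoint placed after the call to `record_fault`, the recorded program counter can be matched against the last entries of the instruction trace.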

Data corruption:

If a variable or memory location changes to an invalid value, trigger a breakpoint when the value becomes invalid. You can also use an ARM hardware watchpoint (data breakpoint) to halt when a specific memory location is written.
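When "invalid" is hard to define for the data itself, one option is to surround it with known guard words and trap when a guard changes. This is a sketch of that idea, not a method taken from any particular library: `guarded_buffer_t`, `GUARD_WORD` and `trace_trap` are hypothetical, and the trap hook is a host stand-in for a breakpoint.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define GUARD_WORD 0xC0DEFACEu  /* arbitrary, unlikely-to-occur pattern */

/* Hypothetical trap hook: stands in for a breakpoint on a real target. */
static unsigned trace_trap_hits;
static void trace_trap(void) { trace_trap_hits++; }

/* Payload bracketed by guard words that normal code never writes. */
typedef struct {
    uint32_t guard_head;
    uint8_t  payload[16];
    uint32_t guard_tail;
} guarded_buffer_t;

static void guarded_buffer_init(guarded_buffer_t *buf)
{
    assert(buf != NULL);
    buf->guard_head = GUARD_WORD;
    memset(buf->payload, 0, sizeof buf->payload);
    buf->guard_tail = GUARD_WORD;
}

/* Returns true and fires the trap if either guard word was overwritten. */
static bool guarded_buffer_check(const guarded_buffer_t *buf)
{
    assert(buf != NULL);
    bool corrupt = (buf->guard_head != GUARD_WORD) ||
                   (buf->guard_tail != GUARD_WORD);
    if (corrupt) {
        trace_trap();
    }
    return corrupt;
}
```

The more often `guarded_buffer_check` runs, the shorter the stretch of trace between the offending write and the breakpoint.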

Code hangs where the watchdog doesn’t time out:

Turn on the tracing feature, run the code until it hangs and then press the “pause” or “halt” button in the debugger to look at the code trace.

Interactions with MCU peripherals:

Instrument your interrupt handlers to read relevant peripheral registers. Trigger a breakpoint if unexpected values are detected.
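A sketch of such instrumentation follows. The register layout, bit positions and names (`demo_uart_regs_t`, `DEMO_UART_STATUS_*`, `demo_uart_irq`) are invented for illustration and do not describe any real peripheral; the handler takes the register block as a parameter so the logic can be exercised on a host.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical peripheral register block and error bits. */
typedef struct {
    volatile uint32_t status;
    volatile uint32_t data;
} demo_uart_regs_t;

#define DEMO_UART_STATUS_FRAMING (1u << 1)
#define DEMO_UART_STATUS_OVERRUN (1u << 3)
#define DEMO_UART_STATUS_ERRORS  (DEMO_UART_STATUS_FRAMING | \
                                  DEMO_UART_STATUS_OVERRUN)

/* Hypothetical trap hook: stands in for a breakpoint on a real target. */
static unsigned trace_trap_hits;
static void trace_trap(void) { trace_trap_hits++; }

/* Snapshot of the status register at the moment the check failed. */
static uint32_t last_bad_status;

/* Instrumented interrupt handler body: read the status register once,
 * keep a copy if any error bit is set, then trap. */
static void demo_uart_irq(demo_uart_regs_t *uart)
{
    assert(uart != NULL);
    uint32_t status = uart->status;
    if ((status & DEMO_UART_STATUS_ERRORS) != 0u) {
        last_bad_status = status;
        trace_trap();
    }
    /* Normal interrupt handling would continue here. */
}
```

Saving the status value before trapping matters on peripherals where reading or servicing the register clears its flags.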

Interactions with external hardware:

Add a GPIO interrupt handler that fires when external hardware misbehaves. Set a breakpoint in the GPIO handler to stop the CPU and get the instruction trace.
