

What's new at Embedded Systems Conference 2008

# Working successfully behind the scenes

Embedded systems have shown rapid development in the last 15 years. There was a time when they were only to be found in complex technical products that were never used in the average household, but now they are everywhere. The general public may be unaware of them since they are embedded and



work behind the scenes, but we in the "embedded community" know that every ordinary member of society comes into contact with them every day. Whether in the car, cell phone, MP3 player, home cinema, washing machine, or cooker – embedded systems are in daily use in almost all technical devices, using one or more microcontrollers, digital signal processors, and of course their respective software. And we, the "embedded community", have to make sure that everything works perfectly and without any fuss.

# TRACE32 – a premium product

Lauterbach was founded 29 years ago in the pioneering age of embedded systems. It has managed to turn the challenges of this market into technological and economic success by a mixture of the right know-how and enormous commitment. Just as we can no longer imagine items of daily use without embedded systems, a developer in the embedded systems industry cannot imagine work without TRACE32. Whether in Europe, the USA, or Asia, TRACE32 is recognized as the premier product and the world's leading brand in microprocessor development tools, which is borne out by some very impressive figures: In 2007 alone, over

7000 new workplaces worldwide were equipped with TRACE32!

TRACE32 stands for continuity, but mainly for progress and inventiveness. One or the other is always in the foreground, depending on the branch of industry. In this coming year, we intend to push ahead strongly in the development of our products and invest in solid growth so that you can continue to rely on Lauterbach as an innovative partner.

The past year, 2007, was of course not a year of standing still. In this newsletter for 2008, we wish to show you the technological progress we have made during the last year. We would like to

encourage you to try some of these new features for your development to improve your efficiency. We will be presenting many of these innovations at the ESC Silicon Valley, to which, as every year, we cordially invite you.

### **Contents**

| Hardware debugging in UML             | 3  |
|---------------------------------------|----|
| New supported processors              | 8  |
| Debugging with power-saving modes     | 9  |
| New debugging concept for SMP systems | 13 |
| New preprocessors/NEXUS adapters      | 17 |
| CombiProbe                            | 18 |



NEWS 2008 党

DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# **TRACE32** for Virtual Prototypes

| Virtual P | rototypes                             |                                                                         |
|-----------|---------------------------------------|-------------------------------------------------------------------------|
| CoWare®   | CoWare<br>Virtual Platform            | ARM                                                                     |
| Synopsys® | Virtual Platform<br>(formerly Virtio) | ARM<br>XScale                                                           |
| VaST™     | VaST<br>Virtual System<br>Prototype   | ARM<br>OAK<br>PowerPC/eTPU                                              |
|           | i iototypo                            | SH2A<br>StarCore<br>TeakLite                                            |
|           |                                       | TriCore/PCP<br>V850                                                     |
|           | CoWare®<br>Synopsys®                  | Virtual PlatformSynopsys®Virtual Platform<br>(formerly Virtio)VaST™VaST |

Today, fast and efficient software development is often the decisive factor in the launching of new products. To avoid having to wait for the first hardware prototypes at the start of the development phase, *Virtual Prototypes* (software models of the hardware) are becoming increasingly common. As soon as a *Virtual Prototype* is available for the target hardware, work can start on debugging the drivers, the operating system, and the application.

So that we can offer our customers the professional development environment TRACE32 in this early phase of their project, Lauterbach has been supporting the debugging of software models since 2007. The debugging interface used here is the debugging API of the *Virtual Prototype*.

# TRACE32 as GDB front end

### **GDB** front end

| ARM     | Available |
|---------|-----------|
| 1386    | Available |
| MIPS32  | Available |
| PowerPC | Available |
| SH4     | Planned   |
| XScale  | Available |
|         |           |

Since early 2007, the TRACE32 GUI can also be used for process debugging with GDB. The supported communication interfaces to the target hardware are Ethernet or RS-232.

# **Debugging application processes**

To debug two or more application processes simultaneously, Lauterbach provides its own debugging agent – *t32server*, which is started first in the Linux terminal window. Subsequently via the TRACE32 GUI several GDB servers for testing application processes can be started. The *t32server* service is then responsible for data exchange between the TRACE32 GUI and the GDB servers (see Fig. 1).

# **Other innovations**

In the 2007 Newsletter, Lauterbach announced "Integrated run- & stop-mode debugging for embedded Linux applications". This year this is followed by a new concept for debugging SMP Linux (see page 13).



Fig. 1: The t32server service, enabling simultaneous debugging of two or more application processes



Ρο

Ρο

Ρο

Ρο

Po

Inte

**NEWS 2008** 

# Integration, testing, and debugging of C code together with UML-generated sources



Fig. 2: Adding tram traffic lights to a traffic intersection

# Introduction

It will soon no longer be possible to master the growing complexity of software without the use of CASE tools. UML, especially, has enabled the uniform design of modular software components, including automatic code generation, which prevents common coding errors right from the start.

However many companies shy away from the perceived effort required for the changeover to these new techniques. They have a code base that has been built up over decades and although the software has been constantly extended and improved. This has also led to the creation of more and more "spaghetti code". Millions of functioning lines of code would need to be analyzed, re-programmed, and finally re-tested.

In most cases these fears are unfounded. Existing C code can be transferred to a UML model element with almost no changes necessary. Future extensions can then be written as UML models, generated, and integrated with the "old" C code. As a result, an easy and progressive transition from C to UML is possible. The following example explains these procedures:

- 1. Transfer from "old" C code to UML.
- 2. Integration of a new functionality coded in UML.
- 3. Uniform testing and debugging in UML, C++ and C.

# Old C code

At the outset, we have a finished project in C code. The following example uses a traffic light system (which is a favorite of software designers due to its simplicity). There is a standard intersection with four sets of traffic lights. The traffic lights located opposite each other are controlled in parallel. The traffic lights at the right angle to them are, of course, controlled complementary. The different traffic light phases are: red – red and yellow – green – yellow – red. During the red phase of one set of traffic lights, right of way switches to the opposite direction. As a result, there is a short period when the lights facing in all directions are on red (this is important, as we shall see later).

This circuit is coded in C as a simple control (switchcase) in a continuous loop (while (1)). This continuous loop is found directly in the "main()" routine of the application, and is called immediately after initialization (see Fig. 3).

| int main (void)                         |
|-----------------------------------------|
| {                                       |
| <pre>horizontal = vertical = red;</pre> |
| <pre>state = closed_to_h;</pre>         |
|                                         |
| while (1)                               |
| {                                       |
| switch (state)                          |
| {                                       |
| case closed_to_h:                       |
| wait (1);                               |
| <pre>horizontal = yellowred;</pre>      |
| <pre>state = yellowred_h;</pre>         |
| break;                                  |
|                                         |
| case yellowred_h:                       |
| wait (3);                               |
| horizontal = green                      |
| <pre>state = green_h;</pre>             |
| break;                                  |
|                                         |

Fig. 3: Existing C code for traffic-light control; the "main()" routine includes a continuous loop that contains the logic as a switch-case construct.

### New requirements

The street planners have decided that a tramline should cross the intersection in both directions  $\rangle\rangle$ 



**PowerView** 

PowerDebug

PowerTrace

PowerProbe

# **NEWS 2008**\*

### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

#### **PowerView**

PowerDebug

**PowerTrace** 

**PowerProbe** 

Power-Integrator (vertically as well as horizontally, refer to Fig. 2 on page 3). Therefore, the trams must be stopped at the intersection by their own set of traffic lights. At the request of the tram driver, the intersection is blocked for cars at the next red light phase, and the tram is given right of way.

### UML wrapper for C code

We could, of course, just "patch" these requirements into our C code. But doing so would only make the function larger and more complicated, which is exactly what we want to avoid. So instead of this, we will take a modular approach to the problem and solve it with the help of UML.

| CarJunction                               |
|-------------------------------------------|
|                                           |
| rocess():void                             |
|                                           |
| 4: The "CarJunction                       |
| 4: The "CarJunction<br>containing the com |

plete "old" application

Firstly, we need to "transfer" the existing application over to our UML model, which is easier than it sounds: We create one class, called "Car-Junction", which contains the complete "old" application (see Fig. 4).

The only thing that we need to adapt are the corresponding interfaces. The usual continuous loop in

| <pre>void processJunction (void) {     horizontal = vertical = red;     state = closed_to_h;</pre> |
|----------------------------------------------------------------------------------------------------|
| do                                                                                                 |
| ł                                                                                                  |
| switch (state)                                                                                     |
| ť                                                                                                  |
| case closed_to_h:                                                                                  |
| wait (1);                                                                                          |
| horizontal = yellowred                                                                             |
| <pre>state = yellowred_h;</pre>                                                                    |
| break;                                                                                             |
| /**/                                                                                               |
| case yellow_v:                                                                                     |
| wait (3);                                                                                          |
| <pre>vertical = red;</pre>                                                                         |
| <pre>state = closed_to_h;</pre>                                                                    |
| break;                                                                                             |
|                                                                                                    |
| } // switch (state)                                                                                |
| <pre>} while ((state != closed_to_h)</pre>                                                         |
| <pre>&amp;&amp; (state != closed_to_v));</pre>                                                     |
| <pre>} // processJunction()</pre>                                                                  |

Fig. 5: Modified C code that can be integrated in the UML tool in this form

main() is already part of the framework and therefore has to be omitted. Instead, "main()" is renamed "processJunction()", and this becomes our new starting point. The end point of the function will be the phase when all traffic lights are on red. Now, if required, we can execute additional actions during this red phase (see Fig. 5). – And that's it.

### UML design for new requirements

Now we can take care of the design for the new requirements. We build a class called "Junction", which contains the old traffic light controls, called "CarJunction", as well as two new tram traffic lights (Fig. 6). The design is now finished, and we can focus on the behavior of the extended intersection, using a state transition diagram.



Fig. 6: The new class, "Junction", integrating the "old" traffic-light switching and the two new traffic lights for the tram

The old application is once again used as a starting point, and its whole behavior is incorporated into a single state of the new intersection (state: cars), see Fig. 7 on the opposite page. When the new model has this state, it works in exactly the same way as the old application. Only when the new feature – the tram driver requests right of way – is required, the old code is left and the new model starts. The phases and the transition of states for both tramlines are defined in the state diagram: request – wait – release – wait (possibly a switch between tramlines). This completes our model of the new requirements.  $\rangle\rangle$ 



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# **Code generation**

We can now generate the executable application. For good UML tools this is all that is required for generating the finished application for the abovementioned model. However, now is the time to note a few points about integration. Our initial application is written, and remains, in C. UML tools, however, normally generate C++ code, meaning that our "CarJunction" is a wrapper class that references the C code. Here, the usual adaptations between C and C++ must be taken into account.

The second

Typically, an RTOS is used to manage the model and must be integrated as required. Once all this has been done, a simple press of a button generates a ready-to-run application from the model.

### Target run

In the field of embedded technology, the target hardware is often different from the computers used for development. This means that we need to load the generated application onto the so-called "target" and run it there. This can be done using any external tool that can install code on the target.

This is especially easy if the UML tool and the debugger communicate over a common (software) interface, such as the integration between the UML tool Rhapsody from Telelogic and the TRACE32 debugger from Lauterbach. This allows you, at the press of a button in the modeling tool, to load the application to the target over the debug interface, and – if required – start it straight away. The complete cycle of design – modeling – generation – run is possible in one GUI.

# Testing in C

As was mentioned at the start, performing testing and debugging in a heterogeneous environment pose certain requirements. It must be ensured that the whole application can be tested in its respective function and implementation.

Although we are assuming that we already had a functioning application in C, debugging should still be possible. After all, the transfer of new features to the "old" code needs to be tested, and the old source code maintained. At least for this part of the application any commercial debugger can be used, as it is now standard in the field of embedded technology to debug C code.

# Testing in C++

The process becomes more interesting when we begin testing the new features at source level. Most UML tools generate C++ code with all the  $\rangle\rangle$ 

**PowerView** 

PowerDebug

PowerTrace

PowerProbe



Fig. 7: State diagram of new traffic-light switching



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

#### **PowerView**

PowerDebug

PowerTrace

PowerProbe

Power-Integrator characteristics that belong to it, such as templates, polymorphism, and exception handling. It must be remembered that the debugger used must provide full support for the C++ dialect of the compiler. The TRACE32 debugger from Lauterbach supports all current C++ compilers, and so guarantees comfortable debugging of object-oriented C++ code.

### Testing in UML

But of course, we didn't write our code in C++, but rather in UML. And what would

make more sense than to carry out debugging on the modeling level? Rhapsody from Telelogic offers several options to serve this purpose.

If the behavior of the application is completely modeled in UML, the sequence can be simulated directly

Entire Model View sequence can be simulated directly Junction Components TRACE32 le Edit View Var Break Run CPU Misc Trace Perf Cov MPC8xx OSEDelta Window Help B::Data.List Tramway::check N Step Nover Next A Return C Up Go Bread line source 37 int Tramway::check() {
 //#[ operation check(
 return checkEvents();
 //#] Program Address ↓ Go Till //#1 Breakpoint... 41 } Breakpoints Display Memory void Tramway::release(int tram) { 43 Toggle Bookmark //#[ operation release(int)
open != 1 << tram;</pre> 45 set PC Here Edit 47 🛓 View Info B:: emulate trigger devices trace Data Var other previous UP:00020788 \\TheJunction\Tramw stopped at breakpoint

from the model. For simple behavior analysis, the actual target hardware is not required. As the target will have a different timing than the simulation, the application can also be run as an "animation" on the real target. The UML tool controls and visualizes the target process over a communication channel (serial or Ethernet). So it is possible to perform single steps through state charts or to induce events.

In combination with a TRACE32 debugger, this communication can also take place over the de-

bug interface if there is no other interface available. Direct debugging in the UML model is made possible via "animation".

# Integrated UML -> C++ -> C debugging

Now we have a situation spread over three levels: UML, C++, and C. When an error is found, especially in C++ code, it is better to repair it in the model, and not in the generated C++ code itself. On the



**Fig 8:** The integration of TRACE32 and Rhapsody allows a debugging in UML, C++ and C.

other hand, the debugging of the implementation of a model element is best carried out in the source, without having to search for it.

The integration of TRACE32 and Rhapsody offers extensive functionality in order to link these levels together (Fig. 8), such as the implementation of simple, interdependent navigation. A single mouse click on a model element in Rhapsody is enough to display the corresponding source code in TRACE32. This then allows the extensive analysis of variables, function calls, and so on. It also works the other way around; with a simple click on a line of source code in the debugger, the accompanying model element is highlighted in the UML tool. »



**NEWS 2008** 

This is especially helpful if a problem is found with the debugger, which needs to be repaired in the model. A lengthy search and navigation to the relevant element thus becomes unnecessary.

**XXX** 

Debugger breakpoints can also be defined in the model and the target can be set into Go or Stop status. This enables you to set a breakpoint at a class and start the target, all from within the UML tool. The debugger stops the application as soon as the model element under examination is invoked.

# Summary

A complete redesign is not always necessary. Particularly for projects where re-engineering is not possible due to time or organization constraints, a step-by-step approach is often appropriate. This

# Latest news in brief

# CFI flash programming

Automatic generation of the flash declaration via the *Common Flash Memory Interface* – CFI – has been supported since September 2007.

| B::FLASH.CFI                                     | • 💌   |
|--------------------------------------------------|-------|
| Flash declaration<br>Flash startaddress or range |       |
| 0xC0000000                                       | -     |
| Data bus width Flash unit number (opt)           |       |
| Target based flashing                            |       |
| Enable target based flashing                     |       |
| Code address or range                            |       |
|                                                  | -     |
| Data address or range                            |       |
|                                                  | -     |
| Data buffer size                                 |       |
| To Clipboard Ok C                                | ancel |

To program a flash, you always had to have detailed knowledge about the exact type and organization of the device. Thanks to the special query mode of CFI-conformant flash devices, TRACE32 can now identify the parameters needed for programming and use this data to create a flash declaration automatically. limits the effort required and allows to import the existing software step by step into a modern and innovative CASE tool.

It is, of course, essential that the tools used in this process help and don't hinder; the selection of the correct tools is an important factor. They should not just be able to perform these tasks, but should also encourage them.

Lauterbach and Telelogic have achieved just this with the integration of their tools TRACE32 and Rhapsody. Both companies want to help their customers keep software quality at high levels, despite increasing complexity and ever more requirements. Now nothing stands in the way of modernizing your existing code bases with UML.

> Power-Integrator

**PowerView** 

**PowerDebug** 

**PowerTrace** 

**PowerProbe** 

# Serial flash / NAND flash

Since September 2007, TRACE32 has supported the programming of NAND flash devices as well as serial flash devices with a separate command (*FLASHFILE*).

# **Code overlays**

Since December 2007, TRACE32 has supported the symbol management for code overlays in ELF files with the command *sYmbol.OVERLAY*. The ARC, ARM, and PowerPC architectures are currently supported. Further architectures are planned for 2008.

# **PCP** debugger

Since November 2007, a separate TRACE32 instance has to be started to debug the Peripheral Control Processor – PCP – of the Infineon TriCore processor. This innovation was needed to enable the loading and testing of C code for the PCP. Of course this instance is also used to configure the PCP trace and analyze the trace information. The license for the PCP debugging and tracing is included in the scope of delivery of the TRACE32 debugger for TriCore.



# NEWS 2008 🐧

DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# New supported processors

#### **PowerView**

Power-

Integrator

# New architectures

|            | P.A. Semi   | PWRficient <sup>™</sup> | Q2/2008   |
|------------|-------------|-------------------------|-----------|
| PowerDebug | Renesas     | H8SX                    | Available |
| PowerTrace | Tensilica   | Xtensa<br>Processors    | Available |
|            | Texas       | TMS320™C28xx            | Available |
| PowerProbe | Instruments |                         |           |
| FUWEIFIUDE |             |                         |           |

### **PWRficient**

The PA6T-1682M dual-core processor will be the first 64-bit PowerPC of the PWRficient architecture supported by a Lauterbach debugger.

### TMS320C28xx

With the debugger for the TMS320C28xx DSPs, Lauterbach is providing even more support for the TMS320 Processor Platform from Texas Instruments.

# **New derivatives**

| AMCC                   | <b>PPC405</b><br>- PPC405EX/EXR                                                         |
|------------------------|-----------------------------------------------------------------------------------------|
|                        | <b>PPC44x</b><br>- PPC460EX                                                             |
| ARC                    | ARC<br>- ARC <sup>®</sup> 700 Core<br>- ARCtangent-A4<br>- ARCtangent-A5                |
| Freescale <sup>™</sup> | ColdFire<br>- MCF52110/MCF52100<br>- MCF5221x<br>- MCF523x/MCF532x<br>- MCF5372/MCF5373 |
|                        | <b>MPC5500</b><br>- MPC551x<br>- MPC56xx                                                |
|                        | MPC5200<br>- MPC512x with<br>AXE audio processor                                        |
|                        | PowerQUICC II<br>- MPC8377<br>PowerQUICC III<br>- MPC8572                               |

| Freescale™<br>(cont.)   | ARM11/StarCore<br>- MXC91321                                         |
|-------------------------|----------------------------------------------------------------------|
| Infineon                | <b>C166S</b><br>- XC22xx/XC23xx/XC27xx<br>- XE166                    |
|                         | <b>TriCore</b><br>- TC1736<br>- TC1767/TC1767ED<br>- TC1797/TC1797ED |
| Microchip               | <b>MIPS32</b><br>- PIC32™                                            |
| NXP                     | <b>ARM9</b><br>- LPC3000<br><b>MIPS32</b><br>- PNX83xx/PNX85xx       |
| STMicro-<br>electronics | MPC5500<br>- SPC563Mxx<br>- SPC560Bxx/Pxx/Sxx<br>Cortex-M<br>- STM32 |
| Texas<br>Instruments    | <b>Cortex-A/DSP</b><br>- OMAP3430                                    |



DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# Debugging with power-saving modes for ARM cores

No.

In response to constantly rising demands for economical use of energy and long standby times, modern cores often have various different energy-saving mechanisms. These allow the software, whenever possible, either to reduce the core frequency or to switch off the core completely. However, these so-called energy-saving modes can adversely affect communication between the debugger and the core. In the following, with the ARM architecture as an example, we show variants for problem-free debugging of programs using energy-saving modes.

### Ignoring energy-saving modes

Of course, the first and easiest way is to tell the core to ignore energy-saving modes during debugging. For example, ARM11 has a so-called *Powerdown Disable Bit*. When this bit is set, the core activates a line that informs the external power and clock management about the debugging. If then the program tries to activate an energysaving mode, the power and clock management can simply ignore it to avoid problems during debugging. Setting the *Powerdown Disable Bit* is one of the basic settings of the TRACE32 debugger for ARM11, Cortex-A, and Cortex-M.



Fig. 9: The PWRDWN option, defining whether energy-saving modes are ignored or not during debugging If developers want to test the behavior of the design with regard to energy-saving modes, they can of course reset the *Powerdown Disable Bit* with a debugger option (*SYStem.Option PWRDWN ON*, see Fig. 9).

However, there are many ARMbased chips that do not have a *Powerdown Disable Bit*. In this case, a special debug version of the software without instructions for activating energy-saving modes could be created. However, the disadvantage of this is that the debug version of the software would be different from the release version. But why can debugging and use of energy-saving modes lead to problems? In the first part of this article, we look at single-core chips: How does switching off (core) power or clock affect debugging? Next, we look at multi-core chips. From the large number of possible challenges, we concentrate on the effect that switching off the clock of a single core has on the debugging of the entire chip.

# Switching off power

At the VTREF pin (target voltage reference pin) of the JTAG connector, the debugger can detect a powerdown. If the user configures the debugger to use this energy-saving method, the debugger can wait until power-up instead of issuing an error message (StandBy mode).



Fig. 10: Instead of issuing an error message, the debugger waits for power-up following a powerdown.

In practice, the debugger can be in one of the two following states:

- Up (StandBy): the processor has power and there is communication between the debugger and the processor.
- **StandBy:** power is off and the debugger keeps the ARM processor in reset over the SRST line. The debugger waits for power to be switched on again (see Fig. 10).

When power is switched on again, the debugger releases the reset line and the processor runs as it would after a power-up reset.

However, the power-up reset of the ARM processor resets the debug logic, the ETM/ETB logic, and the assignment for the ETM pins. The settings that were active before powerdown are lost.  $\rangle\rangle$ 



PowerDebug

PowerTrace

PowerProbe



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

PowerView

PowerDebug

**PowerTrace** 

**PowerProbe** 

Power-

Integrator

Seamless continuation of debugging is possible only if the debugger manages to restore these settings before the ARM processor starts executing the program.

For the debugger to be able to carry out the necessary restore actions at power-up, the target hardware must meet some requirements.

### The best case

The best case is when the following requirements are met:

- 1. The debugger can keep the ARM processor in reset over the SRST line.
  - 2. The processor reset (SRST) and the reset for the debug logic (TRST) are separate lines.
- 3. Via the VTREF pin of the JTAG interface, the debugger can quickly detect that power has been switched off.

If these requirements are met, the debugger only releases the reset for the debug logic when it detects power-up. The settings of the debug logic and the ETM/ETB configuration are now restored. Next, the processor reset is released and debugging can be continued seamlessly, albeit somewhat delayed (see Fig. 11).



Fig. 11: Following power-restore, first the debug logic and the ETM/ETB configuration are restored (green arrow). Only then does the ARM processor start to execute the program.

Since the I/O pins are configured via memory mapped registers, they cannot be restored for the ETM trace port before the processor reset is released and the program starts, so the program itself has to activate the ETM port as fast as possible.

# **Other cases**

If requirements 1 and/or 2 are not met, the ARM processor starts up at power-up (see Fig. 12), while the debugger simultaneously restores the debug



(\* = reset)

Fig. 12: Undesirable side effects resulting from a processor restart while the debugger is restoring the debug logic and the ETM/ETB configuration (red arrow)

logic and the ETM/ETB configuration. This can have undesirable side effects. At the same time, of course, the on-chip breakpoints and the trace broadcasting are inactive in the restore phase.

# Switching off the clock

Switching off the processor clock is a completely different challenge for the debugger. We are dealing here with the communication between the debugger and the processor, or to be more precise, the communication between the debugger and the on-chip debug logic. This communication is almost always running, even when the ARM processor is processing the program.

As long as the debug logic is independent from the processor clock the communication works, even if the processor clock is switched off. For clocking the (as asynchronously standardized) debug logic, however, often a delayed and divided processor clock is used. You know this is the case if you have to use the command *SYStem.JtagClock RTCK* when you configure the debugger.

Switching off the processor clock can affect communication between the debugger and the on-chip debug logic if two requirements are met:

- 1. The debug logic clock depends on the processor clock.
- 2. The processor clock is switched off during communication between the debugger and the on-chip debug logic.

Now, when the processor clock is switched off, the following happens:

- 1. The ARM processor freezes its current state.
- 2. The debug logic also freezes its current state since its clock depends on the processor clock. >>>



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

3. The debugger breaks off communication with the on-chip debug logic after some timeout since the on-chip debug logic no longer responds.

NON CON

When the processor clock is switched on again, the debug logic resumes communication with the debugger at exactly the place where it was frozen, while the debugger has long since re-initialized communication since it received no response from the debug logic.

Since the debug logic of the ARM processor is controlled by the debugger, the debugger would have to reset the debug logic in order to synchronize communication again. However, at least for a brief period during the program run, all on-chip breakpoints would be deleted. This is far from ideal.

The following solutions have proved reliable for customers:

#### 1. Checking the processor clock

We recommend configuring the debugger so that it checks whether the processor clock is running before starting communication with the on-chip debug logic. For this the debugger puts the debug logic in Run Test/Idle mode at the end of each communication. In this mode, the processor clock can be easily tested before resumed communication.

#### 2. Reducing status queries

While the ARM core is processing the program, the debugger sends status queries to the debug logic 10 times a second. This way it checks whether the program is still running or whether it has been stopped. A next step is to reduce the frequency of these status queries. We recommend a query every 2-3 seconds (*SETUP.URATE 3.s*). This solution is quite simple: fewer queries reduce the risk of communication problems.

#### 3. Increasing timeout

If the program switches off the processor clock very often but always only very briefly, the debugger can be configured so that it simply waits longer before breaking off communication with the debug logic (*SYStem.POLLING SLOW*). In certain circumstances, it might not even notice that the processor clock was switched off.

### **Multi-core chips**

Multi-core chips, which use only one JTAG interface for debugging daisy-chained cores, require additional measures if energy-saving modes are used for individual cores. Consider the following situation as an example: the clocking of the debug logic depends on the core clock (RTCK) for core 2 and now the clock is switched off for only this core (Fig. 13).



PowerDebug PowerTrace PowerProbe Power-

Integrator

**PowerView** 

Fig. 13: Switching off the clock for core 2 blocks debugging for the whole chip.

As described in the section "Switching off the clock", the debug logic of core 2 is frozen in this case. Since the chaining of the cores means that debug commands for core 1 / core 3 have to run through core 2, the debugging for the entire chip is now blocked.

To be able to continue debugging without complications when the clock for core 2 is switched on again, we recommend configuring the debugger for the individual cores so that a check is made on whether the core clock is running, before communication with the debug logic starts.

However, this is no solution to the problem that debugging is blocked for the entire chip as soon as the debug logic of a single core is frozen. Some good solutions exist for this problem today, but the chip manufacturers consider their solutions to be confidential, so we cannot present them here.

We will therefore restrict ourselves in the following to explaining how the CoreSight DAP technology solves this problem.

In the case of multi-core chips whose debug and trace functionality is based on the CoreSight technology, the TRACE32 debugger no longer communicates directly with the debug logic of the  $\rangle$ 



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

**PowerView** 

#### PowerDebug

**PowerTrace** 

PowerProbe

Power-Integrator individual cores but with a so-called DAP (Debug Access Port) instead. The DAP's task is to distribute the debug commands to the individual cores.

For debugging with energy-saving modes, the most important innovation of the DAP technology is that the DAP has its own power and clock domain. This means that the DAP is unaffected by the energysaving modes of the individual cores. At the same time, the cores are no longer daisy-chained under CoreSight. Instead,

- Their debug logic is controlled by bus accesses if it is implemented via memory mapped debug registers.
- Their debug logic is controlled over a JTAG access port if the core works with the traditional debug logic.

These new control mechanisms guarantee that the DAP can always address the individual cores directly and that no further debug commands are blocked by any frozen cores (see Fig. 14).

It can be generally said for multi-core chips that if the debugging of energy-saving modes is particularly important in the development phase, you should discuss this subject with the semiconductor manufacturer when choosing your chips. The experts of the Lauterbach team will be happy to advise you on this subject.

# Analyzing energy consumption

For all ARM-based chips with ETM and an off-chip trace port, Lauterbach offers the option of analyzing whether the energy-saving modes really result in a maximum reduction of energy consumption for the design. To answer this question, a measuring arrangement is required that records the currents

| 🔑 Setup 🚻 Groups          |                     |                  | List all    | TREE         | 🖽 Ch | ort 🛛 🞯 | Init |
|---------------------------|---------------------|------------------|-------------|--------------|------|---------|------|
|                           | samples: 9<br>total | 123165. 1<br>avr | otal:<br>1% | 8.323J<br>2% | 5%   | 18%     | 20   |
| (other)                   | 0.000               | 0.000            |             |              |      |         |      |
| \User\Global\ <b>JUMP</b> | 21.062uJ            |                  |             |              |      |         |      |
| \User\Global\LED1         | 482.740nJ           | 3.473n           |             |              |      |         |      |
| AUser\Global\LED2         | 500.216nJ           | 3.599n           |             |              |      |         |      |
| Muser/Global/LED3         | 710.987nJ           | 5.115n           |             |              |      |         |      |
| \User\Global\LED4         | 917.761nJ           | 6.603n.          |             |              |      |         |      |
| \User\Global\LEDS         | 1.129J              | 8.119n.          |             |              |      |         |      |
| \User\Global\LED6         | 1.333J              | 9.588n.          |             |              |      |         |      |
| \User\Global\LED7         | 1.525J              | 11.051n.         |             |              |      |         | - 1  |
| \User\Global\LED8         | 1.722J              | 12.476n          |             |              |      |         | _    |

Fig. 15: Graphic showing the connection between the program and the current/power consumption of the hardware



**Fig. 14:** The DAP can always address the debug logic of the individual cores directly. This means it is no longer possible for the switching off of a clock for an individual core to block debugging of the entire chip.

and voltages at selected measurement points and correlates them with the program flow information output at the trace port.

Lauterbach offers such a measuring arrangement since 2007. For this you need additionally to the TRACE32 debugger:

- A real-time trace for the ETM
- An Analog Probe
- A Lauterbach logic analyzer; either the *IProbe Logic Analyzer* in PowerTrace II or the *TRACE32*-*PowerIntegrator*

Since in this measuring arrangement both the currents and voltages as well as the program flow are synchronously timestamped by the TRACE32 software, the links between the control software flow and the current/power consumption can be easily displayed and analyzed. Fig. 15 shows as an example the absolute and percentage share of total energy consumption calculated for each function executed during the measurement time. In this way, the important energy-consumption and standbytime data can be verified.



DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# New debugging concept for symmetric multiprocessing (SMP)

Lauterbach, the leading manufacturer worldwide of high-quality debuggers and real-time trace tools, is presenting its new debugging concept for SMP systems at the ESC 2008. SMP systems use an operating system to distribute processes dynamically to several cores or hardware threads. SMP Linux on a MIPS 34K will be demonstrated live at the fair.

W. Sak

# Hardware parallelization

To achieve higher processor performance for complex applications and to save energy, more and more parallelization of tasks is used. The most popular method is to provide several identical execution units that can all process the same task.

To most readers, the term "identical execution units" means symmetric multi-core processors. It is easy to imagine that the kernel of an operating system is running permanently on a core and ensures that the application processes are evenly distributed to all cores (see Fig. 16).

However, for the parallelization of tasks not necessarily a multi-core processor is required. Hardware multithreading, for example, is an approach that enables parallelization also for single-core processors. Here, we deal with a basic problem of



Fig. 16: SMP Linux on a dual-core processor

proces

cores with pipeline architecture: cache misses or data dependencies between the instructions mean that the pipelined instruction processing has to be stalled in order to wait for the availability of required data. The greater the difference between instruction processing time and memory access time, the higher the performance loss.

Hardware multithreading deals with this situation by making the core process several mutually independent tasks quasi-simultaneously. In the case of an operating system, "tasks" and "processes" are equivalent.

In principle, this works as follows: As soon as a task can no longer be further processed because

required data is not yet available, the processing of another task is continued.

Fig. 17 shows the processing of 3 tasks on 3 cores with simple pipeline architecture (at the top) and then (at the bottom) shows the execution on a multithreaded core.

In order for hardware multithreading to work, fast switching between tasks must be possible. This can be done simply by giving each task a separate register set for its context. With this method, it is easy to deduce that the number of register sets provided by the core defines how many tasks can be processed in parallel. For the MIPS 34K the number is five. In this way, one »



Power-Integrator

**PowerView** 

**PowerDebug** 

Processing on 3 cores with simple pipeline architecture



load load process update load process update loa

Fig. 17: No more stalls in the pipeline, thanks to the quasi-simultaneous processing of several tasks



# **NEWS 2008**\*

### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER



PowerProbe

Power-Integrator Fig. 18: SMP Linux on a multithreaded core - here MIPS 34K

"execution unit" (a so-called hardware thread) derives from each register set (see Fig. 18).

# SMP operating systems

At the hardware level, the parallelization of processes is implemented by symmetric multi-core processors or multithreaded cores. Now software is needed to organize the parallelization. An operating system is normally used for this purpose.

One variant, which is primarily suitable for hardware with identical execution units, is operating systems that implement symmetric multiprocessing (SMP). The central feature of SMP operating systems is the dynamic distribution of task/processes to the available cores or hardware threads at program runtime.

Other features:

- One instance of the operating system operates all cores or hardware threads.
- Each application process can run on each core or hardware thread. As a rule, only the kernel is rigidly assigned to a core or hardware thread.
- All cores or hardware threads have equal rights to request and use resources (e.g.: memory, external interfaces, external devices).
- The operating system provides the functions for distributing resources to the cores or hardware threads.

# New debug concept

At present, the TRACE32 concept enables just one core to be debugged with one instance of the debugger (*Core View*). Up to now, this concept has

also worked perfectly for multi-core processors since these processors were also operated as asymmetric multiprocessing systems (AMP). Asymmetric multiprocessing means: An independent instance of an operating system runs on each core. For AMP systems, this always statically defines which process is running on which core. In this way, the debug information can also be uniquely assigned to the corresponding core.

In contrast, on SMP systems, the processes are not assigned to the cores or hardware threads until runtime. For this reason, it no longer makes sense to start a debugging instance specifically for the debugging of a selected core or hardware thread.

#### System View

As an alternative to the *Core View*, which works very well for AMP systems, there is now the *System View* for SMP systems.

With the *System View*, only one instance of the TRACE32 debugger is started to debug all cores or hardware threads (see Fig. 19). Here too, root of information display is one core or hardware thread. The new aspect here is that the information  $\rangle$ 



Fig. 19: Only one instance of the TRACE32 debugger is used for debugging all cores or hardware threads.



**NEWS 2008** 

display can be switched by a command to another core or hardware thread.

| nagic    | Connand   | state    | uid | pid | spaceid | ttu | 1          | cpu   |  |
|----------|-----------|----------|-----|-----|---------|-----|------------|-------|--|
| 80335998 |           | sleeping | 8.  | 26. | 8888    | 8   |            |       |  |
| 80335578 |           | sleeping | 0.  | 27. | 0000    | 8   | 88<br>9869 | Ø.    |  |
| 80335158 |           | sleeping | 0.  | 28. | 0000    | 8   | 8888       | · · · |  |
| 80334038 | aio/1     | sleeping | 0.  | 29. | 0000    | 8   | 0000804    | 1     |  |
| 80334918 | aio/2     | sleeping | 0.  | 30. | 0000    | 8   | 00008040   | 1 2.  |  |
| 803344F8 | aio/3     | sleeping | 0.  | 31. | 0000    | 8   | 8888884    |       |  |
| 80334008 | aio/4     | sleeping | 0.  | 32. | 0000    | 0   | 00000010   | 1 1.  |  |
| 800D99F8 | sh        | sleeping | 0.  | 43. | 002B    | 0   | 00402000   | 3 4.  |  |
| 8002DA18 | sieve     | running  | 0.  | 44. | 882C    | 0   | 8848288    | 3 1.  |  |
| 83861538 | helloloop | sleeping | 0.  | 47. | 002F    | 0   | 00402000   | 3 0.  |  |

Fig. 20: The TASK.DTASK window, showing the core or hardware thread to which SMP Linux has assigned a task/process

Process debugging can now be done as follows:

- The current list of all active task/processes is used to identify the core or hardware thread on which a task/process is running (Fig. 20). Linux uses here the term CPU instead of using core or hardware thread.
- 2. A command is used to switch the information displayed in the debugger to the required core or hardware thread (Fig. 21).

#### **Common breakpoints**

The fact that an SMP operating system does not assign application processes to a core or hardware thread until runtime also has consequences for the setting of on-chip breakpoints.

A simple example: The "sieve" process is to be stopped as soon as it writes to the variable "xyz". In order for the debugger to be able to implement this request, it must program the break logic of all cores or hardware threads for this break condition since it cannot know beforehand the core or hardware thread on which the "sieve" application process will run. This means that even if each core or hardware thread has its own break logic, the debugger programs the breakpoints for the user as if there were only one break logic shared by all.

When setting of software breakpoints, for the implementation of which the original instruction in memory is temporarily overwritten by a break instruction, there are no changes compared to AMP systems. Operating systems protect the address spaces of processes from each other. However, if the same application process is started more than once, Linux (for example) loads the program code only once. Each instance of the application process thus sees the same program memory as well as the software breakpoints set there. To make sure that the program execution is stopped only in the desired application process, the TRACE32 debugger enables the setting of process-specific software breakpoints.

### Summary

The systematic extension of the TRACE32 concept enables Lauterbach to offer its customers simple debugging of embedded designs that use an SMP operating system for controlling several cores or hardware threads. To be able to use this new concept for debugging SMP operating systems, you just need a TRACE32 debugger whose debug cable has not only a license for the processor architecture but also a second license such as a multi-core license. PowerDebug PowerTrace

**PowerView** 

PowerProbe

Power-Integrator

Following the support for the MIPS 34K, the debugging of SMP systems for ARM and PowerPC architectures is also planned for early 2008.



Fig. 21: The TRACE32 window, which always shows only the context of a core or hardware thread. The number of the core or hardware thread is shown in the status line.



DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# **Further RTOS innovations**

# Windows CE 6.0 for ARM and SH4

#### TRACE32 now also supports Windows Embedded CE 6.0 for ARM and SH4 architecture.

Since the end of 2006, developers have been able to use the new version of Windows CE for their projects. Version 6.0 introduced some fundamental changes in the kernel architecture. For example, Windows CE now supports process address spaces of 2 GB. For Version 5.0, these were limited to 32 MB.

Since late 2007, an updated version of *TRACE32 Windows CE Awareness* has been available, so that a TRACE32 debugger can now be used to monitor and test all processes, threads and libraries simultaneously. In combination with a real-time trace tool, a runtime analysis of Windows CE threads is also possible. The configuration of *Windows CE Awareness* has been simplified:

- The debugger now supports automatic detection of the kernel objects. This means that the symbol information for the kernel no longer has to be loaded explicitly.
- The Windows CE Autoloader of the debugger simplifies symbol management for the different processes and libraries. If the autoloader is configured accordingly (sYmbol.AutoLOAD.CHECK-WINCE), the matching symbol information is automatically loaded to the debugger when needed.

# Integration in the Windows CE Platform Builder

Since the beginning of 2007, Lauterbach debuggers can be used as hardware debug back end for the Windows CE Platform Builder.

To debug the kernel and application processes in Windows CE, Lauterbach offers an eXDl2 driver that permits control of a Lauterbach debugger through the internal debugger of the *Windows CE Platform Builder*.

By means of target device connectivity, the TRACE32 eXDI2 driver is first integrated into the platform builder (PB). As a result of this integration, all debug commands of the PB debugger are now translated into TRACE32 commands in the background and passed to the Lauterbach debugger.

The eXDI2 driver is available for Windows CE 5.0 and 6.0 as well as for Windows Mobile Versions 5 and 6.

# **Extensions**

- Linux Support for Xenomai Threads
- Linux Symbol Autoloader
- OSE Support for 5.2 Load Modules
- PXROS Redesign for C167

| New suppo | orted RTOS |
|-----------|------------|
|-----------|------------|

| DSP/BIOS for TMS320C28xx           | Available |
|------------------------------------|-----------|
| eCOS for MIPS                      | Available |
| FreeRTOS for ARM<br>and MicroBlaze | Available |
| HI7000 for SH4                     | Available |
| Linux for ARC and MicroBlaze       | Planned   |
| LynxOS-SE for PowerPC              | Planned   |
| NetBSD for ARM and PowerPC         | Available |
| OS/9 for ARM and PowerPC           | Available |

| RX4000 for MIPS                | Available |
|--------------------------------|-----------|
| ThreadX for ARC and Blackfin   | Available |
| ThreadX for Xtensa             | Planned   |
| Windows CE 6.0 for ARM and SH4 | Available |
| µClinux for Blackfin           | Planned   |
| µClinux for MicroBlaze         | Available |
| μC/OS-II for V850              | Available |
| µC/OS-II for Xtensa            | Planned   |
|                                |           |



# Integrator also Awa

**PowerView** 

**PowerDebug** 

**PowerTrace** 

**PowerProbe** 

Power-

**NEWS 2008** 

# New preprocessors/NEXUS adapters



NON CON

Many SoCs (system on-chip designs) now have a trace port that makes runtime information visible off-chip. Such information can be the instructions executed by the program as well as data transfers. If an operating system is used, output of the current task ID can be helpful. The best-known trace ports are certainly ETM for the ARM architectures and NEXUS for the PowerPC architecture.

To record runtime information Lauterbach offers TRACE32-PowerTrace, a real-time trace tool. At the hardware level an architecture-specific preprocessor or NEXUS adapter receives the data from the trace port and transfers it to the trace memory (up to 4 GB) of the PowerTrace hardware. This recorded information serves the user as the basis for efficient troubleshooting as well as for comprehensive runtime analyses of his design.

Lauterbach is now offering preprocessors and NEXUS adapters for more than 30 processor architectures. Many more will be supported in 2008.

# New NEXUS adapter for MPC5500 architecture

From March 2008, the NEXUS adapter for the MPC5500 architecture will be available in an improved version using an advanced technology. To respond flexibly and quickly to new variants of the NEXUS port, this adapter supports a port width of up to 16-bit MDO and a voltage range of 1.0V to 5.0V.

Lauterbach offers converters to adapt to the specific processors' NEXUS port.

To guarantee optimum sampling of trace signals at high trace port data rates, the new NEXUS adapter also contains FPGAs for the automatic setting of the optimum sampling instant (*AUTOFOCUS* technology).

# NEXUS adapter MPC5500 AF

- NEXUS port width up to 16-bit MDO, 2-bit MSEO, EVTI, EVTO
- Data rate up to 200 MBit/s
- 200 MHz at SDR or 100 MHz at DDR
- Voltage range 1.0V to 5.0V
- Adapter for different variants of the NEXUS port
- AUTOFOCUS technology

# **New AUTOFOCUS II preprocessors**

To enable preprocessors to handle trace port frequencies of over 500 MHz, the *AUTOFOCUS II* technology was developed in 2006. The most important feature of this technology is the automatic setting of the optimum sampling instant for trace data. Based on this technology, further preprocessors are now offered for a series of DSPs.

# New AUTOFOCUS II preprocessors

| Preprocessor for Ceva-X                       | Q1/2008 |
|-----------------------------------------------|---------|
| Preprocessor for StarCore                     | Q3/2008 |
| Preprocessor for TMS320C55x<br>and TMS320C64x | Q1/2008 |

# **MicroBlaze**

Since November 2007, a preprocessor for the MicroBlaze architecture has also been available. Using a 22-pin wide trace port, the *MicroBlaze Trace Core* makes program flow as well as (optionally) the data flow visible off-chip. Trace port frequencies of over 100 MHz can now be handled by the preprocessor.



PowerDebug

PowerTrace

PowerProbe



DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

# **CombiProbe**

**PowerView** 

PowerDebug

PowerTrace

**PowerProbe** 

Power-Integrator For fast output of important system information at runtime, some cores now have simple trace ports. These ports enable an application program to make user-defined data visible offchip. For recording and processing this data, Lauterbach offers a new tool option called CombiProbe. Here we describe the hardware and operation of this new product.

### Instrumentation

Most developers are familiar with test scenarios for which a simple printf() is the most efficient implementation. Typical examples are output of diagnostic information or logging of important system events, i.e., test scenarios where the application is already working as intended, and now operational tests have to be conducted. Usually, code instrumentation is used for these tests.

Two steps are necessary to make important system information visible with instrumentation at program runtime:

- 1. Instructions are inserted in the application program to provide the necessary information.
- A way has to be found to make this information visible. An external communication interface such as RS-232 or Ethernet is usually used for this.

The great advantage of instrumentation is that only the information of interest to the tester is made visible. At the same time, however, large communication overhead is created since, as a rule, a further communication interface has to be operated by the operating system. If the interface is slow, considerable runtime problems can result for the actual application.

# System Trace

The communication overhead created during instrumentation can be largely reduced if the core offers a separate trace port for the output of system information generated by the application program. As an IP provider, ARM offers such a solution as part of its CoreSight technology.

The CoreSight Instrumentation Trace Macrocell (ITM) basically works as follows:

1. The application program writes the information that has to be made visible to a 32-bit memory

mapped register assigned to the ITM.

2. The ITM makes the information visible either directly over the Serial Wire Output (see Fig. 22) or together with other trace data over the CoreSight Trace Port Interface Unit (TPIU).

Many mobile phone manufacturers are already working with similar proprietary solutions for their

chips. However, to save money, they are interested in a uniform standard for such a trace port. In May 2007 the "Test and Debug" workgroup of the MIPI Alliance specified, under the name "System Trace", a standard for a 4-bit trace port as well as a 34-pin debug and trace connector.

Along with the standard- Hardware data

ization, the functionality of the trace port was extended. Apart from the system information generated by the application program, the System Trace can now output hardware information too (see Fig. 23). A programmable bus watcher, e.g., can detect and display specific bus cycles, or a signal monitor can return the status of selected chipinternal signals.



Software data

32-bit register

Instrumentation

Trace

**Macrocell** 

Serial Wire Output

(asynchronous)

Fig. 22: CoreSight Instru-

mentation Trace over Serial

Wire Output

Fig. 23: System Trace specified by MIPI Alliance

# CombiProbe hardware

As a "Test and Debug" workgroup member, Lauterbach took an active part in the *System Trace* specification and at the same time in the development of a suitable debug and trace option named "CombiProbe". Since October 2007, Lauterbach has been marketing this new product.

CombiProbe is a combination of a special debug cable and 128-MB of trace memory. It can, for example, be plugged into the universal Lauterbach  $\rangle\rangle$ 



**NEWS 2008** 

base hardware such as the POWER DEBUG INTERFACE / USB 2 (see Fig. 24). As target system connection, the 34-pin debug and trace plug from the MIPI Standard is used, comprising connectors for a JTAG port and a 4-bit System Trace Port. Lauterbach offers also adapters for other target system connectors.



Fig. 24: POWER DEBUG INTERFACE / USB 2 with a Combi-Probe

# **Operating concept**

Additionally to the traditional debug functions CombiProbe provides new configuration commands as well as commands for displaying and analyzing recorded trace information.

Configuration commands are:

- ITM.<subcommand> for configuring the Instrumentation Trace Macrocell of CoreSight technology (see Fig. 25)
- STM.<subcommand> for configuring a System Trace Module
- SystemTrace.<subcommand> for configuring . the CombiProbe

The SystemTrace command is also used to analyze and process recorded trace data.

System information visible at the trace port can be recorded as usual into the trace memory of the CombiProbe and analyzed using TRACE32 commands. An intuitive analysis of system information is supported by formatting the raw trace data according to application with the help of the so-called TRACE32 Protocol API.

However, the size of the trace memory in CombiProbe limits the recording time.

To implement longer recording intervals, we offer a PIPE mode for the CombiProbe. In PIPE mode, the trace data is transferred directly to the host. In this mode the trace memory in the CombiProbe is used only as a buffer.

On the host, TRACE32 can be configured to write the trace data immediately to a file. After recording, the system information can also be formatted and analyzed by an external application.



Fig. 25: ITM commands are used to configure the Instrumentation Trace Macrocell.

The System Trace commands enable recorded trace information to **PowerView PowerDebug PowerTrace PowerProbe** Power-

Integrator

be displayed and analyzed.

configured to collect the trace data over the TRACE32 *FDXAPI* and process it while recording (see Fig. 26).  $\rangle\rangle$ 

As an alternative, external analysis software can be



Fig. 26: CombiProbe can be configured so that external software can process trace information at recording time.



### DEBUGGER, REAL-TIME TRACE, LOGIC ANALYZER

#### **PowerView**

PowerDebug

PowerTrace

PowerProbe

Power-Integrator

# Summary

The CombiProbe is a new tool option for recording runtime information in the broad Lauterbach product range. For complex test cases, CombiProbe can also be used in conjunction with TRACE32-PowerTrace. Since all Lauterbach tools share a common time base, important system information can be placed in a direct chronological context with the program/data flow of the core.

### CombiProbe data

- Debug cable and 128-MB of trace memory
- JTAG support for ARM cores
  - Standard JTAG
  - Serial Wire Debug Port from ARM
  - cJTAG (IEEE P1149.7), planned
- Standard JTAG possible for other processor architectures
- Trace port support for
  - MIPI System Trace
  - ITM over Serial Wire Output
  - ITM over 4-bit CoreSight TPIU
  - 4-Bit ETMv3 in continuous mode, planned
- Bandwidth of 200 MBit/s per trace channel with up to 4 trace channels
- Voltage range 0.3V to 3.3V, 5V tolerant
- 34-pin half-size connector to target hardware
- Adapters also for 10- and 20-pin half-size connector

# 3G/DigRF report

From the summer of 2007, Lauterbach has offered a new set of probes for PowerIntegrator, which allow the recording of data traffic from a 3G/DigRF interface.

The following sampling rates are used for recording:

- 2 GigaSample/s for the RX/TX lines
- 250 MegaSample/s for the SysClkEnable line

A recording duration between 25.156 ms at full load and up to several hours is possible, depending on the data traffic on the lines. The high sensitivity of the probes and the very low load of the input signals guarantee error-free recording.

The measured data can be evaluated and analyzed at protocol level as well as time-correlated with the program flow.



# Please inform us:

If you would like us to remove your name from our mailing list or receive additional information, send an e-mail to:

# info\_us@lauterbach.com

