Dr. Dobb's is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Channels ▼
RSS

Parallel

Microarchitecture Performance


Micro-op Fusion: An Additional Energy-saving Technique

Typical mainstream processors break down x86 program instructions (macro-ops) into small pieces--the internal instructions called "micro-ops"--before sending the instructions down the processor pipeline to be processed.

Micro-op fusion takes macrofusion down another level. In advanced micro-op fusion, the execution core "fuses" common micro-ops derived from the same macro-op. This reduces the number of micro-ops that need to be executed. The result is more efficient scheduling and better performance at lower power. In fact, studies have shown that micro-op fusion can reduce the number of micro-ops handled by the out-of-order logic by more than 10 percent.

Manage Power Intelligently

Power gating is a technique that reduces power consumption, including the runtime power consumption of a processor's execution cores. The technique is used to power up logic subsystems only if and when they are needed.

Power gating has traditionally been challenging for industry because of the energy consumed in powering the subsystem down and ramping it back up for use. Gating has also been challenging because of the need to maintain overall system responsiveness when returning the subsystem to full power.

Advanced power gating in 65-nm CMOS now allows for intelligent, ultra fine-grained logic control of the individual processor logic subsystems. Only those individual subsystems that are currently required are powered on. With a finer granularity of subsystems, power gating also minimizes the number of subsystems that require power.

In addition, some manufacturers, such as Intel, split many buses and arrays, so that the data required in some modes of operation can be put into a low power state when not needed. The result is optimized energy use in a design that delivers more performance per watt without sacrificing responsiveness.

Challenges In Optimizing Cache

One of the major advantages of moving to a 65-nm process is having enough additional transistors to increase resources and/or improve performance in many critical areas. One such area is cache.

When processors have multiple cores, parallelism must be applied at the core level, not just the processor level. This means optimizing the way the execution cores exchange and share data. Without that optimization, multiple cores will introduce data and memory contention--a traditional issue in microarchitecture design.

Industry addresses this challenge by using one L2 cache for each core. It's simple. No core has to fight with other cores for access to its own cache. The problem in this approach is twofold: When two execution cores need the same data, each core must store that data in its own L2 cache, duplicating the work. Also, when one core isn't fully using its cache, other cores cannot access that underutilized cache for other tasks.


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.