Blog Archives

May, 2009

Below C level ...

I guess it all depends on where you jump into the fray with this parallelism stuff. In the server world, most of the applications are concerned with transaction throughput and the number of simultaneous user requests that can be successfully processed. So various schemes are concocted to deal with the load of simultaneous user requests. So multicore configurations have an obvious application at the server level. In the application world, things are not always so clear, but requirements for concurrency pop up.

Alternatives for Parallelizing Legacy Serial Code

Applications developed using modern programming languages can become legacy code faster than expected. Multicore microprocessors have been found guilty of transforming outstanding serial code into legacy code in just a few years. However, there are many alternatives to translate multicore power into application performance making small changes to the legacy serial code.

Are the Emperor's New Clothes Transparent or Just Invisible?

Surely some vendor should be able to come along and make the number of cores or processors transparent to our application. After all, isn't one of the operating system's jobs to hide the details of the hardware from the programmer?

New Garbage Collectors Designed With Parallelism in Mind

On the one hand, Garbage Collectors simplify developers' lives, but on the other hand, they can become the greatest enemies of a parallelized algorithm's performance. Finally, Java 7 and .Net 4 are going to offer new Garbage Collectors really targeted for multicore microprocessors with large memories.

Going Parallel: Part 3: Let's Get Started!

When I first dipped my toes into parallel programming I hunted around for a legacy application to change from serial to parallel. Rather than work on one of my own apps, I looked for a serial application on the web that had lots of CPU activity. I ended up choosing the Dhrystone benchmark.

Multicores in 3D

GPUs are dedicated processors that perform 3D graphics rendering. GPU manufactures are in a frenzy to deliver the latest technology that accelerates performance at a lower cost while still delivering high-end 3D graphics. Maybe this is something to think about. Maybe not. I do.

Of Quarks and Practical Parallel Programming

In this conversation with Timothy G. Mattson, Senior Research Scientist Intel, Computational Software Lab, we find close agreement on the best approach to applying parallelism to business and general application programming. But first we discuss quantum physics.


Defining Cloud Computing

If you've been trying figure out what "cloud computing" is, you're not alone -- and coming to the rescue is the National Institute of Standards and Technology. But as nice as the NIST folks are, they didn't have you in mind when they decided to put together an understandable definition of cloud computing.

Microsoft Visual Studio 2010 and .Net Framework 4.0 Beta 1 Finally Arrived

After a long silence since the last CTP (Community Technology Preview), Microsoft released the Visual Studio 2010 Beta 1 that comes with the new .Net Framework 4.0. This new release will bring new promising features that empower parallel programming using the supported languages.

Data Center Power Consumption Can Be a FAWNy Business

I'm a sucker for a good pun. (Of course, some would say that there's no such thing as a "good" pun.) So when I came across a paper entitled FAWNdamentally Power-efficient Clusters, I had to read it. And you know what? Not only did I enjoy reading the paper, I actually learned a thing or two about a very important topic.

Designing Rich Internet Applications With Parallelism in Mind

Are you still designing RIAs (Rich Internet Applications) using a serial programming approach? If that's the case, then you are losing a big opportunity to achieve a sustainable competitive advantage over competitors.

Measuring Parallelism: How Much Power Are You Wasting?

Management professionals love metrics. However, they don't like the idea to optimize applications that are currently running. Thus, what about showing them some metrics about the power that they are wasting using a free tool?

Efficiency Of the Programmer Needs To Trump Efficiency Of the Machine For Parallel Programming To Go Mainstream

You can tell when technology has matured, in part by how much the user needs to be an expert in using it. A step in maturing for parallel programming will come as we see that the efficiency of a programmer is more important than the efficiency of the machine.

Multicore Performance Benefits, Or Not

Many embedded system designers are still struggling to determine whether multicore technology really buys them anything in terms of performance. Resolving this quandary requires a thorough understanding of the target application, the amount of time that must be invested to make the transition, and the characteristics of multicore processors that could be used.

Getting Started with OpenMP

A couple of quick ideas for getting started with OpenMP.


  1. Compile a cute example

  2. Read a book

Calendar

June 2009
May 2009
April 2009
March 2009
February 2009
January 2009
December 2008

Real World Parallelism Webinar Series
  • November 17, 2009
    Visual Effects for Animation - presented by DreamWorks Animation
    Speaker: Ron Henderson (Bio)

    Ron Henderson manages the FX Tools group at DreamWorks Animation, where he is responsible for developing physical simulation and procedural modeling tools. These systems have been used for key visual effects in recent films such as Kung Fu Panda and Monsters vs. Aliens (March 2009).

    Prior to joining DreamWorks in 2002 he was a senior scientist at Caltech with a joint appointment to the Applied Math and Aeronautics departments, where he worked on efficient techniques for the direct numerical simulation of fluid turbulence.

    Abstract:
    In this webinar, Ron Henderson will show examples of visual effects, from hair and feathers to smoke and fire, from a variety of DreamWorks Animation feature films. He will discuss in general terms the kinds of techniques used to achieve particular visual effects. Finally, Henderson will show a detailed breakdown of the dam-breaking scene from Madagascar: Escape 2 Africa, demonstrating how different elements of key frame animation, simulation, and rendering are combined in a real production shot.

  • December 1, 2009
    A Quick and Easy Way to Parallelize a Legacy Codebase with Intel® Threading Building Blocks (TBBs)
    Speaker: Bernard Laberge, Avid, Senior Principal Engineer (Bio)

    Bernard Laberge is a senior principal engineer in the video editors division at Avid. During his seven years with the company he has been actively involved in the replacement of the legacy video processing engines used by Avid editors with a common hardware-abstracted, component-based video processing engine currently running on the CPU with SIMD optimized code, GPU, and dedicated hardware.

    Abstract:
    Learn how to overcome the limitations of a thread-based scheduler, including dealing with the absence of recursive parallelism support and the inefficient handling of unbalanced processing load. Bernard Laberge addresses how Avid resolved the expensive refactoring of their thread-based scheduler into a task-based solution by choosing Intel® Threading Building Blocks (TBBs). He explores how Avid was able to easily integrate the Intel TBBs into their video editor applications and more than 5 million lines of code.

  • December 15, 2009
    How to Use Intel® Parallel Studio to Streamline Code Development in a Multicore Environment
    Speaker: Matt Dunbar, Director for Performance Technology, SIMULIA (Bio)

    Matt Dunbar is the director for performance technology at SIMULIA. Since joining the company in 1993, he has worked on parallelization of the Abaqus suite of products, initially for shared memory architectures and more recently for distributed memory architectures. Dunbar has also been intimately involved in selecting both the hardware and software tools used in the development of the Abaqus product line.

    Abstract:
    Resolve elusive, costly multithreading errors quickly and efficiently with Intel® Parallel Studio. While many coding problems that lead to bugs in software applications are typically straightforward logic errors, errors in managing memory and in multithreading code can sometimes take weeks to months to diagnose and fix. Matt Dunbar explores how and why taking advantage of multicore processors through multithreaded code is critical for compute-intensive applications. While spotlighting his work on SIMULIA's Abaqus finite element solver, Dunbar addresses the need for multicore execution and shares his experiences using Intel Parallel Studio to streamline code development in a multicore environment.