Pattern Name: Introduction |
AlgorithmStructure Design Space |
This note is the introduction to the AlgorithmStructure design space. The patterns in this design space help the programmer organize exploitable concurrency (i.e., collections of tasks and shared data) into parallel algorithms.
To create a parallel program, you need two things. First, you must have concurrency: It must be possible to decompose your problem into tasks that can execute simultaneously. Second, you must be able to map these tasks onto units of execution (usually threads or processes) so you can take advantage of the concurrency in a parallel program.
The first of these requirements -- finding exploitable concurrency -- is addressed in the FindingConcurrency design space. After you work through the patterns in the FindingConcurrency design space, you will have decomposed your problem into tasks, ordered groups of tasks, and data that could be concurrently updated by the tasks.
The second requirement -- how to take those tasks/data/groups and map them onto units of execution -- is addressed in the present design space, which we call AlgorithmStructure. In a way, in this design space we are refining the parallel algorithm design from abstract concurrency into a form that can be realized on the target machine.
The patterns in this design space are used after the designer has decomposed the problem to expose exploitable concurrency; i.e., he or she has broken the problem down into one or more collections of tasks, task-local data, and global or shared data. Decomposing a problem to identify concurrency was addressed in the FindingConcurrency design space and will not be addressed here.
It is equally important to understand the issues not addressed by these patterns. At this stage of the pattern language, we are still dealing with high-level algorithms. Some issues pertaining to the final program and the target machine are considered, but for the most part, we are still dealing with high-level descriptions of the algorithms and not with specific implementation issues. These issues will be address in the lower two design spaces, SupportingStructures and ImplementationMechanisms.
Observe that the goal of this note is to introduce the patterns in the AlgorithmStructure design space. We won't discuss how the patterns are used or even how you select a given pattern. That is the topic of the ChooseStructure pattern.
The patterns in this design space fall into three broad groups. The grouping reflects the major organizing principle used by the designer in understanding the parallel algorithm. For example, if the concurrency is best understood in terms of the way the data is decomposed, that data decomposition becomes the "major organizing principle". This idea will become clearer as we start working with the patterns. For now, we just want to introduce the major patterns in this design space and briefly discuss the role of composition or hierarchy in applying these patterns.
These patterns are used when the ordering of groups of tasks is the major organizing principle for the parallel algorithm. This group has two members, reflecting two ways task groups can be ordered. One choice represents "regular" orderings that do not change during the algorithm; the other represents "irregular" orderings that are more dynamic and unpredictable.
These patterns are those for which the tasks themselves are the best organizing principle. There are many ways to work with such "task-parallel" problems, making this the largest pattern group.
These patterns are those for which the decomposition of the data is the major organizing principle in understanding the concurrency. There are two patterns in this group, differing in how the decomposition is structured (linearly in each dimension or recursively).
This design space also includes the ChooseStructure pattern, which addresses the question of how to select an appropriate pattern.
Finally, at this point we need to discuss the hierarchical nature of the AlgorithmStructure patterns. In many cases, a single AlgorithmStructure pattern will not let you take advantage of all the available concurrency. For example, a problem may naturally map into a pipeline of task groups. This is usually very coarse-grained, however, and to take full advantage of the concurrency available in the problem, the task groups themselves may need to be parallelized. In all likelihood, these task groups would map onto other AlgorithmStructure patterns, leading to a hierarchical design with the pipeline pattern at the top organizing the task-groups and other patterns within each task group.
A second example may help drive this important point home. The Accelerated Strategic Computing Initiative (ASCI) is a large program to use extreme-scale computers with thousands of processors to solve complex simulation problems. When you need to run on so many processors, you can't let any concurrency go to waste. Hence, most of the ASCI algorithms are hierarchical, the most common combinations being a GeometricDecomposition pattern for the high-level program organization and a ProtectedDependencies pattern for the update of local data.