ACM Home Page
Please provide us with feedback. Feedback
Feedback-directed thread scheduling with memory considerations
Full text PdfPdf (398 KB)
Source
High Performance Distributed Computing archive
Proceedings of the 16th international symposium on High performance distributed computing table of contents
Monterey, California, USA
SESSION: Scheduling table of contents
Pages: 97 - 106  
Year of Publication: 2007
ISBN:978-1-59593-673-8
Authors
Fengguang Song  University of Tennessee
Shirley Moore  University of Tennessee
Jack Dongarra  University of Tennessee
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 57,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1272366.1272380
What is a DOI?

ABSTRACT

This paper describes a novel approach to generate an optimized schedule to run threads on distributed shared memory (DSM) systems. The approach relies upon a binary instrumentation tool to automatically acquire the memory sharingrelationship between user-level threads by analyzing their memory trace. We introduce the concept of Affinity Graph to model the relationship. Expensive I/O for large trace files is completely eliminated by using an online graph creation scheme. We apply the technique of hierarchical graph partitioning and thread reordering to the affinity graph to determine an optimal thread schedule. We have performed experiments on an SGI Altix system. The experimental results show that our approach is able to reduce the totalexecution time by 10% to 38% for a variety of applications through the maximization of the data reuse within a single processor, minimization of the data sharing between processors, and a good load balance.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
T. Davis. University of Florida sparse matrix collection. In http://www.cise.ufl.edu/research/sparse, 1997.
 
2
 
3
 
4
B. Hendrickson and R. Leland. The Chaco user's guide: Version 2.0. In Sandia Tech Report SAND94-2692, 1994.
5
 
6
7
8
9
 
10
11
 
12
J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera. Improving the locality of the sparse matrix-vector production shared memory multiprocessors. In PDP, pages 66--71. IEEE Computer Society, 2004.
 
13
 
14
15
 
16
 
17
F. Song, S. Moore, and J. Dongarra. Modeling of L2 cache behavior for thread-parallel scientific programs on Chip Multi-Processors. In UT Computer Science Technical Report UT-CS-06-583, 2006.
 
18
SGI. Linux resource administration guide. In SGI Techpubs Library 007-4413-011, 2006.
 
19

Collaborative Colleagues:
Fengguang Song: colleagues
Shirley Moore: colleagues
Jack Dongarra: colleagues