The Community for Technology Leaders
Computer Architecture and High Performance Computing, Symposium on (2004)
Foz do Igua?u, PR - Brazil
Oct. 27, 2004 to Oct. 29, 2004
ISSN: 1550-6533
ISBN: 0-7695-2240-8
TABLE OF CONTENTS

Cache filtering techniques to reduce the negative impact of useless speculative memory references on processor performance (PDF)

O. Mutlu , Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
H. Kim , Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
D.N. Armstrong , Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
Y.N. Patt , Dept. of Electr. & Comput. Eng., Texas Univ., Austin, TX, USA
pp. 2-9

Self-monitored adaptive cache warm-up for microprocessor simulation (PDF)

Y. Luo , Texas Univ., Austin, TX, USA
L.K. John , Texas Univ., Austin, TX, USA
pp. 10-17

The eDRAM based L3-cache of the BlueGene/L supercomputer processor node (PDF)

M. Ohmacht , IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
D. Hoenicke , IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
R. Haring , IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
A. Gara , IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
pp. 18-22

A study of errant pipeline flushes caused by value misspeculation (PDF)

D. Balkan , Dept. of Comput. Sci., SUNY, Binghamton, NY, USA
pp. 32-39

Design space exploration using T&D-Bench (PDF)

S.N. Soares , PGCC, Univ. Fed. do Rio Grande do Sul, Brazil
pp. 40-47

Value predictors for reuse through speculation on traces (PDF)

M.L. Pilla , Comput. Sci. Inst., Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
P.O.A. Navaux , Comput. Sci. Inst., Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
pp. 48-55

IATO: a flexible EPIC simulation environment (PDF)

A. Darsch , Campus de Beaulieu, IRISA, Rennes, France
A. Seznec , Campus de Beaulieu, IRISA, Rennes, France
pp. 58-65

ArchC: a systemC-based architecture description language (PDF)

S. Rigo , Comput. Syst. Lab., Campinas Univ., Brazil
G. Araujo , Comput. Syst. Lab., Campinas Univ., Brazil
M. Bartholomeu , Comput. Syst. Lab., Campinas Univ., Brazil
R. Azevedo , Comput. Syst. Lab., Campinas Univ., Brazil
pp. 66-73

Optimizations for compiled simulation using instruction type information (PDF)

M. Bartholomeu , Inst. of Comput., Univ. of Campinas, Brazil
R. Azevedo , Inst. of Comput., Univ. of Campinas, Brazil
S. Rigo , Inst. of Comput., Univ. of Campinas, Brazil
G. Araujo , Inst. of Comput., Univ. of Campinas, Brazil
pp. 74-81

High performance communication system based on generic programming (PDF)

A.L.G. Sanches , Software & Hardware Integration Lab., Univ. Fed. de Santa Catarina, Florianopolis, Brazil
F.R. Secco , Software & Hardware Integration Lab., Univ. Fed. de Santa Catarina, Florianopolis, Brazil
A.A. Frohlich , Software & Hardware Integration Lab., Univ. Fed. de Santa Catarina, Florianopolis, Brazil
pp. 92-99

Performance evaluation of a prototype distributed NFS server (PDF)

R.B. Avila , Inst. de Informatica, Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
P.O.A. Navaux , Inst. de Informatica, Univ. Fed. do Rio Grande do Sul, Porto Alegre, Brazil
pp. 100-105

Scheduling in Bag-of-Task grids: the PAUA case (PDF)

N. Andrade , Univ. Fed. de Campina Grande, Brazil
W. Cirne , Univ. Fed. de Campina Grande, Brazil
L. Costa , Univ. Fed. de Campina Grande, Brazil
D. Paranhos , Univ. Fed. de Campina Grande, Brazil
E. Santos-Neto , Univ. Fed. de Campina Grande, Brazil
F. Brasileiro , Univ. Fed. de Campina Grande, Brazil
pp. 124-131

A parallel engine for graphical interactive molecular dynamics simulations (PDF)

E.R. Rodrigues , Post-graduation Program in Appl. Comput., Brazilian Inst. for Space Res., Brazil
pp. 150-157

Combining a shared-memory high performance computer and a heterogeneous cluster for the simulation of light interaction with human skin (PDF)

Aravind Krishnaswamy , Natural Phenomena Simulation Group, Waterloo Univ., Ont., Canada
G.V.G. Baranoski , Natural Phenomena Simulation Group, Waterloo Univ., Ont., Canada
pp. 166-171

Revisiting a BSP/CGM transitive closure algorithm (PDF)

E.N. Caceres , Dept. de Computacao e Estatistica, Fed. Univ. of Mato Grosso do Sul, Campo Grande, Brazil
C.C.A. Vieira , Dept. de Computacao e Estatistica, Fed. Univ. of Mato Grosso do Sul, Campo Grande, Brazil
pp. 174-179

Improving parallel execution time of sorting on heterogeneous clusters (PDF)

C. Cerin , LaRIA, Univ. de Picardie Jules Verne, Amiens, France
pp. 180-187

Graph partitioning with the Party library: helpful-sets in practice (PDF)

B. Monien , Fakultat fur Elektrotechnik, Paderborn Univ., Germany
S. Schamberger , Fakultat fur Elektrotechnik, Paderborn Univ., Germany
pp. 198-205

On the combined scheduling of malleable and rigid jobs (PDF)

J. Hungershofer , Paderborn Center for Parallel Comput., Germany
pp. 206-213

A cluster-based strategy for scheduling task on heterogeneous processors (PDF)

C. Boeres , Inst. de Computacao, Univ. Fed. Fluminense, Niteroi, Brazil
J.V. Filho , Inst. de Computacao, Univ. Fed. Fluminense, Niteroi, Brazil
V.E.F. Rebello , Inst. de Computacao, Univ. Fed. Fluminense, Niteroi, Brazil
pp. 214-221

Characterizing the dynamic behavior of workload execution in SVM systems (PDF)

S. Petit , Dep. Informatica de Sistemas y Computadores, Univ. Politecnica de Valencia, Spain
J. Sahuquillo , Dep. Informatica de Sistemas y Computadores, Univ. Politecnica de Valencia, Spain
A. Pont , Dep. Informatica de Sistemas y Computadores, Univ. Politecnica de Valencia, Spain
pp. 230-237

A performance evaluation of ARM ISA extension for elliptic curve cryptography over binary finite fields (PDF)

S. Bartolini , Dept. of Inf. Eng., Siena Univ., Italy
I. Branovic , Dept. of Inf. Eng., Siena Univ., Italy
R. Giorgi , Dept. of Inf. Eng., Siena Univ., Italy
E. Martinelli , Dept. of Inf. Eng., Siena Univ., Italy
pp. 238-245

PEMPIs: a new methodology for modeling and prediction of MPI programs performance (PDF)

E.T. Midorikawa , Dept. of Comput. Eng. & Digital Syst., Sao Paulo Univ., Brazil
H.M. de Oliveira , Dept. of Comput. Eng. & Digital Syst., Sao Paulo Univ., Brazil
J.M. Laine , Dept. of Comput. Eng. & Digital Syst., Sao Paulo Univ., Brazil
pp. 246-253

Performance characterisation of intra-cluster collective communications (PDF)

L.A. Barchet-Estefanel , ID-IMAG Lab., APACHE Project, St. Martin, France
G. Mounie , ID-IMAG Lab., APACHE Project, St. Martin, France
pp. 254-261
Session 6: High Performance Applications
Session 7: Parallel and Distributed Algorithms

Revisiting a BSP/CGM Transitive Closure Algorithm (Abstract)

Edson Norberto C?ceres , Federal University of Mato Grosso do Sul, Brazil
Cristiano Costa Argemom Vieira , Federal University of Mato Grosso do Sul, Brazil
pp. 174-179

Improving Parallel Execution Time of Sorting on Heterogeneous Clusters (Abstract)

Christophe C?rin , Universit? de Picardie Jules Verne, France
Michel Koskas , Universit? de Picardie Jules Verne, France
Hazem Fkaier , ?cole Sup?rieure des Sciences et Techniques de Tunis, Tunisie
Mohamed Jemni , ?cole Sup?rieure des Sciences et Techniques de Tunis, Tunisie
pp. 180-187

An Approach for Pre Runtime Scheduling in Embedded Hard Real Time Systems with Power Constraints (Abstract)

Eduardo Tavares , Federal University of Pernambuco (UFPE)
Raimundo Barreto , Federal University of Amazonas (UFAM)
Meuse Oliveira J?nior , Federal University of Pernambuco (UFPE)
Paulo Maciel , Federal University of Pernambuco (UFPE)
Mar?lia Neves , Federal University of Pernambuco (UFPE)
Ricardo Lima , Pernambuco State University
pp. 188-195
Session 8: Load Balancing and Scheduling

Graph Partitioning with the Party Library: Helpful-Sets in Practice (Abstract)

Burkhard Monien , Universit?t Paderborn
Stefan Schamberger , Universit?t Paderborn
pp. 198-205

On the Combined Scheduling of Malleable and Rigid Jobs (Abstract)

Jan Hungersh?fer , Paderborn Center for Parallel Computing, Germany
pp. 206-213

A Cluster-based Strategy for Scheduling Task on Heterogeneous Processors (Abstract)

Cristina Boeres , Universidade Federal Fluminense (UFF), Brazil
Jos? Viterbo Filho , Universidade Federal Fluminense (UFF), Brazil
Vinod E. F. Rebello , Universidade Federal Fluminense (UFF), Brazil
pp. 214-221
Session 9: Benchmarking, Performance Measurements and Analysis

Characterizing the Dynamic Behavior of Workload Execution in SVM systems (Abstract)

Salvador Petit , Universidad Polit?cnica de Valencia, Spain
Julio Sahuquillo , Universidad Polit?cnica de Valencia, Spain
Ana Pont , Universidad Polit?cnica de Valencia, Spain
David Kaeli , Northeastern University, Boston, Massachusetts
pp. 230-237

A Performance Evaluation of ARM ISA Extension for Elliptic Curve Cryptography over Binary Finite Fields (Abstract)

Sandro Bartolini , University of Siena, Italy
Irina Branovic , University of Siena, Italy
Roberto Giorgi , University of Siena, Italy
Enrico Martinelli , University of Siena, Italy
pp. 238-245

PEMPIs: A New Methodology for Modeling and Prediction of MPI Programs Performance (Abstract)

Edson T. Midorikawa , Polytechnic School, University of S?o Paulo, Brazil
Helio M. de Oliveira , Polytechnic School, University of S?o Paulo, Brazil
Jean M. Laine , Polytechnic School, University of S?o Paulo, Brazil
pp. 246-253

Performance Characterisation of Intra-Cluster Collective Communications (Abstract)

Luiz Angelo Barchet-Estefanel , ID - IMAG Laboratory, France
Gr?gory Mouni? , ID - IMAG Laboratory, France
pp. 254-261

Author Index (PDF)

pp. 263-264
94 ms
(Ver 3.3 (11022016))