SC Conference (2007)
Reno, Nevada
Nov. 10, 2007 to Nov. 16, 2007
ISBN: 978-1-59593-764-3
TABLE OF CONTENTS
Front Matter

Front Matter (PDF)

pp. i-xix
Papers

Programming bits and atoms (Abstract)

Neil Gershenfeld , Massachusetts Institute of Technology
pp. 1

A preliminary investigation of a neocortex model implementation on the Cray XD1 (Abstract)

Kenneth L. Rice , Clemson University, Clemson, SC
Christopher N. Vutsinas , Clemson University, Clemson, SC
Tarek M. Taha , Clemson University, Clemson, SC
pp. 1-8

Anatomy of a cortical simulator (Abstract)

Rajagopal Ananthanarayanan , IBM Almaden Research Center, San Jose, CA
Dharmendra S. Modha , IBM Almaden Research Center, San Jose, CA
pp. 1-12

Large-scale maximum likelihood-based phylogenetic analysis on the IBM BlueGene/L (Abstract)

Michael Ott , Technical University of Munich
Jaroslaw Zola , Iowa State University
Alexandros Stamatakis , School of Computer and Communication Sciences
Srinivas Aluru , Iowa State University
pp. 1-11

Age-based packet arbitration in large-radix k-ary n-cubes (Abstract)

Dennis Abts , Cray Inc., Chippewa Falls, Wisconsin
Deborah Weisser , Google Inc., Mountain View, California
pp. 1-11

Evaluating network information models on resource efficiency and application performance in lambda-grids (Abstract)

Nut Taesombut , University of California, La Jolla, CA
Andrew A. Chien , University of California, La Jolla, CA
pp. 1-12

Virtual machine aware communication libraries for high performance computing (Abstract)

Wei Huang , The Ohio State University, Columbus, OH
Matthew J. Koop , The Ohio State University, Columbus, OH
Qi Gao , The Ohio State University, Columbus, OH
Dhabaleswar K. Panda , The Ohio State University, Columbus, OH
pp. 1-12

Investigation of leading HPC I/O performance using a scientific-application derived benchmark (Abstract)

Julian Borrill , Lawrence Berkeley National Laboratory, Berkeley, CA
Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Hongzhang Shan , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Automatic resource specification generation for resource selection (Abstract)

Richard Huang , University of California, San Diego
Henri Casanova , University of Hawai'i at Manoa
Andrew A. Chien , University of California, San Diego
pp. 1-11

Performance and cost optimization for multiple large-scale grid workflow applications (Abstract)

Rubing Duan , University of Innsbruck
Radu Prodan , University of Innsbruck
Thomas Fahringer , University of Innsbruck
pp. 1-12

Inter-operating grids through delegated matchmaking (Abstract)

Alexandru Iosup , Delft University of Technology, Delft, NL
Dick H. J. Epema , Delft University of Technology, Delft, NL
Todd Tannenbaum , University of Wisconsin, Madison, WI, US
Matthew Farrellee , University of Wisconsin, Madison, WI, US
Miron Livny , University of Wisconsin, Madison, WI, US
pp. 1-12

Automatic software interference detection in parallel applications (Abstract)

Vahid Tabatabaee , University of Maryland at College Park
Jeffrey K. Hollingsworth , University of Maryland at College Park
pp. 1-12

DMTracker: finding bugs in large-scale parallel programs by detecting anomaly in data movements (Abstract)

Qi Gao , The Ohio State University, Columbus, OH
Feng Qin , The Ohio State University, Columbus, OH
Dhabaleswar K. Panda , The Ohio State University, Columbus, OH
pp. 1-12

Scalable security for petascale parallel file systems (Abstract)

Andrew W. Leung , University of California, Santa Cruz, CA
Ethan L. Miller , University of California, Santa Cruz, CA
Stephanie Jones , University of California, Santa Cruz, CA
pp. 1-12

The Cray BlackWidow: a highly scalable vector multiprocessor (Abstract)

Dennis Abts , Cray Inc., Chippewa Falls, Wisconsin
Abdulla Bataineh , Cray Inc., Chippewa Falls, Wisconsin
Steve Scott , Cray Inc., Chippewa Falls, Wisconsin
Greg Faanes , Cray Inc., Chippewa Falls, Wisconsin
Jim Schwarzmeier , Cray Inc., Chippewa Falls, Wisconsin
Eric Lundberg , Cray Inc., Chippewa Falls, Wisconsin
Tim Johnson , Cray Inc., Chippewa Falls, Wisconsin
Mike Bye , Cray Inc., Chippewa Falls, Wisconsin
Gerald Schwoerer , Cray Inc., Chippewa Falls, Wisconsin
pp. 1-12

GRAPE-DR: 2-Pflops massively-parallel computer with 512-core, 512-Gflops processor chips for scientific computing (Abstract)

Junichiro Makino , National Astronomical Observatory of Japan, Tokyo, Japan
Kei Hiraki , The University of Tokyo, Tokyo, Japan
Mary Inaba , The University of Tokyo, Tokyo, Japan
pp. 1-11

A case for low-complexity MP architectures (Abstract)

Håkan Zeffer , Uppsala University, Uppsala, Sweden
Erik Hagersten , Uppsala University, Uppsala, Sweden
pp. 1-12

Variable latency caches for nanoscale processor (Abstract)

Serkan Ozdemir , Northwestern University, Evanston, IL
Arindam Mallik , Northwestern University, Evanston, IL
Ja Chun Ku , Northwestern University, Evanston, IL
Gokhan Memik , Northwestern University, Evanston, IL
Yehea Ismail , Northwestern University, Evanston, IL
pp. 1-10

Data access history cache and associated data prefetching mechanisms (Abstract)

Yong Chen , Illinois Institute of Technology, Chicago, IL
Surendra Byna , Illinois Institute of Technology, Chicago, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL and Fermi National Accelerator Laboratory, Batavia, IL
pp. 1-12

Scaling performance of interior-point method on large-scale chip multiprocessor system (Abstract)

Mikhail Smelyanskiy , Microprocessor Technology Labs, Intel
Victor W Lee , Microprocessor Technology Labs, Intel
Daehyun Kim , Microprocessor Technology Labs, Intel
Anthony D Nguyen , Microprocessor Technology Labs, Intel
Pradeep Dubey , Microprocessor Technology Labs, Intel
pp. 1-11

Data exploration of turbulence simulations using a database cluster (Abstract)

Eric Perlman , Johns Hopkins University, Baltimore, MD
Randal Burns , Johns Hopkins University, Baltimore, MD
Yi Li , Johns Hopkins University, Baltimore, MD
Charles Meneveau , Johns Hopkins University, Baltimore, MD
pp. 1-11

Parallel hierarchical visualization of large time-varying 3D vector fields (Abstract)

Hongfeng Yu , University of California at Davis
Chaoli Wang , University of California at Davis
Kwan-Liu Ma , University of California at Davis
pp. 1-12

Low-constant parallel algorithms for finite element simulations using linear octrees (Abstract)

Hari Sundar , University of Pennsylvania, Philadelphia, PA
Rahul S. Sampath , University of Pennsylvania, Philadelphia, PA
Santi S. Adavani , University of Pennsylvania, Philadelphia, PA
Christos Davatzikos , University of Pennsylvania, Philadelphia, PA
George Biros , University of Pennsylvania, Philadelphia, PA
pp. 1-12

Noncontiguous locking techniques for parallel file systems (Abstract)

Avery Ching , Northwestern University, Evanston, Illinois
Wei-keng Liao , Northwestern University, Evanston, Illinois
Alok Choudhary , Northwestern University, Evanston, Illinois
Robert Ross , Argonne National Laboratory, Argonne, IL
Lee Ward , Sandia National Laboratories, Albuquerque, NM
pp. 1-12

Integrating parallel file systems with object-based storage devices (Abstract)

Ananth Devulapalli , Ohio Supercomputer Center
Dennis Dalessandro , Ohio Supercomputer Center
Pete Wyckoff , Ohio Supercomputer Center
Nawab Ali , The Ohio State University
P. Sadayappan , The Ohio State University
pp. 1-10

Evaluation of active storage strategies for the lustre parallel file system (Abstract)

Juan Piernas , Pacific Northwest National Laboratory, Richland, WA
Jarek Nieplocha , Pacific Northwest National Laboratory, Richland, WA
Evan J. Felix , Pacific Northwest National Laboratory, Richland, WA
pp. 1-10

The ghost in the machine: observing the effects of kernel operation on parallel application performance (Abstract)

Aroon Nataraj , University of Oregon, Eugene, OR
Alan Morris , University of Oregon, Eugene, OR
Allen D. Malony , University of Oregon, Eugene, OR
Matthew Sottile , Los Alamos National Lab
Pete Beckman , National Lab
pp. 1-12

PNMPI tools: a whole lot greater than the sum of their parts (Abstract)

Martin Schulz , Lawrence Livermore National Laboratory, Livermore, CA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-10

Multi-threading and one-sided communication in parallel LU factorization (Abstract)

Parry Husbands , Lawrence Berkeley National Laboratory, Berkeley, CA
Katherine Yelick , University of California at Berkeley, Berkeley, CA
pp. 1-10

Workstation capacity tuning using reinforcement learning (Abstract)

Aharon Bar-Hillel , Intel Research Israel
Amir Di-Nur , Intel Inc.
Liat Ein-Dor , Intel Research Israel
Ran Gilad-Bachrach , Intel Research Israel
Yossi Ittach , Intel Research Israel
pp. 1-11

Anomaly detection and diagnosis in grid environments (Abstract)

Lingyun Yang , University of Chicago, Chicago, IL
Chuang Liu , Microsoft, Redmond, WA
Jennifer M. Schopf , Argonne National Laboratory, Argonne, IL
Ian Foster , University of Chicago, Chicago, IL and Argonne National Laboratory, Argonne, IL
pp. 1-9

User-friendly and reliable grid computing based on imperfect middleware (Abstract)

Rob V. van Nieuwpoort , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Thilo Kielmann , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Henri E. Bal , Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
pp. 1-11

Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP (Abstract)

P. Balaji , Argonne National Laboratory
W. Feng , Virginia Tech
S. Bhagvat , Dell Inc.
D. K. Panda , Ohio State University
R. Thakur , Argonne National Laboratory
W. Gropp , Argonne National Laboratory
pp. 1-12

Evaluating NIC hardware requirements to achieve high message rate PGAS support on multi-core processors (Abstract)

Keith D. Underwood , Sandia National Laboratories, Albuquerque, NM
Michael J. Levenhagen , Sandia National Laboratories, Albuquerque, NM
Ron Brightwell , Sandia National Laboratories, Albuquerque, NM
pp. 1-10

High-performance ethernet-based communications for future multi-core processors (Abstract)

Michael Schlansker , Hewlett-Packard Labs/Advanced Architecture Lab
Nagabhushan Chitlur , Intel Corporation/Corporate Technology Group
Erwin Oertli , VMware
Paul M. Stillwell , Intel Corporation/Corporate Technology Group
Linda Rankin , Intel Corporation/Corporate Technology Group
Dennis Bradford , Intel Corporation/Corporate Technology Group
Richard J. Carter , Hewlett-Packard Labs/Advanced Architecture Lab
Jayaram Mudigonda , Hewlett-Packard Labs/Advanced Architecture Lab
Nathan Binkert , Hewlett-Packard Labs/Advanced Architecture Lab
Norman P. Jouppi , Hewlett-Packard Labs/Advanced Architecture Lab
pp. 1-12

Optimization of sparse matrix-vector multiplication on emerging multicore platforms (Abstract)

Samuel Williams , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
Leonid Oliker , Lawrence Berkeley National Laboratory, Berkeley, CA
Richard Vuduc , Lawrence Livermore National Laboratory, Livermore, CA
John Shalf , Lawrence Berkeley National Laboratory, Berkeley, CA
Katherine Yelick , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California at Berkeley, Berkeley, CA
James Demmel , University of California at Berkeley, Berkeley, CA
pp. 1-12

Cray XT4: an early evaluation for petascale scientific simulation (Abstract)

Sadaf R. Alam , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Jeffery A. Kuehn , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Richard F. Barrett , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Jeff M. Larkin , Cray Inc, Seattle, Washington
Mark R. Fahey , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Ramanan Sankaran , Oak Ridge National Laboratory, Oak Ridge, Tennessee
Patrick H. Worley , Oak Ridge National Laboratory, Oak Ridge, Tennessee
pp. 1-12

An adaptive mesh refinement benchmark for modern parallel programming languages (Abstract)

Tong Wen , IBM T. J. Watson Research Center, Hawthorne, NY
Jimmy Su , University of California, Berkeley, CA
Phillip Colella , Lawrence Berkeley National Laboratory, Berkeley, CA
Katherine Yelick , University of California, Berkeley, CA
Noel Keen , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-12

Exploring event correlation for failure prediction in coalitions of clusters (Abstract)

Song Fu , Wayne State University, Detroit, MI
Cheng-Zhong Xu , Wayne State University, Detroit, MI
pp. 1-12

Advanced data flow support for scientific grid workflow applications (Abstract)

Jun Qin , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-12

Falkon: a Fast and Light-weight tasK executiON framework (Abstract)

Ioan Raicu , University of Chicago, IL
Yong Zhao , University of Chicago, IL
Catalin Dumitrescu , University of Chicago, IL
Ian Foster , University of Chicago and Argonne National Laboratory, Argonne, IL
Mike Wilde , University of Chicago and Argonne National Laboratory, Argonne, IL
pp. 1-12

RobuSTore: a distributed storage architecture with robust and high performance (Abstract)

Huaxia Xia , University of California, San Diego, La Jolla, CA
Andrew A. Chien , University of California, San Diego, La Jolla, CA
pp. 1-11

A user-level secure grid file system (Abstract)

Ming Zhao , University of Florida
Renato J. Figueiredo , University of Florida
pp. 1-11

Efficient gather and scatter operations on graphics processors (Abstract)

Bingsheng He , Hong Kong Univ. of Science and Technology
Naga K. Govindaraju , Microsoft Corp.
Qiong Luo , Hong Kong Univ. of Science and Technology
Burton Smith , Microsoft Corp.
pp. 1-12

A genetic algorithms approach to modeling the performance of memory-bound computations (Abstract)

Mustafa M Tikir , San Diego Supercomputer Center, La Jolla, CA
Laura Carrington , San Diego Supercomputer Center, La Jolla, CA
Erich Strohmaier , Lawrence Berkeley National Laboratory, One Cyclotron Road, CA
Allan Snavely , San Diego Supercomputer Center, La Jolla, CA
pp. 1-12

Performance under failures of high-end computing (Abstract)

Ming Wu , Illinois Institute of Technology, Chicago, Illinois
Xian-He Sun , Illinois Institute of Technology, Chicago, Illinois and Fermi National Accelerator Laboratory, Batavia, Illinois
Hui Jin , Illinois Institute of Technology, Chicago, Illinois
pp. 1-11

Bounding energy consumption in large-scale MPI programs (Abstract)

Barry Rountree , University of Georgia, Athens, GA
David K. Lowenthal , University of Georgia, Athens, GA
Shelby Funk , University of Georgia, Athens, GA
Vincent W. Freeh , North Carolina State University, Raleigh, NC
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Livermore, CA
Martin Schulz , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-9

Application development on hybrid systems (Abstract)

Roger D. Chamberlain , Washington University, St. Louis, Missouri
Mark A. Franklin , Washington University, St. Louis, Missouri
Eric J. Tyson , Washington University, St. Louis, Missouri
Jeremy Buhler , Washington University, St. Louis, Missouri
Saurabh Gayen , Washington University, St. Louis, Missouri
Patrick Crowley , Washington University, St. Louis, Missouri
James H. Buckley , Washington University, St. Louis, Missouri
pp. 1-10

Multi-level tiling: M for the price of one (Abstract)

DaeGon Kim , Colorado State University, Fort Collins, Colorado
Lakshminarayanan Renganarayanan , Colorado State University, Fort Collins, Colorado
Dave Rostron , Colorado State University, Fort Collins, Colorado
Sanjay Rajopadhye , Colorado State University, Fort Collins, Colorado
Michelle Mills Strout , Colorado State University, Fort Collins, Colorado
pp. 1-12

Implementation and performance analysis of non-blocking collective operations for MPI (Abstract)

Torsten Hoefler , Indiana University, Bloomington, IN
Andrew Lumsdaine , Indiana University, Bloomington, IN
Wolfgang Rehm , Chemnitz University of Technology, Chemnitz, Germany
pp. 1-10

Efficient operating system scheduling for performance-asymmetric multi-core architectures (Abstract)

Tong Li , Intel Corporation
Dan Baumberger , Intel Corporation
David A. Koufaty , Intel Corporation
Scott Hahn , Intel Corporation
pp. 1-11

A job scheduling framework for large computing farms (Abstract)

Gabriele Capannini , Information Science and Technologies Institute, Pisa, Italy
Ranieri Baraglia , Information Science and Technologies Institute, Pisa, Italy
Diego Puppin , Information Science and Technologies Institute, Pisa, Italy
Laura Ricci , Largo B. Pontecorvo, Pisa, Italy
Marco Pasquali , Information Science and Technologies Institute, Pisa, Italy
pp. 1-10

Optimizing center performance through coordinated data staging, scheduling and recovery (Abstract)

Zhe Zhang , North Carolina State University
Chao Wang , North Carolina State University
Sudharshan S. Vazhkudai , Oak Ridge National Laboratory
Xiaosong Ma , North Carolina State University and Oak Ridge National Laboratory
Gregory G. Pike , Oak Ridge National Laboratory
John W. Cobb , Oak Ridge National Laboratory
Frank Mueller , North Carolina State University
pp. 1-11

A 281 Tflops calculation for X-ray protein structure analysis with special-purpose computers MDGRAPE-3 (Abstract)

Yousuke Ohno , Nagoya University, Keio University and University of Fukui
Eiji Nishibori , Nagoya University, Keio University and University of Fukui
Tetsu Narumi , Nagoya University, Keio University and University of Fukui
Takahiro Koishi , Nagoya University, Keio University and University of Fukui
Tahir H. Tahirov , Nagoya University, Keio University and University of Fukui
Hideo Ago , Nagoya University, Keio University and University of Fukui
Masashi Miyano , Nagoya University, Keio University and University of Fukui
Ryutaro Himeno , Nagoya University, Keio University and University of Fukui
Toshikazu Ebisuzaki , Nagoya University, Keio University and University of Fukui
Makoto Sakata , Nagoya University, Keio University and University of Fukui
Makoto Taiji , Nagoya University, Keio University and University of Fukui
pp. 1-10

First-principles calculations of large-scale semiconductor systems on the earth simulator (Abstract)

Takahisa Ohno , Material Science Center (NIMS-CMSC), Tsukuba, Ibaraki, Japan
Takenori Yamamoto , Toho University, Funabashi, Chiba, Japan
Tatsunobu Kokubo , NEC Corporation, Fuchu, Tokyo, Japan
Akira Azami , NEC Informatec Systems, Ltd., Takatsu, Kawasaki, Japan
Yuta Sakaguchi , Advanced Soft Engineering, Inc., Chuo, Tokyo, Japan
Tsuyoshi Uda , AdvanceSoft Corporation, Minato, Tokyo, Japan
Takahiro Yamasaki , University of Tokyo, Tokyo, Japan
Daisuke Fukata , NEC Soft, Ltd., Koto, Tokyo, Japan
Junichiro Koga , AdvanceSoft Corporation, Minato, Tokyo, Japan
pp. 1-6

Extending stability beyond CPU millennium: a micron-scale atomistic simulation of Kelvin-Helmholtz instability (Abstract)

J. N. Glosli , Lawrence Livermore National Laboratory, Livermore, CA
D. F. Richards , Lawrence Livermore National Laboratory, Livermore, CA
K. J. Caspersen , Lawrence Livermore National Laboratory, Livermore, CA
R. E. Rudd , Lawrence Livermore National Laboratory, Livermore, CA
J. A. Gunnels , IBM Corporation, Yorktown Heights, New York
F. H. Streitz , Lawrence Livermore National Laboratory, Livermore, CA
pp. 1-11

WRF nature run (Abstract)

John Michalakes , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Josh Hacker , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Richard Loft , University Corporation for Atmospheric Research (UCAR), Boulder, CO
Michael O. McCracken , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
Allan Snavely , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
Nicholas J. Wright , PMaC Laboratory San Diego Supercomputer Center, La Jolla, CA
Tom Spelce , Lawrence Livermore National Laboratory, Livermore, CA
Brent Gorda , Lawrence Livermore National Laboratory, Livermore, CA
Robert Walkup , IBM Thomas J. Watson Research Center, Yorktown Heights, NY
pp. 1-6
Back Matter

Back Matter (PDF)

pp. z-z17