SC Conference (2008)
Austin, Texas
Nov. 15, 2008 to Nov. 21, 2008
ISBN: 978-1-4244-2835-9
TABLE OF CONTENTS

Entering the petaflop era: The architecture and performance of Roadrunner (PDF)

Kevin J. Barker , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Kei Davis , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Adolfy Hoisie , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Darren J. Kerbyson , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Mike Lang , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Scott Pakin , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
Jose C. Sancho , Performance and Architecture Lab (PAL), Los Alamos National Laboratory, USA
pp. 1-11

Efficient management of data center resources for Massively Multiplayer Online Games (PDF)

Vlad Nae , Institute for Computer Science, University of Innsbruck, Technikerstrasse 21a, A-6020, Austria
Alexandru Iosup , Parallel and Distributed Systems Group, Delft University of Technology, Mekelweg 4, 2628CD, The Netherlands
Stefan Podlipnig , Institute for Computer Science, University of Innsbruck, Technikerstrasse 21a, A-6020, Austria
Radu Prodan , Institute for Computer Science, University of Innsbruck, Technikerstrasse 21a, A-6020, Austria
Dick Epema , Parallel and Distributed Systems Group, Delft University of Technology, Mekelweg 4, 2628CD, The Netherlands
Thomas Fahringer , Institute for Computer Science, University of Innsbruck, Technikerstrasse 21a, A-6020, Austria
pp. 1-12

Performance optimization of TCP/IP over 10 Gigabit Ethernet by precise instrumentation (PDF)

Takeshi Yoshino , Google Japan Inc., Japan
Yutaka Sugawara , The University of Tokyo, Japan
Katsushi Inagami , The University of Tokyo, Japan
Junji Tamatsukuri , The University of Tokyo, Japan
Mary Inaba , The University of Tokyo, Japan
Kei Hiraki , The University of Tokyo, Japan
pp. 1-12

Asymmetric interactions in symmetric multi-core systems: Analysis, enhancements and evaluation (PDF)

T. Scogland , Dept. of Computer Science, Virginia Tech, USA
P. Balaji , Math. and Computer Science, Argonne National Lab, USA
W. Feng , Dept. of Computer Science, Virginia Tech, USA
G. Narayanaswamy , Dept. of Computer Science, Virginia Tech, USA
pp. 1-12

Dendro: Parallel algorithms for multigrid and AMR methods on 2:1 balanced octrees (PDF)

Rahul S. Sampath , Georgia Institute of Technology, Atlanta, 30332, USA
Santi S. Adavani , University of Pennsylvania, Philadelphia, 19104, USA
Hari Sundar , University of Pennsylvania, Philadelphia, 19104, USA
Ilya Lashuk , Georgia Institute of Technology, Atlanta, 30332, USA
George Biros , Georgia Institute of Technology, Atlanta, 30332, USA
pp. 1-12
Papers

High performance discrete Fourier transforms on graphics processors (Abstract)

Naga K. Govindaraju , Microsoft Corporation
Brandon Lloyd , Microsoft Corporation
Yuri Dotsenko , Microsoft Corporation
Burton Smith , Microsoft Corporation
John Manferdelli , Microsoft Corporation
pp. 1-12

Dynamically adapting file domain partitioning methods for collective I/O based on underlying parallel file system locking protocols (Abstract)

Wei-keng Liao , Northwestern University, Evanston, Illinois
Alok Choudhary , Northwestern University, Evanston, Illinois
pp. 1-12

A novel domain oriented approach for scientific Grid workflow composition (PDF)

Jun Qin , Institute of Computer Science, University of Innsbruck, Technikerstr. 21a, A-6020, Austria
Thomas Fahringer , Institute of Computer Science, University of Innsbruck, Technikerstr. 21a, A-6020, Austria
pp. 1-12
Papers

Bandwidth intensive 3-D FFT kernel for GPUs using CUDA (Abstract)

Akira Nukada , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Yasuhiko Ogata , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Toshio Endo , Tokyo Institute of Technology, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
Satoshi Matsuoka , Tokyo Institute of Technology, Tokyo, Japan and National Institute of Informatics, Tokyo, Japan and Japan Science and Technology Agency, Kawaguchi, Saitama, Japan
pp. 1-11

Using server-to-server communication in parallel file systems to simplify consistency and improve performance (Abstract)

Philip H. Carns , Argonne National Laboratory, Argonne, IL
Bradley W. Settlemyer , Clemson University, Clemson, SC
Walter B. Ligon , Clemson University, Clemson, SC
pp. 1-8

Nimrod/K: Towards massively parallel dynamic Grid workflows (PDF)

David Abramson , Faculty of Information Technology, Monash University, Clayton, 3800, Victoria, Australia
Colin Enticott , Faculty of Information Technology, Monash University, Clayton, 3800, Victoria, Australia
Ilkay Altinas , San Diego Supercomputer Center, 9500 Gilman Drive, MC 0505 La Jolla, CA 92093-0505, USA
pp. 1-11

SMARTMAP: Operating system support for efficient data sharing among processes on a multi-core processor (PDF)

Ron Brightwell , Scalable System Software Department, Sandia National Laboratories, Albuquerque, New Mexico 87185-1319, USA
Kevin Pedretti , Scalable System Software Department, Sandia National Laboratories, Albuquerque, New Mexico 87185-1319, USA
Trammell Hudson , Operating Systems Research, 1527 16th NW #5, Washington, DC 20036, USA
pp. 1-12

Lessons learned at 208K: Towards debugging millions of cores (PDF)

Gregory L. Lee , Lawrence Livermore National Laboratory, Computation Directorate, CA 94550, USA
Dong H. Ahn , Lawrence Livermore National Laboratory, Computation Directorate, CA 94550, USA
Dorian C. Arnold , University of Wisconsin, Computer Sciences Department, Madison, 53706, USA
Bronis R. de Supinski , Lawrence Livermore National Laboratory, Computation Directorate, CA 94550, USA
Matthew Legendre , University of Wisconsin, Computer Sciences Department, Madison, 53706, USA
Barton P. Miller , University of Wisconsin, Computer Sciences Department, Madison, 53706, USA
Martin Schulz , Lawrence Livermore National Laboratory, Computation Directorate, CA 94550, USA
Ben Liblit , University of Wisconsin, Computer Sciences Department, Madison, 53706, USA
pp. 1-9
Papers

Efficient management of data center resources for massively multiplayer online games (Abstract)

Vlad Nae , University of Innsbruck, Innsbruck, Austria
Alexandru Iosup , Delft University of Technology, Delft, The Netherlands
Stefan Podlipnig , University of Innsbruck, Innsbruck, Austria
Radu Prodan , University of Innsbruck, Innsbruck, Austria
Dick Epema , Delft University of Technology, Delft, The Netherlands
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-12

A novel migration-based NUCA design for Chip Multiprocessors (PDF)

Mahmut Kandemir , Pennsylvania State University, USA
Feihui Li , NVIDIA, USA
Mary Jane Irwin , Pennsylvania State University, USA
Seung Woo Son , Pennsylvania State University, USA
pp. 1-12

Communication Avoiding Gaussian elimination (PDF)

Laura Grigori , INRIA Saclay-Ile de France, Bat 490, Universite Paris-Sud 11, 91405 Orsay France
James W. Demmel , Computer Science Division and Mathematics Department, UC Berkeley, CA 94720-1776, USA
Hua Xiang , INRIA Saclay-Ile de France, Bat 490, Universite Paris-Sud 11, 91405 Orsay France
pp. 1-12
Papers

Feedback-controlled resource sharing for predictable eScience (Abstract)

Sang-Min Park , University of Virginia, Charlottesville, VA
Marty Humphrey , University of Virginia, Charlottesville, VA
pp. 1-11

Wide-area performance profiling of 10GigE and InfiniBand technologies (Abstract)

Nageswara S. V. Rao , Oak Ridge National Laboratory, Oak Ridge, TN
Weikuan Yu , Oak Ridge National Laboratory, Oak Ridge, TN
William R. Wing , Oak Ridge National Laboratory, Oak Ridge, TN
Stephen W. Poole , Oak Ridge National Laboratory, Oak Ridge, TN
Jeffrey S. Vetter , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-12

High-radix crossbar switches enabled by Proximity Communication (PDF)

Hans Eberle , Sun Microsystems, 16 Network Circle, Menlo Park, CA 94025, USA
Pedro J. Garcia , Universidad de Castilla-La Mancha, Escuela Superior de Ingeniería Informática, C.P. 02071, Albacete, Spain
Jose Flich , Universidad Politécnica de Valencia, Camino de Vera, s/n, C.P. 46022, Spain
Jose Duato , Universidad Politécnica de Valencia, Camino de Vera, s/n, C.P. 46022, Spain
Robert Drost , Sun Microsystems, 16 Network Circle, Menlo Park, CA 94025, USA
Nils Gura , Sun Microsystems, 16 Network Circle, Menlo Park, CA 94025, USA
David Hopkins , Sun Microsystems, 16 Network Circle, Menlo Park, CA 94025, USA
Wladek Olesinski , Sun Microsystems, 16 Network Circle, Menlo Park, CA 94025, USA
pp. 1-12
Papers

Efficient auction-based grid reservations using dynamic programming (Abstract)

Andrew Mutz , University of California Santa Barbara, Santa Barbara, CA
Rich Wolski , University of California Santa Barbara, Santa Barbara, CA
pp. 1-8

Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation (Abstract)

T. Scogland , Virginia Tech
P. Balaji , Argonne National Lab
W. Feng , Virginia Tech
G. Narayanaswamy , Virginia Tech
pp. 1-12

The role of MPI in development time: A case study (PDF)

Lorin Hochstein , USC Information Sciences Institute, USA
Forrest Shull , Fraunhofer Center Maryland, USA
Lynn B. Reid , University of Chicago, USA
pp. 1-10

New algorithm to enable 400+ TFlop/s sustained performance in simulations of disorder effects in high-Tc superconductors (PDF)

J. M. Larkin , Cray Incorporated, Oak Ridge TN 37831-6008, USA
G. Alvarez , Oak Ridge National Laboratory, TN 37831-6164, USA
D. E. Maxwell , Oak Ridge National Laboratory, TN 37831-6164, USA
M. Eisenbach , Oak Ridge National Laboratory, TN 37831-6164, USA
J. S. Meredith , Oak Ridge National Laboratory, TN 37831-6164, USA
M. S. Summers , Oak Ridge National Laboratory, TN 37831-6164, USA
J. Levesque , Cray Incorporated, Oak Ridge TN 37831-6008, USA
T. A. Maier , Oak Ridge National Laboratory, TN 37831-6164, USA
P. R. C. Kent , Oak Ridge National Laboratory, TN 37831-6164, USA
E. F. D'Azevedo , Oak Ridge National Laboratory, TN 37831-6164, USA
T. C. Schulthess , Oak Ridge National Laboratory, TN 37831-6164, USA
pp. 1-10
Papers

Performance prediction of large-scale parallel system and application using macro-level simulation (Abstract)

Ryutaro Susukita , Information Technologies & Nanotechnologies, Fukuoka, Japan
Hisashige Ando , Fujitsu, Tokyo, Japan
Mutsumi Aoyagi , Kyushu University, Fukuoka, Japan
Hiroaki Honda , Information Technologies & Nanotechnologies, Fukuoka, Japan
Yuichi Inadomi , Information Technologies & Nanotechnologies, Fukuoka, Japan
Koji Inoue , Kyushu University, Fukuoka, Japan
Shigeru Ishizuki , Fujitsu, Tokyo, Japan
Yasunori Kimura , Fujitsu, Tokyo, Japan
Hidemi Komatsu , Fujitsu, Tokyo, Japan
Motoyoshi Kurokawa , RIKEN (The Institute of Physical & Chemical Research), Wako, Japan
Kazuaki J. Murakami , Kyushu University, Fukuoka, Japan
Hidetomo Shibamura , Information Technologies & Nanotechnologies, Fukuoka, Japan
Shuji Yamamura , Fujitsu, Tokyo, Japan
Yunqing Yu , Kyushu University, Fukuoka, Japan
pp. 1-9

A novel domain oriented approach for scientific grid workflow composition (Abstract)

Jun Qin , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-12

Toward loosely coupled programming on petascale systems (Abstract)

Ioan Raicu , University of Chicago, Chicago, IL
Zhao Zhang , University of Chicago and Argonne National Laboratory, Chicago, IL
Mike Wilde , Argonne National Laboratory, Argonne, IL and University of Chicago and Argonne National Laboratory, Chicago, IL
Ian Foster , Argonne National Laboratory, Argonne, IL and University of Chicago, Chicago, IL and University of Chicago and Argonne National Laboratory, Chicago, IL
Pete Beckman , Argonne National Laboratory, Argonne, IL
Kamil Iskra , Argonne National Laboratory, Argonne, IL
Ben Clifford , University of Chicago and Argonne National Laboratory, Chicago, IL
pp. 1-12

EpiSimdemics: An efficient algorithm for simulating the spread of infectious disease over large realistic social networks (PDF)

Christopher L. Barrett , Network Dynamics and Simulation Science Laboratory, Virginia Tech, Blacksburg, 24061, USA
Keith R. Bisset , Network Dynamics and Simulation Science Laboratory, Virginia Tech, Blacksburg, 24061, USA
Stephen G. Eubank , Network Dynamics and Simulation Science Laboratory, Virginia Tech, Blacksburg, 24061, USA
Xizhou Feng , Network Dynamics and Simulation Science Laboratory, Virginia Tech, Blacksburg, 24061, USA
Madhav V. Marathe , Network Dynamics and Simulation Science Laboratory, Virginia Tech, Blacksburg, 24061, USA
pp. 1-12

Programming the Intel 80-core network-on-a-chip Terascale Processor (PDF)

Timothy G. Mattson , Intel Corp., DuPont, WA, USA
Rob Van der Wijngaart , Intel Corp., Santa Clara, CA USA
Michael Frumkin , Google Inc., Mountain View, CA USA
pp. 1-11

0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner (PDF)

K. J. Bowers , D. E. Shaw Research LLC, 120 W 45th Street, 39th Floor, New York, 10036, USA
B. J. Albright , Applied Physics Division / X-1-PTA Plasma Theory and Applications of the Los Alamos National Laboratory, NM 87544, USA
B. Bergen , Computer, Computational, and Statistical Sciences Division / CCS-2 Computational Physics of the Los Alamos National Laboratory, NM 87544, USA
L. Yin , Applied Physics Division / X-1-PTA Plasma Theory and Applications of the Los Alamos National Laboratory, NM 87544, USA
K. J. Barker , Computer, Computational, and Statistical Sciences Division / CCS-1 Computer Science on High Performance Computing of the Los Alamos National Laboratory, NM 87544, USA
D. J. Kerbyson , Computer, Computational, and Statistical Sciences Division / CCS-1 Computer Science on High Performance Computing of the Los Alamos National Laboratory, NM 87544, USA
pp. 1-11

PAM: A novel performance/power aware meta-scheduler for multi-core systems (PDF)

Mohammad Banikazemi , IBM Thomas J. Watson Research Center, Hawthorne, NY, USA
Dan Poff , IBM Thomas J. Watson Research Center, Hawthorne, NY, USA
Bulent Abali , IBM Thomas J. Watson Research Center, Hawthorne, NY, USA
pp. 1-12
Papers

Applying double auctions for scheduling of workflows on the Grid (Abstract)

Marek Wieczorek , University of Innsbruck, Innsbruck, Austria
Stefan Podlipnig , University of Innsbruck, Innsbruck, Austria
Radu Prodan , University of Innsbruck, Innsbruck, Austria
Thomas Fahringer , University of Innsbruck, Innsbruck, Austria
pp. 1-11

A novel migration-based NUCA design for chip multiprocessors (Abstract)

Mahmut Kandemir , Pennsylvania State University
Feihui Li , NVIDIA
Mary Jane Irwin , Pennsylvania State University
Seung Woo Son , Pennsylvania State University
pp. 1-12

Communication avoiding Gaussian elimination (Abstract)

Laura Grigori , Universite Paris-Sud, Orsay France
James W. Demmel , UC Berkeley, CA
Hua Xiang , Universite Paris-Sud, Orsay France
pp. 1-12

Extending CC-NUMA systems to support write update optimizations (Abstract)

Liqun Cheng , Intel Corp. and University of Utah
John B. Carter , IBM Austin Research Laboratory and University of Utah
pp. 1-12

Benchmarking GPUs to tune dense linear algebra (Abstract)

Vasily Volkov , University of California at Berkeley
James W. Demmel , University of California at Berkeley
pp. 1-11

High-radix crossbar switches enabled by proximity communication (Abstract)

Hans Eberle , Sun Microsystems, Menlo Park, CA
Pedro J. Garcia , Universidad de Castilla-La Mancha, Albacete, Spain
José Flich , Universidad Politécnica de Valencia, Valencia, Spain
José Duato , Universidad Politécnica de Valencia, Valencia, Spain
Robert Drost , Sun Microsystems, Menlo Park, CA
Nils Gura , Sun Microsystems, Menlo Park, CA
David Hopkins , Sun Microsystems, Menlo Park, CA
Wladek Olesinski , Sun Microsystems, Menlo Park, CA
pp. 1-12

Massively parallel genomic sequence search on the Blue Gene/P architecture (Abstract)

Heshan Lin , North Carolina State University
Pavan Balaji , Argonne National Laboratory
Carlos Sosa , University of Minnesota, Minneapolis, MN
Xiaosong Ma , North Carolina State University
Wu-chun Feng , Virginia Tech
pp. 1-11
Papers

An efficient parallel approach for identifying protein families in large-scale metagenomic data sets (Abstract)

Changjun Wu , Washington State University, Pullman, WA
Ananth Kalyanaraman , Washington State University, Pullman, WA
pp. 1-10

An adaptive cut-off for task parallelism (Abstract)

Alejandro Duran , Universitat Politècnica de Catalunya
Julita Corbalán , Universitat Politècnica de Catalunya
Eduard Ayguadé , Universitat Politècnica de Catalunya
pp. 1-11

Massively parallel volume rendering using 2–3 swap image compositing (PDF)

Hongfeng Yu , Department of Computer Science, University of California at Davis, USA
Chaoli Wang , Department of Computer Science, University of California at Davis, USA
Kwan-Liu Ma , Department of Computer Science, University of California at Davis, USA
pp. 1-11
Papers

Programming the Intel 80-core network-on-a-chip terascale processor (Abstract)

Timothy G. Mattson , Intel Corp., DuPont, WA
Rob Van der Wijngaart , Intel Corp., Santa Clara, CA
Michael Frumkin , Google Inc., Mountain View, CA
pp. 1-11

The cost of doing science on the cloud: The Montage example (PDF)

Ewa Deelman , USC Information Sciences Institute, Marina del Rey, CA, USA
Gurmeet Singh , USC Information Sciences Institute, Marina del Rey, CA, USA
Miron Livny , University of Wisconsin Madison, USA
Bruce Berriman , Infrared Processing and Analysis Center & Michelson Science Center, California Institute of Technology, Pasadena, USA
John Good , Infrared Processing and Analysis Center, California Institute of Technology, Pasadena, USA
pp. 1-12
Papers

Hiding I/O latency with pre-execution prefetching for parallel applications (Abstract)

Yong Chen , Illinois Institute of Technology, Chicago, IL
Surendra Byna , Illinois Institute of Technology, Chicago, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
William Gropp , University of Illinois Urbana-Champaign, Urbana, IL
pp. 1-10

Analysis of application heartbeats: Learning structural and temporal features in time series data for identification of performance problems (PDF)

Emma S. Buneci , Duke University, Department of Computer Science, Durham, NC, USA
Daniel A. Reed , Microsoft Research, 1 Microsoft Way, Redmond, WA 98052, USA
pp. 1-12

Server-storage virtualization: Integration and load balancing in data centers (PDF)

Aameek Singh , IBM Almaden Research Center, USA
Madhukar Korupolu , IBM Almaden Research Center, USA
Dushmanta Mohapatra , Georgia Tech, USA
pp. 1-12
Papers

Proactive process-level live migration in HPC environments (Abstract)

Chao Wang , North Carolina State University, Raleigh, NC
Frank Mueller , North Carolina State University, Raleigh, NC
Christian Engelmann , Oak Ridge National Laboratory, Oak Ridge, TN
Stephen L. Scott , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-12

Parallel I/O prefetching using MPI file caching and I/O signatures (Abstract)

Surendra Byna , Illinois Institute of Technology, Chicago, IL
Yong Chen , Illinois Institute of Technology, Chicago, IL
Xian-He Sun , Illinois Institute of Technology, Chicago, IL
Rajeev Thakur , Argonne National Laboratory, Argonne, IL
William Gropp , University of Illinois Urbana-Champaign, Urbana, IL
pp. 1-12

BitDew: a programmable environment for large-scale data management and distribution (Abstract)

Gilles Fedak , Univ Paris-Sud, CNRS, Orsay
Haiwu He , Univ Paris-Sud, CNRS, Orsay
Franck Cappello , Univ Paris-Sud, CNRS, Orsay
pp. 1-12

Global Trees: A framework for linked data structures on distributed memory parallel systems (PDF)

D. Brian Larkins , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
James Dinan , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
Sriram Krishnamoorthy , Pacific Northwest National Laboratory, Richland, WA 99352, USA
Srinivasan Parthasarathy , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
Atanas Rountev , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
P. Sadayappan , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
pp. 1-13

Parallel exact inference on the Cell Broadband Engine processor (PDF)

Yinglong Xia , Computer Science Department, University of Southern California, Los Angeles, 90089, USA
Viktor K. Prasanna , Ming Hsieh Department of Electrical Engineering, University of Southern California, Los Angeles, 90089, USA
pp. 1-12
Papers

Massively parallel volume rendering using 2-3 swap image compositing (Abstract)

Hongfeng Yu , University of California at Davis
Chaoli Wang , University of California at Davis
Kwan-Liu Ma , University of California at Davis
pp. 1-11

Capturing performance knowledge for automated analysis (Abstract)

Kevin A. Huck , University of Oregon, Eugene, OR
Oscar Hernandez , University of Houston, Houston, TX
Van Bui , University of Houston, Houston, TX
Sunita Chandrasekaran , Nanyang Technological University, Singapore
Barbara Chapman , University of Houston, Houston, TX
Allen D. Malony , University of Oregon, Eugene, OR
Lois Curfman McInnes , Argonne National Laboratory, Argonne, IL
Boyana Norris , Argonne National Laboratory, Argonne, IL
pp. 1-10

The cost of doing science on the cloud: the Montage example (Abstract)

Ewa Deelman , USC Information Sciences Institute, Marina del Rey, CA
Gurmeet Singh , USC Information Sciences Institute, Marina del Rey, CA
Miron Livny , University of Wisconsin Madison, Madison, WI
Bruce Berriman , California Institute of Technology, Pasadena, CA
John Good , California Institute of Technology, Pasadena, CA
pp. 1-12

High performance multivariate visual data exploration for extremely large data (Abstract)

Oliver Rübel , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA and Technische Universität Kaiserslautern, Kaiserslautern, Germany
Prabhat , Lawrence Berkeley National Laboratory, Berkeley, CA
Kesheng Wu , Lawrence Berkeley National Laboratory, Berkeley, CA
Hank Childs , Lawrence Livermore National Laboratory, Livermore, CA
Jeremy Meredith , Oak Ridge National Laboratory, Oak Ridge, TN
Cameron G. R. Geddes , LOASIS program of Lawrence Berkeley National Laboratory, Berkeley, CA
Estelle Cormier-Michel , LOASIS program of Lawrence Berkeley National Laboratory, Berkeley, CA
Sean Ahern , Oak Ridge National Laboratory, Oak Ridge, TN
Gunther H. Weber , Lawrence Berkeley National Laboratory, Berkeley, CA
Peter Messmer , Tech-X Corporation, Boulder, CO
Hans Hagen , Technische Universität Kaiserslautern, Kaiserslautern, Germany
Bernd Hamann , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA and Technische Universität Kaiserslautern, Kaiserslautern, Germany
E. Wes Bethel , Lawrence Berkeley National Laboratory, Berkeley, CA and University of California, Davis, CA
pp. 1-12

Server-storage virtualization: integration and load balancing in data centers (Abstract)

Aameek Singh , IBM Almaden Research Center
Madhukar Korupolu , IBM Almaden Research Center
Dushmanta Mohapatra , Georgia Tech
pp. 1-12

Materialized community ground models for large-scale earthquake simulation (Abstract)

Steven W. Schlosser , Intel Research Pittsburgh
Michael P. Ryan , Intel Research Pittsburgh
Ricardo Taborda , Carnegie Mellon University
Julio López , Carnegie Mellon University
David R. O'Hallaron , Intel Research Pittsburgh and Carnegie Mellon University
Jacobo Bielak , Carnegie Mellon University
pp. 1-12

Positivity, posynomials and tile size selection (Abstract)

Lakshminarayanan Renganarayana , IBM T.J. Watson Research Center, Yorktown Heights, New York
Sanjay Rajopadhye , Colorado State University, Fort Collins, Colorado
pp. 1-12

A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories (Abstract)

Tiankai Tu , D. E. Shaw Research, New York, NY
Charles A. Rendleman , D. E. Shaw Research, New York, NY
David W. Borhani , D. E. Shaw Research, New York, NY
Ron O. Dror , D. E. Shaw Research, New York, NY
Justin Gullingsrud , D. E. Shaw Research, New York, NY
Morten Ø. Jensen , D. E. Shaw Research, New York, NY
John L. Klepeis , D. E. Shaw Research, New York, NY
Paul Maragakis , D. E. Shaw Research, New York, NY
Patrick Miller , D. E. Shaw Research, New York, NY
Kate A. Stafford , D. E. Shaw Research, New York, NY
David E. Shaw , D. E. Shaw Research, New York, NY
pp. 1-12

Global trees: a framework for linked data structures on distributed memory parallel systems (Abstract)

D. Brian Larkins , The Ohio State University, Columbus, OH
James Dinan , The Ohio State University, Columbus, OH
Sriram Krishnamoorthy , Pacific Northwest National Laboratory, Richland, WA
Srinivasan Parthasarathy , The Ohio State University, Columbus, OH
Atanas Rountev , The Ohio State University, Columbus, OH
P. Sadayappan , The Ohio State University, Columbus, OH
pp. 1-13

Parallel exact inference on the Cell Broadband Engine processor (Abstract)

Yinglong Xia , University of Southern California, Los Angeles, CA
Viktor K. Prasanna , University of Southern California, Los Angeles, CA
pp. 1-12

Prefetch throttling and data pinning for improving performance of shared caches (Abstract)

Ozcan Ozturk , Bilkent University
Seung Woo Son , Pennsylvania State University
Mahmut Kandemir , Pennsylvania State University
Mustafa Karakoy , Imperial College
pp. 1-12

High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors (Abstract)

Laura Carrington , San Diego Supercomputer Center, La Jolla, CA
Dimitri Komatitsch , Université de Pau, Pau, France and Institut Universitaire de France, Paris, France
Michael Laurenzano , San Diego Supercomputer Center, La Jolla, CA
Mustafa M Tikir , San Diego Supercomputer Center, La Jolla, CA
David Michéa , Université de Pau, Pau, France
Nicolas Le Goff , Université de Pau, Pau, France
Allan Snavely , San Diego Supercomputer Center, La Jolla, CA
Jeroen Tromp , California Institute of Technology, Pasadena, CA
pp. 1-11

New algorithm to enable 400+ TFlop/s sustained performance in simulations of disorder effects in high-Tc superconductors (Abstract)

G. Alvarez , Oak Ridge National Laboratory, Oak Ridge, TN
M. S. Summers , Oak Ridge National Laboratory, Oak Ridge, TN
D. E. Maxwell , Oak Ridge National Laboratory, Oak Ridge, TN
M. Eisenbach , Oak Ridge National Laboratory, Oak Ridge, TN
J. S. Meredith , Oak Ridge National Laboratory, Oak Ridge, TN
J. M. Larkin , Cray Incorporated, Oak Ridge, TN
J. Levesque , Cray Incorporated, Oak Ridge, TN
T. A. Maier , Oak Ridge National Laboratory, Oak Ridge, TN
P. R. C. Kent , Oak Ridge National Laboratory, Oak Ridge, TN
E. F. D'Azevedo , Oak Ridge National Laboratory, Oak Ridge, TN
T. C. Schulthess , Oak Ridge National Laboratory, Oak Ridge, TN
pp. 1-10

Scalable adaptive mantle convection simulation on petascale supercomputers (Abstract)

Carsten Burstedde , The University of Texas at Austin, Austin, Texas
Omar Ghattas , The University of Texas at Austin, Austin, Texas
Michael Gurnis , California Institute of Technology, Pasadena, California
Georg Stadler , The University of Texas at Austin, Austin, Texas
Eh Tan , California Institute of Technology, Pasadena, California
Tiankai Tu , The University of Texas at Austin, Austin, Texas
Lucas C. Wilcox , The University of Texas at Austin, Austin, Texas
Shijie Zhong , University of Colorado, Boulder, Colorado
pp. 1-15

0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner (Abstract)

K. J. Bowers , X-1-PTA Plasma Theory and Applications
B. J. Albright , X-1-PTA Plasma Theory and Applications
B. Bergen , CCS-2 Computational Physics
L. Yin , X-1-PTA Plasma Theory and Applications
K. J. Barker , CCS-1 Computer Science on High Performance Computing of the Los Alamos National Laboratory, Los Alamos, NM
D. J. Kerbyson , CCS-1 Computer Science on High Performance Computing of the Los Alamos National Laboratory, Los Alamos, NM
pp. 1-11

369 Tflop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer (Abstract)

Sriram Swaminarayan , Los Alamos National Laboratory, Los Alamos, NM
Kai Kadau , Los Alamos National Laboratory, Los Alamos, NM
Timothy C. Germann , Los Alamos National Laboratory, Los Alamos, NM
Gordon C. Fossum , IBM Corporation, Austin, TX
pp. 1-10

Linearly scaling 3D fragment method for large-scale electronic structure calculations (Abstract)

Lin-Wang Wang , Lawrence Berkeley National Laboratory, Berkeley, CA
Byounghak Lee , Lawrence Berkeley National Laboratory, Berkeley, CA
Hongzhang Shan , Lawrence Berkeley National Laboratory, Berkeley, CA
Zhengji Zhao , Lawrence Berkeley National Laboratory, Berkeley, CA
Juan Meza , Lawrence Berkeley National Laboratory, Berkeley, CA
Erich Strohmaier , Lawrence Berkeley National Laboratory, Berkeley, CA
David H. Bailey , Lawrence Berkeley National Laboratory, Berkeley, CA
pp. 1-10