2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (2013)
Denver, CO, USA
Nov. 17, 2013 to Nov. 22, 2013
ISSN: 2167-4337
ISBN: 978-1-4503-2378-9
TABLE OF CONTENTS

Introduction

William Gropp , University of Illinois at Urbana-Champaign, USA
Satoshi Matsuoka , Tokyo Institute of Technology, Japan
pp. 1-2

Taking a quantum leap in time to solution for simulations of high-Tc superconductors

Peter Staar , ITP, ETH Zurich, Switzerland
Thomas A. Maier , CSMD, Oak Ridge Natl. Lab., USA
Raffaele Solca , ITP, ETH Zurich, Switzerland
Gilles Fourestey , CSCS, ETH Zurich, Switzerland
Michael S. Summers , CSMD, Oak Ridge Natl. Lab., USA
Thomas C. Schulthess , ITP & CSCS, ETH Zurich, Switzerland
pp. 1-11

20 Petaflops simulation of proteins suspensions in crowding conditions

Massimo Bernaschi , Istituto Applicazioni Calcolo, (CNR-IAC), Consiglio Nazionale delle Ricerche, Rome, Italy
Mauro Bisson , Istituto Processi Chimico-Fisici, (CNR-IPCF), Consiglio Nazionale delle Ricerche, Rome, Italy
Massimiliano Fatica , Nvidia Corporation, Santa Clara, CA, USA
Simone Melchionna , Istituto Processi Chimico-Fisici, (CNR-IPCF), Consiglio Nazionale delle Ricerche, Italy
pp. 1-11

11 PFLOP/s simulations of cloud cavitation collapse

Diego Rossinelli , Professorship for Computational Science, ETH Zürich, Switzerland
Babak Hejazialhosseini , Professorship for Computational Science, ETH Zürich, Switzerland
Panagiotis Hadjidoukas , Professorship for Computational Science, ETH Zürich, Switzerland
Costas Bekas , IBM Research Division, Zürich Research Laboratory, Switzerland
Alessandro Curioni , IBM Research Division, Zürich Research Laboratory, Switzerland
Adam Bertsch , Lawrence Livermore National Laboratory, U.S.A.
Scott Futral , Lawrence Livermore National Laboratory, U.S.A.
Steffen J. Schmidt , Institute of Aerodynamics and Fluid Mechanics, TU München, Germany
Nikolaus A. Adams , Institute of Aerodynamics and Fluid Mechanics, TU München, Germany
Petros Koumoutsakos , Professorship for Computational Science, ETH Zürich, Switzerland
pp. 1-13

The origin of mass

Peter Boyle , U. of Edinburgh, UK
Michael I. Buchoff , LLNL, CA, USA
Norman Christ , Columbia U., NY, USA
Taku Izubuchi , BNL, NY, USA
Chulwoo Jung , BNL, NY, USA
Thomas C. Luu , LLNL, CA, USA
Robert Mawhinney , Columbia U., NY, USA
Chris Schroeder , LLNL, CA, USA
Ron Soltz , LLNL, CA, USA
Pavlos Vranas , LLNL, CA, USA
Joseph Wasem , LLNL, CA, USA
pp. 1-10

Radiative signature of the relativistic Kelvin-Helmholtz Instability

M. Bussmann , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
H. Burau , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
T.E. Cowan , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
A. Debus , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
A. Huebl , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
G. Juckeland , Technische Universität Dresden, Center for Information Services and High Performance Computing, 01062, Germany
T. Kluge , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
W.E. Nagel , Technische Universität Dresden, Center for Information Services and High Performance Computing, 01062, Germany
R. Pausch , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
F. Schmitt , Technische Universität Dresden, Center for Information Services and High Performance Computing, 01062, Germany
U. Schramm , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
J. Schuchart , Oak Ridge National Lab, PO Box 2008 MS-6164, TN 37831-6164, USA
R. Widera , Helmholtz-Zentrum Dresden-Rossendorf, Bautzner Landstrasse 400, 01328, Germany
pp. 1-12

HACC: Extreme scaling and performance across diverse architectures

Salman Habib , Argonne National Laboratory, USA
Vitali Morozov , Argonne National Laboratory, USA
Nicholas Frontiere , Argonne National Laboratory, USA
Hal Finkel , Argonne National Laboratory, USA
Adrian Pope , Argonne National Laboratory, USA
Katrin Heitmann , Argonne National Laboratory, USA
pp. 1-10

ACR: Automatic checkpoint/restart for soft and hard error protection

Xiang Ni , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Esteban Meneses , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Nikhil Jain , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Laxmikant V. Kale , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
pp. 1-12

SPBC: Leveraging the characteristics of MPI HPC applications for scalable checkpointing

Thomas Ropars , École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
Tatiana V. Martsinkevich , INRIA, University of Paris Sud, France
Amina Guermouche , Université de Versailles, Saint-Quentin en Yveline, France
Andre Schiper , École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
Franck Cappello , Argonne National Laboratory, USA
pp. 1-12

Using simulation to explore distributed key-value stores for extreme-scale system services

Ke Wang , Illinois Institute of Technology, USA
Abhishek Kulkarni , Indiana University, USA
Michael Lang , Los Alamos National Laboratory, USA
Dorian Arnold , University of New Mexico, USA
Ioan Raicu , Illinois Institute of Technology, USA
pp. 1-12

General transformations for GPU execution of tree traversals

Michael Goldfarb , School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA
Youngjoon Jo , School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA
Milind Kulkarni , School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA
pp. 1-12

A large-scale cross-architecture evaluation of thread-coarsening

Alberto Magni , University of Edinburgh, UK
Christophe Dubach , University of Edinburgh, UK
Michael F.P. O'Boyle , University of Edinburgh, UK
pp. 1-11

Semi-automatic restructuring of offloadable tasks for many-core accelerators

Nishkam Ravi , NEC Labs America, Princeton, NJ, USA
Yi Yang , NEC Labs America, Princeton, NJ, USA
Tao Bao , Computer Science, Purdue University, USA
Srimat Chakradhar , NEC Labs America, Princeton, NJ, USA
pp. 1-12

A framework for load balancing of Tensor Contraction expressions via dynamic task partitioning

Pai-Wei Lai , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
Kevin Stock , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
Samyam Rajbhandari , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
Sriram Krishnamoorthy , Computational Sciences and Mathematics Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
P. Sadayappan , Dept. of Computer Science and Engineering, The Ohio State University, Columbus, 43221, USA
pp. 1-10

Load-balanced pipeline parallelism

Md Kamruzzaman , Computer Science and Engineering, University of California, San Diego, USA
Steven Swanson , Computer Science and Engineering, University of California, San Diego, USA
Dean M. Tullsen , Computer Science and Engineering, University of California, San Diego, USA
pp. 1-12

A distributed dynamic load balancer for iterative applications

Harshitha Menon , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Laxmikant Kale , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
pp. 1-11

Distributed wait state tracking for runtime MPI deadlock detection

Tobias Hilbrich , Technische Universität Dresden, D-01062, Germany
Bronis R. de Supinski , Lawrence Livermore National Laboratory, CA 94551, USA
Wolfgang E. Nagel , Technische Universität Dresden, D-01062, Germany
Joachim Protze , RWTH Aachen University, D-52056, Germany
Christel Baier , Technische Universität Dresden, D-01062, Germany
Matthias S. Muller , RWTH Aachen University, D-52056, Germany
pp. 1-12

Globalizing selectively: Shared-memory efficiency with address-space separation

Nilesh Mahajan , Indiana University, Bloomington, IN 47405, USA
Uday Pitambare , Indiana University, Bloomington, IN 47405, USA
Arun Chauhan , Indiana University, Bloomington, IN 47405, USA
pp. 1-12

Hybrid MPI: Efficient message passing for multi-core systems

Andrew Friedley , Indiana University, USA
Greg Bronevetsky , Lawrence Livermore National Laboratory, USA
Torsten Hoefler , ETH Zurich, Switzerland
Andrew Lumsdaine , Indiana University, USA
pp. 1-11

Performance evaluation of Intel® Transactional Synchronization Extensions for high-performance computing

Richard M. Yoo , Parallel Computing Laboratory, Intel Labs, Santa Clara, CA 95054, USA
Christopher J. Hughes , Parallel Computing Laboratory, Intel Labs, Santa Clara, CA 95054, USA
Konrad Lai , Intel Architecture Development Group, Intel Architecture Group, Hillsboro, OR 97124, USA
Ravi Rajwar , Intel Architecture Development Group, Intel Architecture Group, Hillsboro, OR 97124, USA
pp. 1-11

Location-aware cache management for many-core processors with deep cache hierarchy

Jongsoo Park , Parallel Computing Lab, Intel Corporation, USA
Richard M. Yoo , Parallel Computing Lab, Intel Corporation, USA
Daya S. Khudia , University of Michigan - Ann Arbor, USA
Christopher J. Hughes , Parallel Computing Lab, Intel Corporation, USA
Daehyun Kim , Parallel Computing Lab, Intel Corporation, USA
pp. 1-12

Practical nonvolatile multilevel-cell phase change memory

Doe Hyun Yoon , IBM Thomas J. Watson Research Center, USA
Jichuan Chang , Hewlett-Packard Labs, USA
Robert S. Schreiber , Hewlett-Packard Labs, USA
Norman P. Jouppi , Hewlett-Packard Labs, USA
pp. 1-12

Feng Shui of supercomputer memory: Positional effects in DRAM and SRAM faults

Vilas Sridharan , RAS Architecture, Advanced Micro Devices, Inc., Boxborough, MA, USA
Jon Stearley , Scalable Architectures, Sandia National Laboratories, Albuquerque, New Mexico, USA
Nathan DeBardeleben , Ultrascale Systems Research Center, Los Alamos National Laboratory, New Mexico, USA
Sean Blanchard , Ultrascale Systems Research Center, Los Alamos National Laboratory, New Mexico, USA
Sudhanva Gurumurthi , AMD Research, Advanced Micro Devices, Inc., Boxborough, MA, USA
pp. 1-11

Exploring DRAM organizations for energy-efficient and resilient exascale memories

Bharan Giridhar , University of Michigan, Ann Arbor, 48109, USA
Michael Cieslak , University of Michigan, Ann Arbor, 48109, USA
Deepankar Duggal , University of Michigan, Ann Arbor, 48109, USA
Ronald Dreslinski , University of Michigan, Ann Arbor, 48109, USA
Hsing Min Chen , Arizona State University, Tempe, 85287, USA
Robert Patti , Tezzaron Semiconductor, Naperville, IL 60563, USA
Betina Hold , ARM Inc., San Jose, CA 95134, USA
Chaitali Chakrabarti , Arizona State University, Tempe, 85287, USA
Trevor Mudge , University of Michigan, Ann Arbor, 48109, USA
David Blaauw , University of Michigan, Ann Arbor, 48109, USA
pp. 1-12

Low-power, low-storage-overhead chipkill correct via multi-line error correction

Xun Jian , University of Illinois at Urbana-Champaign, USA
Henry Duwe , University of Illinois at Urbana-Champaign, USA
John Sartori , University of Minnesota, USA
Vilas Sridharan , AMD Research, Advanced Micro Devices, Inc., USA
Rakesh Kumar , University of Illinois at Urbana-Champaign, USA
pp. 1-12

AUGEM: Automatically generate high performance Dense Linear Algebra kernels on x86 CPUs

Qian Wang , Institute of Software, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, China
Xianyi Zhang , Institute of Software, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, China
Yunquan Zhang , Institute of Software, Chinese Academy of Sciences, Beijing, China
Qing Yi , University of Colorado at Colorado Springs, United States
pp. 1-12

Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes

Wai Teng Tang , School of Computer Engineering, Nanyang Technological University, Singapore
Wen Jun Tan , School of Computer Engineering, Nanyang Technological University, Singapore
Rajarshi Ray , Department of Computer Science, School of Computing, National University of Singapore, Singapore
Yi Wen Wong , Department of Computer Science, School of Computing, National University of Singapore, Singapore
Weiguang Chen , Department of Computer Science, School of Computing, National University of Singapore, Singapore
Shyh-hao Kuo , Institute of High Performance Computing, Agency for Science, Technology and Research, Singapore
Rick Siow Mong Goh , Institute of High Performance Computing, Agency for Science, Technology and Research, Singapore
Stephen John Turner , School of Computer Engineering, Nanyang Technological University, Singapore
Weng-Fai Wong , Department of Computer Science, School of Computing, National University of Singapore, Singapore
pp. 1-12

Precimonious: Tuning assistant for floating-point precision

Cindy Rubio-Gonzalez , EECS Department, UC Berkeley, USA
Cuong Nguyen , EECS Department, UC Berkeley, USA
Hong Diep Nguyen , EECS Department, UC Berkeley, USA
James Demmel , EECS Department, UC Berkeley, USA
William Kahan , EECS Department, UC Berkeley, USA
Koushik Sen , EECS Department, UC Berkeley, USA
David H. Bailey , Lawrence Berkeley National Laboratory, USA
Costin Iancu , Lawrence Berkeley National Laboratory, USA
David Hough , Oracle Corporation, USA
pp. 1-12

A data-centric profiler for parallel programs

Xu Liu , Department of Computer Science, Rice University, Houston, TX, USA
John Mellor-Crummey , Department of Computer Science, Rice University, Houston, TX, USA
pp. 1-12

On the usefulness of object tracking techniques in performance analysis

German Llort , Barcelona Supercomputing Center, Universitat Politècnica de Catalunya, Jordi Girona 29, 08034, Spain
Harald Servat , Barcelona Supercomputing Center, Universitat Politècnica de Catalunya, Jordi Girona 29, 08034, Spain
Juan Gonzalez , Barcelona Supercomputing Center, Universitat Politècnica de Catalunya, Jordi Girona 29, 08034, Spain
Judit Gimenez , Barcelona Supercomputing Center, Universitat Politècnica de Catalunya, Jordi Girona 29, 08034, Spain
Jesus Labarta , Barcelona Supercomputing Center, Universitat Politècnica de Catalunya, Jordi Girona 29, 08034, Spain
pp. 1-11

Detection of false sharing using machine learning

Sanath Jayasena , Dept of Computer Science & Engineering, University of Moratuwa, Sri Lanka
Saman Amarasinghe , Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, USA
Asanka Abeyweera , Dept of Computer Science & Engineering, University of Moratuwa, Sri Lanka
Gayashan Amarasinghe , Dept of Computer Science & Engineering, University of Moratuwa, Sri Lanka
Himeshi De Silva , Dept of Computer Science & Engineering, University of Moratuwa, Sri Lanka
Sunimal Rathnayake , Dept of Computer Science & Engineering, University of Moratuwa, Sri Lanka
Xiaoqiao Meng , IBM Research, Yorktown Heights, New York, USA
Yanbin Liu , IBM Research, Yorktown Heights, New York, USA
pp. 1-9

Parallelizing the execution of sequential scripts

Zhao Zhang , Department of Computer Science, University of Chicago, USA
Daniel S. Katz , Computation Institute, University of Chicago, USA
Timothy G. Armstrong , Department of Computer Science, University of Chicago, USA
Justin M. Wozniak , Mathematics and Computer Science Division, Argonne National Laboratory, USA
Ian Foster , Computation Institute, University of Chicago, USA
pp. 1-12

Deterministic scale-free pipeline parallelism with hyperqueues

Hans Vandierendonck , Queen's University Belfast, United Kingdom
Kallia Chronaki , Barcelona Supercomputing Center, Spain
Dimitrios S. Nikolopoulos , Queen's University Belfast, United Kingdom
pp. 1-12

Compiling affine loop nests for distributed-memory parallel architectures

Uday Bondhugula , Indian Institute of Science, Department of Computer Science and Automation, Bangalore 560012, India
pp. 1-12

Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors

Jongsoo Park , Parallel Computing Lab, USA
Ganesh Bikshandi , Parallel Computing Lab, USA
Karthikeyan Vaidyanathan , Parallel Computing Lab, USA
Ping Tak Peter Tang , Software and Service Group, Intel Corporation, USA
Pradeep Dubey , Parallel Computing Lab, USA
Daehyun Kim , Parallel Computing Lab, USA
pp. 1-12

A framework for hybrid parallel flow simulations with a trillion cells in complex geometries

Christian Godenschwager , Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 11, Germany
Florian Schornbaum , Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 11, Germany
Martin Bauer , Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 11, Germany
Harald Kostler , Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 11, Germany
Ulrich Rude , Friedrich-Alexander-Universität Erlangen-Nürnberg, Cauerstraße 11, Germany
pp. 1-12

A new routing scheme for jellyfish and its performance with HPC workloads

Xin Yuan , Dept. of Computer Science, Florida State University, Tallahassee, 32312, USA
Santosh Mahapatra , Dept. of Computer Science, Florida State University, Tallahassee, 32312, USA
Wickus Nienaber , Dept. of Computer Science, Florida State University, Tallahassee, 32312, USA
Scott Pakin , Los Alamos National Laboratory, New Mexico, USA
Michael Lang , Los Alamos National Laboratory, New Mexico, USA
pp. 1-11

Enabling fair pricing on HPC systems with node sharing

Alex D. Breslow , University of California, San Diego, USA
Ananta Tiwari , San Diego Supercomputer Center, La Jolla, CA, USA
Martin Schulz , Lawrence Livermore National Laboratory, CA, USA
Laura Carrington , San Diego Supercomputer Center, La Jolla, CA, USA
Lingjia Tang , University of Michigan, Ann Arbor, USA
Jason Mars , University of Michigan, Ann Arbor, USA
pp. 1-12

ACIC: Automatic cloud I/O configurator for HPC applications

Mingliang Liu , Department of Computer Science and Technology, Tsinghua University, China
Ye Jin , Department of Computer Science, North Carolina State University, USA
Jidong Zhai , Department of Computer Science and Technology, Tsinghua University, China
Yan Zhai , Department of Computer Sciences, University of Wisconsin-Madison, USA
Qianqian Shi , Department of Computer Science and Technology, Tsinghua University, China
Xiaosong Ma , Department of Computer Science, North Carolina State University, USA
Wenguang Chen , Department of Computer Science and Technology, Tsinghua University, China
pp. 1-12

COCA: Online distributed resource management for cost minimization and carbon neutrality in data centers

Shaolei Ren , Florida International University, USA
Yuxiong He , Microsoft Research, Redmond, USA
pp. 1-12

Supercomputing with commodity CPUs: Are mobile SoCs ready for HPC?

Nikola Rajovic , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
Paul M. Carpenter , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
Isaac Gelado , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
Nikola Puzovic , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
Alex Ramirez , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
Mateo Valero , Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034, Spain
pp. 1-12

There goes the neighborhood: Performance degradation due to nearby jobs

Abhinav Bhatele , Lawrence Livermore National Laboratory, California 94551 USA
Kathryn Mohror , Lawrence Livermore National Laboratory, California 94551 USA
Steven H. Langer , Lawrence Livermore National Laboratory, California 94551 USA
Katherine E. Isaacs , Department of Computer Science, University of California, Davis, 95616 USA
pp. 1-12

CooMR: Cross-task coordination for efficient data management in MapReduce programs

Xiaobing Li , Department of Computer Science and Software Engineering, Auburn University, AL 36849, USA
Yandong Wang , Department of Computer Science and Software Engineering, Auburn University, AL 36849, USA
Yizheng Jiao , Department of Computer Science and Software Engineering, Auburn University, AL 36849, USA
Cong Xu , Department of Computer Science and Software Engineering, Auburn University, AL 36849, USA
Weikuan Yu , Department of Computer Science and Software Engineering, Auburn University, AL 36849, USA
pp. 1-11

Effective sampling-driven performance tools for GPU-accelerated supercomputers

Milind Chabbi , Department of Computer Science, Rice University, Houston, TX, USA
Karthik Murthy , Department of Computer Science, Rice University, Houston, TX, USA
Michael Fagan , Department of Computer Science, Rice University, Houston, TX, USA
John Mellor-Crummey , Department of Computer Science, Rice University, Houston, TX, USA
pp. 1-12

Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach

Dong Li , Oak Ridge National Laboratory, USA
Zizhong Chen , University of California, Riverside, USA
Panruo Wu , University of California, Riverside, USA
Jeffrey S. Vetter , Oak Ridge National Laboratory, USA
pp. 1-12

Using automated performance modeling to find scalability bugs in complex codes

Alexandru Calotoiu , German Research School for Simulation Sciences, RWTH Aachen University, Germany
Torsten Hoefler , ETH Zurich, Switzerland
Marius Poke , German Research School for Simulation Sciences, RWTH Aachen University, Germany
Felix Wolf , German Research School for Simulation Sciences, RWTH Aachen University, Germany
pp. 1-12

Efficient data partitioning model for heterogeneous graphs in the cloud

Kisung Lee , Georgia Institute of Technology, USA
Ling Liu , Georgia Institute of Technology, USA
pp. 1-12

SDQuery DSI: Integrating data management support with a wide area data transfer protocol

Yu Su , Computer Science and Engineering, The Ohio State University, Columbus, 43210, USA
Yi Wang , Computer Science and Engineering, The Ohio State University, Columbus, 43210, USA
Gagan Agrawal , Computer Science and Engineering, The Ohio State University, Columbus, 43210, USA
Rajkumar Kettimuthu , The University of Chicago, IL 60439, USA
pp. 1-12

Design and performance evaluation of NUMA-aware RDMA-based end-to-end data transfer systems

Yufei Ren , Stony Brook University, New York 11790, USA
Tan Li , Stony Brook University, New York 11790, USA
Dantong Yu , Brookhaven National Laboratory, Upton, New York 11973, USA
Shudong Jin , Stony Brook University, New York 11790, USA
Thomas Robertazzi , Stony Brook University, New York 11790, USA
pp. 1-10

Scalable parallel OPTICS data clustering using graph algorithmic techniques

Md. Mostofa Ali Patwary , Northwestern University, Evanston, IL 60208, USA
Diana Palsetia , Northwestern University, Evanston, IL 60208, USA
Ankit Agrawal , Northwestern University, Evanston, IL 60208, USA
Wei-keng Liao , Northwestern University, Evanston, IL 60208, USA
Fredrik Manne , University of Bergen, Norway
Alok Choudhary , Northwestern University, Evanston, IL 60208, USA
pp. 1-12

Scalable matrix computations on large scale-free graphs using 2D graph partitioning

Erik G. Boman , Sandia National Laboratories, Scalable Algorithms Department, Albuquerque, NM 87185, USA
Karen D. Devine , Sandia National Laboratories, Scalable Algorithms Department, Albuquerque, NM 87185, USA
Sivasankaran Rajamanickam , Sandia National Laboratories, Scalable Algorithms Department, Albuquerque, NM 87185, USA
pp. 1-12

Scalable parallel graph partitioning

Shad Kirmani , Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Padma Raghavan , Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
pp. 1-10

Channel reservation protocol for over-subscribed channels and destinations

George Michelogiannakis , Stanford University, USA
Nan Jiang , Stanford University, USA
Daniel Becker , Stanford University, USA
William J. Dally , Stanford University, USA
pp. 1-12

Enabling highly-scalable remote memory access programming with MPI-3 one sided

Robert Gerstenberger , ETH Zurich, Dept. of Computer Science, Universitätstr. 6, 8092, Switzerland
Maciej Besta , ETH Zurich, Dept. of Computer Science, Universitätstr. 6, 8092, Switzerland
Torsten Hoefler , ETH Zurich, Dept. of Computer Science, Universitätstr. 6, 8092, Switzerland
pp. 1-12

MVAPICH-PRISM: A proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters

Sreeram Potluri , Department of Computer Science and Engineering, The Ohio State University, USA
Devendar Bureddy , Department of Computer Science and Engineering, The Ohio State University, USA
Khaled Hamidouche , Department of Computer Science and Engineering, The Ohio State University, USA
Akshay Venkatesh , Department of Computer Science and Engineering, The Ohio State University, USA
Krishna Kandalla , Department of Computer Science and Engineering, The Ohio State University, USA
Hari Subramoni , Department of Computer Science and Engineering, The Ohio State University, USA
Dhabaleswar K. Panda , Department of Computer Science and Engineering, The Ohio State University, USA
pp. 1-11

Exploring portfolio scheduling for long-term execution of scientific workloads in IaaS clouds

Kefeng Deng , School of Computer, National University of Defense Technology, Changsha, China
Junqiang Song , School of Computer, National University of Defense Technology, Changsha, China
Kaijun Ren , School of Computer, National University of Defense Technology, Changsha, China
Alexandru Iosup , Parallel and Distributed Systems Group, Delft University of Technology, The Netherlands
pp. 1-12

Cost-effective cloud HPC resource provisioning by building Semi-Elastic virtual clusters

Shuangcheng Niu , Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China
Jidong Zhai , Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China
Xiaosong Ma , Department of Computer Science, North Carolina State University, USA
Xiongchao Tang , Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China
Wenguang Chen , Department of Computer Science and Technology, Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing, China
pp. 1-12

Exploiting application dynamism and cloud elasticity for continuous dataflows

Alok Kumbhare , University of Southern California, Los Angeles, 90089, USA
Yogesh Simmhan , University of Southern California, Los Angeles, 90089, USA
Viktor K. Prasanna , University of Southern California, Los Angeles, 90089, USA
pp. 1-12

A ‘cool’ way of improving the reliability of HPC machines

Osman Sarood , Dept. of Computer Science, University of Illinois at Urbana-Champaign, 61801, USA
Esteban Meneses , Dept. of Computer Science, University of Illinois at Urbana-Champaign, 61801, USA
Laxmikant V. Kale , Dept. of Computer Science, University of Illinois at Urbana-Champaign, 61801, USA
pp. 1-12

Coordinated energy management in heterogeneous processors

Indrani Paul , Advanced Micro Devices, Inc., USA
Vignesh Ravi , Advanced Micro Devices, Inc., USA
Srilatha Manne , Advanced Micro Devices, Inc., USA
Manish Arora , Advanced Micro Devices, Inc., USA
Sudhakar Yalamanchili , Georgia Institute of Technology
pp. 1-12

Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems

Xu Yang , Illinois Institute of Technology, Chicago, USA
Zhou Zhou , Illinois Institute of Technology, Chicago, USA
Sean Wallace , Illinois Institute of Technology, Chicago, USA
Zhiling Lan , Illinois Institute of Technology, Chicago, USA
Wei Tang , Argonne National Laboratory, IL, USA
Susan Coghlan , Argonne National Laboratory, IL, USA
Michael E. Papka , Argonne National Laboratory, IL, USA
pp. 1-11

Petascale direct numerical simulation of turbulent channel flow on up to 786K cores

Myoungkyu Lee , Department of Mechanical Engineering, University of Texas at Austin, 78735, USA
Nicholas Malaya , Institute of Computational Engineering and Sciences, University of Texas at Austin, 78735, USA
Robert D. Moser , Institute of Computational Engineering and Sciences and Department of Mechanical Engineering, University of Texas at Austin, 78735, USA
pp. 1-11

Solving the compressible Navier-Stokes equations on up to 1.97 million cores and 4.1 trillion grid points

Ivan Bermejo-Moreno , Center for Turbulence Research, Stanford University, CA 94305-3024, USA
Julien Bodart , Center for Turbulence Research, Stanford University, CA 94305-3024, USA
Johan Larsson , Department of Mechanical Engineering, University of Maryland, College Park, 20742, USA
Blaise M. Barney , Lawrence Livermore National Laboratory, CA 94550, USA
Joseph W. Nichols , Center for Turbulence Research, Stanford University, CA 94305-3024, USA
Steve Jones , HPC Center, Stanford University, CA 94305-3024, USA
pp. 1-10

Petascale WRF simulation of Hurricane Sandy: Deployment of NCSA's Cray XE6 Blue Waters

Peter Johnsen , Performance Engineering Group, Cray, Inc., St. Paul, MN, USA
Mark Straka , NCSA, University of Illinois at Urbana-Champaign, USA
Melvyn Shapiro , National Center for Atmospheric Research, Boulder, CO, USA
Alan Norton , National Center for Atmospheric Research, Boulder, CO, USA
Thomas Galarneau , National Center for Atmospheric Research, Boulder, CO, USA
pp. 1-7

Optimization of cloud task processing with checkpoint-restart mechanism

Sheng Di , Argonne National Laboratory, USA
Yves Robert , ENS Lyon and INRIA, France
Frederic Vivien , ENS Lyon and INRIA, France
Derrick Kondo , INRIA, Grenoble, France
Cho-Li Wang , The University of Hong Kong, Hong Kong
Franck Cappello , Argonne National Laboratory, USA
pp. 1-12

Scalable virtual machine deployment using VM image caches

Kaveh Razavi , Dept. of Computer Science, VU University Amsterdam, The Netherlands
Thilo Kielmann , Dept. of Computer Science, VU University Amsterdam, The Netherlands
pp. 1-12

Guide-copy: Fast and silent migration of virtual machine for datacenters

Jihun Kim , Department of Computer Science and Engineering, POSTECH, Korea
Dongju Chae , Department of Computer Science and Engineering, POSTECH, Korea
Jangwoo Kim , Department of Computer Science and Engineering, POSTECH, Korea
Jong Kim , Department of Computer Science and Engineering, POSTECH, Korea
pp. 1-12

Characterization and modeling of PIDX parallel I/O for performance optimization

Sidharth Kumar , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
Avishek Saha , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
Venkatram Vishwanath , Argonne National Laboratory, IL, USA
Philip Carns , Argonne National Laboratory, IL, USA
John A. Schmidt , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
Giorgio Scorzelli , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
Hemanth Kolla , Sandia National Laboratories, Livermore, CA, USA
Ray Grout , National Renewable Energy Laboratory, Golden, CO, USA
Robert Latham , Argonne National Laboratory, IL, USA
Robert Ross , Argonne National Laboratory, IL, USA
Michael E. Papka , Argonne National Laboratory, IL, USA
Jacqueline Chen , Sandia National Laboratories, Livermore, CA, USA
Valerio Pascucci , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
pp. 1-12

Taming parallel I/O complexity with auto-tuning (Abstract)

Babak Behzad , Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Huong Vu Thanh Luu , Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Joseph Huchette , Rice Univ., Houston, TX, USA
Surendra Byna , Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
Prabhat , Lawrence Berkeley Nat. Lab., Berkeley, CA, USA
Ruth Aydt , HDF Group, USA
Quincey Koziol , HDF Group, USA
Marc Snir , Argonne Nat. Lab., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
pp. 1-12

Toward millions of file system IOPS on low-cost, commodity hardware (Abstract)

Da Zheng , Department of Computer Science, Johns Hopkins University, USA
Randal Burns , Department of Computer Science, Johns Hopkins University, USA
Alexander S. Szalay , Department of Physics and Astronomy, Johns Hopkins University, USA
pp. 1-12

Physics-based seismic hazard analysis on petascale heterogeneous supercomputers (Abstract)

Y. Cui , University of California, San Diego, USA
E. Poyraz , University of California, San Diego, USA
K. B. Olsen , San Diego State University, USA
J. Zhou , University of California, San Diego, USA
K. Withers , San Diego State University, USA
S. Callaghan , University of Southern California, USA
J. Larkin , NVIDIA Inc., USA
C. Guest , University of California, San Diego, USA
D. Choi , University of California, San Diego, USA
A. Chourasia , University of California, San Diego, USA
Z. Shi , San Diego State University, USA
S. M. Day , San Diego State University, USA
P. J. Maechling , University of Southern California, USA
T. H. Jordan , University of Southern California, USA
pp. 1-12

A scalable parallel algorithm for dynamic range-limited n-tuple computation in many-body molecular dynamics simulation (Abstract)

Manaschai Kunaseth , Collaboratory for Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Material Science, University of Southern California, Los Angeles, 90089-0242, USA
Rajiv K. Kalia , Collaboratory for Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Material Science, University of Southern California, Los Angeles, 90089-0242, USA
Aiichiro Nakano , Collaboratory for Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Material Science, University of Southern California, Los Angeles, 90089-0242, USA
Ken-ichi Nomura , Center for High-Performance Computing and Communications, University of Southern California, Los Angeles, 90089-0706, USA
Priya Vashishta , Collaboratory for Advanced Computing and Simulations, Department of Computer Science, Department of Physics & Astronomy, Department of Material Science, University of Southern California, Los Angeles, 90089-0242, USA
pp. 1-12

2HOT: An improved parallel hashed oct-tree N-Body algorithm for cosmological simulation (PDF)

Michael S. Warren , Theoretical Division, Los Alamos National Laboratory, USA
pp. 1-12

SIDR: Structure-aware intelligent data routing in Hadoop (Abstract)

Joe Buck , Department of Computer Science, University of California - Santa Cruz, USA
Noah Watkins , Department of Computer Science, University of California - Santa Cruz, USA
Greg Levin , Department of Computer Science, University of California - Santa Cruz, USA
Adam Crume , Department of Computer Science, University of California - Santa Cruz, USA
Kleoni Ioannidou , Department of Computer Science, University of California - Santa Cruz, USA
Scott Brandt , Department of Computer Science, University of California - Santa Cruz, USA
Carlos Maltzahn , Department of Computer Science, University of California - Santa Cruz, USA
Neoklis Polyzotis , Department of Computer Science, University of California - Santa Cruz, USA
Aaron Torres , Los Alamos National Laboratory, USA
pp. 1-12

Using cross-layer adaptations for dynamic data management in large scale coupled scientific workflows (Abstract)

Tong Jin , NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, NJ 08854, USA
Fan Zhang , NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, NJ 08854, USA
Qian Sun , NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, NJ 08854, USA
Hoang Bui , NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, NJ 08854, USA
Manish Parashar , NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, NJ 08854, USA
Hongfeng Yu , Computer Science and Engineering, University of Nebraska-Lincoln, 68588, USA
Scott Klasky , Oak Ridge National Laboratory, P.O. Box 2008, TN, 37831, USA
Norbert Podhorszki , Oak Ridge National Laboratory, P.O. Box 2008, TN, 37831, USA
Hasan Abbasi , Oak Ridge National Laboratory, P.O. Box 2008, TN, 37831, USA
pp. 1-12

Exploring the future of out-of-core computing with compute-local non-volatile memory (Abstract)

Myoungsoo Jung , Department of Electrical Engineering, The University of Texas at Dallas, USA
Ellis H. Wilson , Department of Computer Science and Engineering, The Pennsylvania State University, USA
Wonil Choi , Department of Electrical Engineering, The University of Texas at Dallas, USA
John Shalf , Computational Research Division, Lawrence Berkeley National Laboratory, USA
Hasan Metin Aktulga , Computational Research Division, Lawrence Berkeley National Laboratory, USA
Chao Yang , Computational Research Division, Lawrence Berkeley National Laboratory, USA
Erik Saule , Biomedical Informatics, The Ohio State University, USA
Umit V. Catalyurek , Biomedical Informatics, The Ohio State University, USA
Mahmut Kandemir , Department of Computer Science and Engineering, The Pennsylvania State University, USA
pp. 1-11

Assessing the effects of data compression in simulations using physically motivated metrics (Abstract)

Daniel Laney , Lawrence Livermore Lab, USA
Steven Langer , Lawrence Livermore Lab, USA
Christopher Weber , Lawrence Livermore Lab, USA
Peter Lindstrom , Lawrence Livermore Lab, USA
Al Wegener , Samplify, USA
pp. 1-12

Exploring power behaviors and trade-offs of in-situ data analytics (Abstract)

Marc Gamell , Rutgers University, USA
Ivan Rodero , Rutgers University, USA
Manish Parashar , Rutgers University, USA
Janine C. Bennett , Sandia National Laboratories, USA
Hemanth Kolla , Sandia National Laboratories, USA
Jacqueline Chen , Sandia National Laboratories, USA
Peer-Timo Bremer , Lawrence Livermore National Laboratory & University of Utah, USA
Aaditya G. Landge , University of Utah, USA
Attila Gyulassy , University of Utah, USA
Patrick McCormick , Los Alamos National Laboratory, USA
Scott Pakin , Los Alamos National Laboratory, USA
Valerio Pascucci , University of Utah & Pacific Northwest National Laboratory, USA
Scott Klasky , Oak Ridge National Laboratory, USA
pp. 1-12

GoldRush: Resource efficient in situ scientific data analytics using fine-grained interference aware execution (Abstract)

Fang Zheng , Georgia Institute of Technology, USA
Hongfeng Yu , University of Nebraska-Lincoln, USA
Can Hantas , Georgia Institute of Technology, USA
Matthew Wolf , Georgia Institute of Technology, USA
Greg Eisenhauer , Georgia Institute of Technology, USA
Karsten Schwan , Georgia Institute of Technology, USA
Hasan Abbasi , Oak Ridge National Laboratory, USA
Scott Klasky , Oak Ridge National Laboratory, USA
pp. 1-12

A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes (Abstract)

James King , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
Robert M. Kirby , School of Computing and Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, USA
pp. 1-12

Scalable domain decomposition preconditioners for heterogeneous elliptic problems (Abstract)

Pierre Jolivet , Laboratoire J. Kuntzmann, Université J. Fourier, Grenoble Cedex 9, France
Frederic Hecht , Laboratoire J.-L. Lions, Université P. et M. Curie, Paris, France
Frederic Nataf , INRIA, ALPINES research team, Rocquencourt, France
Christophe Prud'homme , IRMA, Université de Strasbourg, Cedex, France
pp. 1-11

Parallel design and performance of nested filtering factorization preconditioner (Abstract)

Long Qu , Université Paris Sud 11, Laboratoire de Recherche en Informatique, Orsay, France
Laura Grigori , INRIA Paris-Rocquencourt, Alpines and UPMC - Univ Paris 6, CNRS UMR 7598, Laboratoire Jacques-Louis Lions, France
Frederic Nataf , UPMC - Univ Paris 6, CNRS, UMR 7598, Laboratoire Jacques-Louis Lions and INRIA, Paris-Rocquencourt, Alpines, France
pp. 1-12

Kinetic turbulence simulations at extreme scale on leadership-class systems (Abstract)

Bei Wang , Princeton Institute of Computational Science and Engineering, Princeton University, NJ, USA
Stephane Ethier , Princeton Plasma Physics Laboratory, NJ, USA
William Tang , Princeton Institute of Computational Science and Engineering, Princeton University, NJ, USA
Timothy Williams , Argonne Leadership Computing Facility, Argonne National Laboratory, IL, USA
Khaled Z. Ibrahim , Computational Research Division, Lawrence Berkeley National Laboratory, CA, USA
Kamesh Madduri , Computer Science and Engineering, The Pennsylvania State University, University Park, USA
Samuel Williams , Computational Research Division, Lawrence Berkeley National Laboratory, CA, USA
Leonid Oliker , Computational Research Division, Lawrence Berkeley National Laboratory, CA, USA
pp. 1-12

Swendsen-Wang multi-cluster algorithm for the 2D/3D Ising model on Xeon Phi and GPU (Abstract)

Florian Wende , Zuse Institute Berlin, Takustrasse 7, D-14195 Dahlem, Germany
Thomas Steinke , Zuse Institute Berlin, Takustrasse 7, D-14195 Dahlem, Germany
pp. 1-12

Mr. Scan: Extreme scale density-based clustering using a tree-based network of GPGPU nodes (Abstract)

Benjamin Welton , Computer Sciences Department, University of Wisconsin, Madison, 53706, USA
Evan Samanas , Computer Sciences Department, University of Wisconsin, Madison, 53706, USA
Barton P. Miller , Computer Sciences Department, University of Wisconsin, Madison, 53706, USA
pp. 1-11

The Science DMZ: A network design pattern for data-intensive science (Abstract)

Eli Dart , Energy Sciences Network, Lawrence Berkeley National Laboratory, CA 94720, USA
Lauren Rotman , Energy Sciences Network, Lawrence Berkeley National Laboratory, CA 94720, USA
Brian Tierney , Energy Sciences Network, Lawrence Berkeley National Laboratory, CA 94720, USA
Mary Hester , Energy Sciences Network, Lawrence Berkeley National Laboratory, CA 94720, USA
Jason Zurawski , Internet2, Office of the CTO, Washington DC, 20036, USA
pp. 1-10

Enabling comprehensive data-driven system management for large computational facilities (Abstract)

James C. Browne , Center for Computational Research, SUNY at Buffalo, NY, USA
Robert L. DeLeon , Center for Computational Research, SUNY at Buffalo, NY, USA
Charng-Da Lu , Center for Computational Research, SUNY at Buffalo, NY, USA
Matthew D. Jones , Center for Computational Research, SUNY at Buffalo, NY, USA
Steven M. Gallo , Center for Computational Research, SUNY at Buffalo, NY, USA
Amin Ghadersohi , Center for Computational Research, SUNY at Buffalo, NY, USA
Abani K. Patra , Center for Computational Research, SUNY at Buffalo, NY, USA
William L. Barth , Texas Advanced Computing Center, University of Texas, Austin, USA
John Hammond , Texas Advanced Computing Center, University of Texas, Austin, USA
Thomas R. Furlani , Center for Computational Research, SUNY at Buffalo, NY, USA
Robert T. McLay , Texas Advanced Computing Center, University of Texas, Austin, USA
pp. 1-11

Insights for exascale IO APIs from building a petascale IO API (Abstract)

Jay Lofstead , Sandia National Laboratories, USA
Robert Ross , Argonne National Laboratory, USA
pp. 1-12

Parallel reduction to Hessenberg form with Algorithm-Based Fault Tolerance (Abstract)

Yulu Jia , University of Tennessee, Knoxville, USA
George Bosilca , University of Tennessee, Knoxville, USA
Piotr Luszczek , University of Tennessee, Knoxville, USA
Jack J. Dongarra , University of Tennessee, Knoxville, Oak Ridge National Laboratory and University of Manchester, USA
pp. 1-11

A computationally efficient algorithm for the 2D covariance method (Abstract)

Oded Green , Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Yitzhak Birk , Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
pp. 1-12

An improved parallel singular value algorithm and its implementation for multicore hardware (Abstract)

Azzam Haidar , Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA
Jakub Kurzak , Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA
Piotr Luszczek , Electrical Engineering and Computer Science, University of Tennessee, Knoxville, USA
pp. 1-12

Distributed-memory parallel algorithms for generating massive scale-free networks using preferential attachment model (Abstract)

Maksudul Alam , Department of Computer Science, Virginia Tech, Blacksburg, 24061, USA
Maleq Khan , NDSSL, Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, 24061, USA
Madhav V. Marathe , Department of Computer Science, Virginia Tech, Blacksburg, 24061, USA
pp. 1-12

On fast parallel detection of strongly connected components (SCC) in small-world graphs (Abstract)

Sungpack Hong , Oracle Labs, Redwood Shores, CA, USA
Nicole C. Rodia , Pervasive Parallelism Laboratory, Stanford University, CA, USA
Kunle Olukotun , Pervasive Parallelism Laboratory, Stanford University, CA, USA
pp. 1-11

Algorithms for high-throughput disk-to-disk sorting (Abstract)

Hari Sundar , The University of Texas at Austin, 78712, USA
Dhairya Malhotra , The University of Texas at Austin, 78712, USA
Karl W. Schulz , Texas Advanced Computing Center, Austin, 78712, USA
pp. 1-10

An early performance evaluation of many integrated core architecture based SGI Rackable computing system (Abstract)

Subhash Saini , NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Haoqiang Jin , NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Dennis Jespersen , NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Huiyu Feng , SGI, Fremont, CA 94538, USA
Jahed Djomehri , Computer Sciences Corporation, Moffett Field, CA 94035-1000, USA
William Arasin , Computer Sciences Corporation, Moffett Field, CA 94035-1000, USA
Robert Hood , Computer Sciences Corporation, Moffett Field, CA 94035-1000, USA
Piyush Mehrotra , NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
Rupak Biswas , NASA Ames Research Center, Moffett Field, CA 94035-1000, USA
pp. 1-12

Predicting application performance using supervised learning on communication features (Abstract)

Nikhil Jain , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Abhinav Bhatele , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, CA, USA
Michael P. Robson , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Todd Gamblin , Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, CA, USA
Laxmikant V. Kale , Department of Computer Science, University of Illinois at Urbana-Champaign, USA
pp. 1-12

Investigating applications portability with the Uintah DAG-based runtime system on petascale supercomputers (Abstract)

Qingyu Meng , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, 84112 USA
Alan Humphrey , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, 84112 USA
John Schmidt , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, 84112 USA
Martin Berzins , Scientific Computing and Imaging Institute, University of Utah, Salt Lake City, 84112 USA
pp. 1-12