The Omega test: A fast and practical integer programming algorithm for dependence analysis (Abstract)

Pointer target tracking an empirical study (PDF)

On-the-fly detection of data races for programs with nested fork-join parallelism (PDF)

Programming costs of explicit memory localization on a large scale shared memory multiprocessor (PDF)

A conflict-free memory design for multiprocessors (PDF)

A ultra fast Euclidean division algorithm for prime memory systems (PDF)

A high school supercomputing challenge (PDF)

Chaotic cardiac arrhythmias (PDF)

Compiler optimizations for Fortran D on MIMD distributed-memory machines (PDF)

Compile-time generation of regular communications patterns (PDF)

Tiling multidimensional iteration spaces for nonshared memory machines (PDF)

A new approach for automatic parallelization of blocked linear Algebra computations (PDF)

Wide format floating-point math libraries (PDF)

Distributing the comparison of DNA and protein sequences across heterogeneous supercomputers (PDF)

Panel: parallel computing in the undergraduate computer science curriculum (PDF)

A performance comparison of three supercomputers: Fujitsu VP-2600, NEC SX-3, and CRAY Y-MP (PDF)

The NAS parallel benchmarks summary and preliminary results (PDF)

Performance results for two of the NAS parallel benchmarks (PDF)

An effective on-chip preloading scheme to reduce data access penalty (PDF)

Using Lookahead to reduce memory bank contention for decoupled operand references (PDF)

Delayed consistency and its effects on the miss rate of parallel programs (PDF)

Architecture-independent scientific programming in data parallel C: three case studies (PDF)

Solution functions of PDEQSOL (Partial differential EQuation SOlver language) for fluid problems (PDF)

Computing turbulent flow in complex geometries on a massively parallel processor (PDF)

A lattice Boltzmann method for a two-dimensional viscous Burgers equation: computational results (PDF)

Distribution of a climate model across high-speed networks (PDF)

Retire Fortran? A debate rekindled (PDF)

Object oriented parallel programming: experiments and results (PDF)

High level support for divide-and-conquer parallelism (PDF)

Vector/parallel implementation of a porous media flow code (PDF)

High performance vector processing in reservoir simulation (PDF)

Seismic modeling at 14 gigaflops on the connection machine (PDF)

Gordon Bell prize lectures (PDF)

Compiler parallelization of an elliptic grid generator for 1990 Gordon Bell prize (PDF)

A new parallel architecture for sparse matrix computation based on finite projective geometries (PDF)

Time multiplexed optical computers (PDF)

Universal multistage networks via linear permutations (PDF)

Design and analysis of efficient hierarchical interconnection networks (PDF)

Alleviation of tree saturation in multistage interconnection networks (PDF)

An evaluation of automatic and interactive parallel programming tools (PDF)

Interprocedural transformations for parallel code generation (PDF)

Graphical development tools for network-based concurrent supercomputing (PDF)

Synthetic aperture radar image processing on parallel supercomputers (PDF)

The auditorialization of scientific information (PDF)

Parallel approaches to short range molecular dynamics simulations (PDF)

Visualizing the behavior of massively parallel programs (PDF)

Performance debugging shared memory multiprocessor programs with MTOOL (PDF)

Graphical animation of parallel Fortran programs (PDF)

Scheduling parallel programs with non-uniform parallelism profiles (PDF)

Intelligent mapping of communicating processes in distributed computing systems (PDF)

Load balancing by function distribution on the EM-4 prototype (PDF)

Gigascale integration (GSI) technology (PDF)

Exploration geophysics, parallel computing and reality (PDF)

Large scale reservoir simulation in the concurrent processing milieu (PDF)

Vectorizing C compilers: how good are they? (PDF)

Characterizing memory hot spots in a shared memory MIMD machine (PDF)

Input/output behavior of supercomputing applications (PDF)

PILS: an iterative linear solver package for ill-conditioned systems (PDF)

Threshold pivoting for dense LU factorization on distributed memory multiprocessors (PDF)

Factoring: a practical and robust method for scheduling parallel loops (PDF)

A fast static scheduling algorithm for DAGs on an unbounded number of processors (PDF)

Time-division optical communications in multiprocessor arrays (PDF)

Fully-adaptive routing: packet switching performance and wormhole algorithms (PDF)

Network-based multicomputers: an emerging parallel architecture (PDF)

Computing climate change: can we beat nature? (PDF)

Climate modeling with parallel vector supercomputers (PDF)

Computing modeling in a MIMD environment (PDF)

Ocean modeling on the connection machine (PDF)

An integrated memory management scheme for dynamic alias resolution (PDF)

MOVE: a framework for high-performance processor design (PDF)

A semantics-directed partitioning of a processor architecture (PDF)

Radix sort for vector multiprocessors (PDF)

A method of vector processing for shared symbolic data (PDF)

Optimal bounded-degree VLSI networks for sorting in a constant number of rounds (PDF)

“Whither massive parallelism?” (PDF)

An efficient parallel algorithm for all pairs examination (PDF)

Parallel power-of-two FFTs on hypercubes (PDF)

Analysis of replicated data algorithms on processor array architectures (PDF)

Design of a highly reliable cube-connected cycles architecture (PDF)

Three-dimensional finite-element analyses: implications for computer architectures (PDF)

Massively parallel computing and the mid-course tracking problem (PDF)

Measurement of memory access contentions in multiple vector processor systems (PDF)

Comparison and analysis of software and directory coherence schemes (PDF)

Performance prediction of distributed load balancing on multicomputer systems (PDF)

Efficient Doacross execution on distributed shared-memory multiprocessors (PDF)

Detecting redundant accesses to array data (PDF)

Effects of partitioning and scheduling sparse matrix factorization on communication and load balance (PDF)

Mass storage requirements in the intelligence community (PDF)

A virtual memory translation mechanism to support checkpoint and rollback recovery (PDF)

The K2 distributed memory parallel processor: architecture, compiler, and operating system (PDF)