Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, PACT'96, Boston, MA, USA, October 20-23, 1996 - researchr publication

researchr

You are not signed in
Sign in
Sign up

Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, PACT'96, Boston, MA, USA, October 20-23, 1996. IEEE Computer Society, 1996. [doi]

Conference: IEEEpact1996

Abstract is missing.

Nomadic Threads: a migrating multithreaded approach to remote memory accesses in multiprocessorsStephen Jenks, Jean-Luc Gaudiot. 2-11 [doi]

Compiling C for the EARTH multithreaded architectureLaurie J. Hendren, Xinan Tang, Yingchun Zhu, Guang R. Gao, Xun Xue, Haiying Cai, Pierre Ouellet. 12-23 [doi]

Performance and hardware complexity tradeoffs in designing multithreaded architecturesMichael Bekerman, Avi Mendelson, Gad Sheaffer. 24-34 [doi]

The superthreaded architecture: thread pipelining with run-time data dependence checking and control speculationJenn-Yuan Tsai, Pen-Chung Yew. 35-46 [doi]

Improving branch prediction accuracy by reducing pattern history table interferencePo-Yung Chang, Marius Evers, Yale N. Patt. 48-57 [doi]

The effects of mispredicted-path execution on branch prediction structuresStéphan Jourdan, Tse-Hao Hsing, Jared Stark, Yale N. Patt. 58-67 [doi]

Improving the effectiveness of software prefetching with adaptive executionsRafael H. Saavedra, Daeyeon Park. 68-78 [doi]

Swing module scheduling: a lifetime-sensitive approachJosep Llosa, Antonio González, Eduard Ayguadé, Mateo Valero. 80-86 [doi]

An efficient, global resource-directed approach to exploiting instruction-level parallelismSteven Novack, Alexandru Nicolau. 87-96 [doi]

The design of a modulo scheduler for a superscalar RISC processorP. Tinumalai, Boris Beylin, Krishna Subramanian. 97-109 [doi]

Using the parallel complexity of programs to improve compactionMarc Pouzet. 111-115 [doi]

Multithread execution mechanisms on RICA-1 for massively parallel computationKazuaki Okamoto, Shuichi Sakai, Hiroshi Matsuoka, Takashi Yokota, Hideo Hirono. 116-121 [doi]

I-Structure Software Cache: a split-phase transaction runtime cache systemWen-Yen Lin, Jean-Luc Gaudiot. 122-126 [doi]

A heuristic approach for finding a solution to the constant-degree parallelism alignment problemClaude G. Diderich, Marc Gengler. 127-132 [doi]

Identifying the capability of overlapping computation with communicationAndrew Sohn, Jui Ku, Yuetsu Kodama, Mitsuhisa Sato, Hirofumi Sakane, Hayato Yamana, Shuichi Sakai, Yoshinori Yamaguchi. 133-138 [doi]

Address generation of dataflow fine-grain parallel data-structures on a distributed-memory computerShigeru Kusakabe, Taku Nagai, Kentaro Inenaga, Makoto Amamiya. 139-143 [doi]

Elastic-plastic flow simulation using the Supercomputer ToolkitAlexander Goikhman, Jacob Katzenelson. 144-149 [doi]

Managing the computing space in the mpC compilerDmitry Arapov, Alexey Kalinov, Alexey L. Lastovetsky. 150-155 [doi]

A robust compile time method for scheduling task parallelism on distributed memory machinesSekhar Darbha, Santosh Pande. 156-162 [doi]

A fine-grain multithreading superscalar architectureM. Loikkanen, Nader Bagherzadeh. 163-168 [doi]

Branch prediction and simultaneous multithreadingSébastien Hily, André Seznec. 169-173 [doi]

A compiler transformation to improve memory access time in SIMD systemsMayez A. Al-Mouhamed, Lubomir F. Bic, Husam Abu-Haimed. 174-178 [doi]

A scalable register file architecture for dynamically scheduled processorsSteven Wallace, Nader Bagherzadeh. 179-184 [doi]

Dynamic parallelization of modifications to directed acyclic graphsLorenz Huelsbergen. 186-197 [doi]

Performance tuning scientific codes for dataflow executionAndrew Shaw, Arvind, R. Paul Johnson. 198-207 [doi]

Bulk Synchronous Parallel: practical experience with a model for parallel computingDanny Krizanc, Anton Saarimaki. 208-217 [doi]

Implementation techniques for a parallel relative debuggerDavid Abramson, Rok Sosic, Greg Watson. 218-226 [doi]

Loop induction variable canonicalization in parallelizing compilersShin-Ming Liu, Raymond Lo, Fred C. Chow. 228-237 [doi]

Combining optimization for cache and instruction-level parallelismSteve Carr. 238-247 [doi]

A compiler algorithm to reduce invalidation latency in virtual shared memory systemsMichael F. P. O'Boyle, Andy Nisbet, Rupert W. Ford. 248-257 [doi]

Adaptive granularity: transparent integration of fine- and coarse-grain communicationDaeyeon Park, Rafael H. Saavedra. 260-268 [doi]

Automatic partitioning of signal processing programs for symmetric multiprocessorsChris J. Newburn, John Paul Shen. 269-280 [doi]

Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphsAlain Darte, Frédéric Vivien. 281-291 [doi]

The compiler TwoL for the design of parallel implementationsThomas Rauber, Gudula Rünger. 292-301 [doi]

runs on WebDSL