- Speeding up the Memory Hierarchy in Flat COMA Multiprocessors. Liuxi Yang, Josep Torrellas. 4-13
- Reducing the Replacement Overhead in Bus-Based COMA Multiprocessors. Fredrik Dahlgren, Anders Landin. 14-23
- Datapath Design for a VLIW Video Signal Processor. Andrew Wolfe, Jason Fritts, Santanu Dutta, Edil S. Tavares Fernandes. 24
- Distributed Path Reservation Algorithms for Multiplexed All-Optical Interconnection Networks. Xin Yuan, Rami G. Melhem, Rajiv Gupta. 38-47
- Multicast on Irregular Switch-Based Networks with Wormhole Routing. Ram Kesavan, Kiran Bondalapati, Dhabaleswar K. Panda. 48-57
- A Performance Comparison of Hierarchical Ring- and Mesh-Connected Multiprocessor Networks. Govindan Ravindran, Michael Stumm. 58
- The Impact of Instruction-Level Parallelism on Multiprocessor Performance and Simulation Methodology. Vijay S. Pai, Parthasarathy Ranganathan, Sarita V. Adve. 72-83
- Architectural Support for Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results. David I. August, Daniel A. Connors, John C. Gyllenhaal, Wen-mei W. Hwu. 84-93
- Multiple Branch and Block Prediction. Steven Wallace, Nader Bagherzadeh. 94
- Evaluating MPI Collective Communication on the SP2, T3D, and Paragon Multicomputers. Kai Hwang, Choming Wang, Cho-Li Wang. 106-115
- Message Proxies for Efficient, Protected Communication on SMP Clusters. Beng-Hong Lim, Philip Heidelberger, Pratap Pattnaik, Marc Snir. 116-127
- Scheduling Communication on a SMP Node Parallel Machine. Babak Falsafi, David A. Wood. 128
- Design Issues and Tradeoffs for Write Buffers. Kevin Skadron, Douglas W. Clark. 144-155
- Software-Managed Address Translation. Bruce L. Jacob, Trevor N. Mudge. 156-167
- Global Address Space, Non-Uniform Bandwidth: A Memory System Performance Characterization of Parallel Systems. Thomas Stricker, Thomas R. Gross. 168
- On the Use and Performance of Explicit Communication Primitives in Cache-Coherent Multiprocessor Systems. Xiaohan Qin, Jean-Loup Baer. 182-193
- Reducing the Communication Overhead of Dynamic Applications on Shared Memory Multiprocessors. Anand Sivasubramaniam. 194-203
- Control Flow Speculation in Multiscalar Processors. Quinn Jacobson, Steve Bennett, Nikhil Sharma, James E. Smith. 218-229
- Advances of the Counterflow Pipeline Microarchitecture. Kenneth J. Janik, Shih-Lien Lu, Michael F. Miller. 230-236
- Multithreaded Vector Architectures. Roger Espasa, Mateo Valero. 237
- The Memory Performance of DSS Commercial Workloads in Shared-Memory Multiprocessors. Pedro Trancoso, Josep-Lluis Larriba-Pey, Zheng Zhang, Josep Torrellas. 250-260
- Reducing Remote Conflict Misses: NUMA with Remote Cache versus COMA. Zheng Zhang, Josep Torrellas. 272
- Performance Characterization of the Pentium(r) Pro Processor. Dileep Bhandarkar, Jianxun Jason Ding. 288-299
- A Framework for Statistical Modeling of Superscalar Processor Performance. Derek B. Noonburg, John Paul Shen. 298-309
- Towards a Communication Characterization Methodology for Parallel Applications. Sucheta Chodnekar, Viji Srinivasan, Aniruddha S. Vaidya, Anand Sivasubramaniam, Chita R. Das. 310
- User-Level DMA without Operating System Kernel Modification. Evangelos P. Markatos, Manolis Katevenis. 322-331
- ATM and Fast Ethernet Network Interfaces for User-Level Communication. Matt Welsh, Anindya Basu, Thorsten von Eicken. 332-342
- Architectural Support for Reducing Communication Overhead in Multiprocessor Interconnection Networks. Binh Vien Dao, Sudhakar Yalamanchili, José Duato. 343-352