Abstract is missing.
- Can Multithreaded Programming Save Massively Parallel Computing?Charles E. Leiserson. 2-3 [doi]
- Eliminating Stale Data References through Array Data-Flow AnalysisLynn Choi, Pen-Chung Yew. 4-13 [doi]
- Commutativity Analysis: A Technique for Automatically Parallelizing Pointer-Based ComputationsMartin C. Rinard, Pedro C. Diniz. 14-22 [doi]
- Profiling Dependence Vectors for Loop ParallelizationShaw-Yen Tseng, Chung-Ta King, Chuan Yi Tang. 23-27 [doi]
- A Method for Register Allocation to Loops in Multiple Register File ArchitecturesDavid J. Kolson, Alexandru Nicolau, Nikil D. Dutt, Ken Kennedy. 28-33 [doi]
- Affine-by-Statement Transformations of Imperfectly Nested LoopsJingling Xue. 34-38 [doi]
- The Combined Effectiveness of Unimodular Transformations, Tiling, and Software PrefetchingRafael H. Saavedra-Barrera, Weihua Mao, Daeyeon Park, Jacqueline Chame, Sungdo Moon. 39-45 [doi]
- Ocean Circulation on the Intel Paragon: Modeling and ImplementationKa-Cheong Leung, Ishfaq Ahmad, Hsiao-Ming Hsu. 47-54 [doi]
- Dynamic Alignment and Distribution of Irregularly Coupled Data Arrays for Scalable Parallelization of Particle-in-Cell ProblemsWei-keng Liao, Chao-Wei Ou, Sanjay Ranka. 57-61 [doi]
- A Hierarchical Parallel Processing System for the Multipass-Rendering MethodHiroaki Kobayashi, Hitoshi Yamauchi, Yuichiro Toh, Tadao Nakamura. 62-67 [doi]
- Performance Modeling and Composition: A Case Study in Cell SimulationSteve G. Steinberg, Jun Yang, Katherine A. Yelick. 68-74 [doi]
- A Study of High-Performance Communication Mechanism for Multicomputer SystemsHideki Murayama, Satoshi Yoshizawa, Takeshi Aimoto, Hidenori Inouchi, Shooichi Murase, Takehisa Hayashi, Hiroshi Iwamoto. 76-83 [doi]
- A TeraFLOP Supercomputer in 1996: The ASCI TFLOP SystemTimothy G. Mattson, David Scott, Stephen R. Wheat. 84-93 [doi]
- Experience with Parallel Computing on the AN2 NetworkDaniel J. Scales, Michael Burrows, Chandramohan A. Thekkath. 94-103 [doi]
- Achieving a Balanced Low-Cost Architecture for Mass Storage Management through Multiple Fast Ethernet Channels on the Beowulf Parallel WorkstationThomas L. Sterling, Donald J. Becker, Chance Reschke, Daniel Savarese, Michael R. Berry. 104-108 [doi]
- Exploiting the Capabilities of Communications Co-ProcessorsKlaus E. Schauser, Chris J. Scheiman, J. Mitchell Ferguson, Paul Z. Kolano. 109-115 [doi]
- Effects of Multithreading on Data and Workload Distribution for Distributed-Memory MultiprocessorsAndrew Sohn, Mitsuhisa Sato, Namhoon Yoo, Jean-Luc Gaudiot. 116-122 [doi]
- Formal Verification of Delayed Consistency ProtocolsFong Pong, Michel Dubois. 124-131 [doi]
- Dag-Consistent Distributed Shared MemoryRobert D. Blumofe, Matteo Frigo, Christopher F. Joerg, Charles E. Leiserson, Keith H. Randall. 132-141 [doi]
- Categorizing Network Traffic in Update-Based Protocols on Scalable MultiprocessorsRicardo Bianchini, Thomas J. LeBlanc, Jack E. Veenstra. 142-151 [doi]
- Implementing the Data Diffusion Machine Using Crossbar RoutersHenk L. Muller, Paul W. A. Stallard, David H. D. Warren. 152-158 [doi]
- A Memory Controller for Improved Performance of Streamed Computations on Symmetric MultiprocessorsSally A. McKee, William A. Wulf. 159-165 [doi]
- Kiloprocessor Extensions to SCIStefanos Kaxiras. 166-172 [doi]
- Approximate Compaction and Padded-Sorting on Exclusive Write PRAMsMiroslaw Kutylowski, Tomasz Wierzbicki. 174-181 [doi]
- A Parallel Solution to the Extended Set Union Problem with Unlimited BacktrackingMaria Cristina Pinotti, Vincenzo A. Crupi, Sajal K. Das. 182-186 [doi]
- A Parallel Algorithm for Minimization of Finite AutomataBala Ravikumar, X. Xiong. 187-191 [doi]
- A Randomized Algorithm for Voronoi Diagram of Line Segments on Coarse-Grained MultiprocessorsXiaotie Deng, Binhai Zhu. 192-198 [doi]
- Self-Timed Resynchronization: A Post-Optimization for Static Multiprocessor SchedulesShuvra S. Bhattacharyya, Sundararajan Sriram, Edward A. Lee. 199-205 [doi]
- Constructing the Spanners of Graphs in ParallelWeifa Liang, Richard P. Brent. 206-210 [doi]
- Converse: An Interoperable Framework for Parallel ProgrammingLaxmikant V. Kalé, Milind A. Bhandarkar, Narain Jagathesan, Sanjeev Krishnan, Josh Yelon. 212-217 [doi]
- Dome: Parallel Programming in a Distributed Computing EnvironmentJose Nagib Cotrim Árabe, Adam Beguelin, Bruce Lowekamp, Erik Seligman, Mike Starkey, Peter Stephan. 218-224 [doi]
- Nested Parallel Call OptimizationEnrico Pontelli, Gopal Gupta. 225-229 [doi]
- The Parallel Break Construct, or How to Kill an Activity TreeYair I. Friedman, Dror G. Feitelson, Iaakov Exman. 230-234 [doi]
- Optimizing COOP Languages: Study of a Protein Dynamics ProgramXingbin Zhang, Vijay Karamcheti, Tony Ng, Andrew A. Chien. 235-240 [doi]
- Support for Extensibility and Reusability in a Concurrent Object-Oriented Programming LanguageRaju Pandey, James C. Browne. 241-247 [doi]
- Modeling the Communication Performance of the IBM SP2Gheith A. Abandah, Edward S. Davidson. 249-257 [doi]
- Adaptive Source Routing in Multistage Interconnection NetworksYucel Aydogan, Craig B. Stunkel, Cevdet Aykanat, Bülent Abali. 258-267 [doi]
- The Effects of Network Contention on Processor Allocation StrategiesSherry Moore, Lionel M. Ni. 268-273 [doi]
- ServerNet Deadlock Avoidance and Fractahedral TopologiesRobert W. Horst. 274-280 [doi]
- Analysis of Memory Interference in Buffered Multiprocessor Systems in Presence of Hot Spots and Favorite MemoriesSajal K. Das, Sanjoy K. Sen. 281-285 [doi]
- Benefits of Processor Clustering in Designing Large Parallel Systems: When and How?Debashis Basak, Dhabaleswar K. Panda, Mohammad Banikazemi. 286-290 [doi]
- Practical Parallel Algorithms for Dynamic Data Redistribution, Median Finding, and SelectionDavid A. Bader, Joseph JáJá. 292-301 [doi]
- Parallel Implementation of Borvka s Minimum Spanning Tree AlgorithmSun Chung, Anne Condon. 302-308 [doi]
- Practical Algorithms for Selection on Coarse-Grained Parallel ComputersIbraheem Al-Furaih, Srinivas Aluru, Sanjay Goil, Sanjay Ranka. 309-313 [doi]
- Parallel Multilevel Graph PartitioningGeorge Karypis, Vipin Kumar. 314-319 [doi]
- PACK/UNPACK on Coarse-Grained Distributed Memory Parallel MachinesSeungjo Bae, Sanjay Ranka. 320-324 [doi]
- Resource Placement in Torus-Based NetworksMyung M. Bae, Bella Bose. 327-331 [doi]
- Simultaneous Compression of Makespan and Number of Processors Using CRPYiqun Ge, David Y. Y. Yun. 332-338 [doi]
- Implementation of Scalable Blocking Locks Using an Adaptive Thread SchedulerBodhisattwa Mukherjee, Karsten Schwan. 339-343 [doi]
- Hector: Automated Task Allocation for MPISamuel H. Russ, Brian K. Flachs, Jonathan Robinson, Bjørn Heckel. 344-348 [doi]
- An Adaptive Approach to Data PlacementDavid K. Lowenthal, Gregory R. Andrews. 349-353 [doi]
- Complete Parallelization of Computations: Integration of Data Partitioning and Functional Parallelism for Dynamic Data StructuresDwip Banerjee, James C. Browne. 354-360 [doi]
- MPPs versus ClustersCharles L. Seitz. 362 [doi]
- Generating Realignment-Based Communication for HPF ProgramsTsunehiko Kamachi, Kazuhiro Kusano, Kenji Suehiro, Yoshiki Seo. 364-371 [doi]
- Software Support for Virtual Memory-Mapped CommunicationCezary Dubnicki, Liviu Iftode, Edward W. Felten, Kai Li. 372-281 [doi]
- How to Optimize Residual Communications?Michèle Dion, Cyril Randriamaro, Yves Robert. 382-391 [doi]
- A Comparative Study of Methods for Time-Deterministic Message Delivery in a Multiprocessor ArchitectureJan Jonsson, Jonas Vasell. 392-398 [doi]
- ECO: Efficient Collective Operations for Communication on Heterogeneous NetworksBruce Lowekamp, Adam Beguelin. 399-405 [doi]
- Software Techniques for Improving MPP Bulk-Transfer PerformanceEric A. Brewer, Paul Gauthier, Armando Fox, Angela Schuett. 406-412 [doi]
- Parallel Algorithms for Image Enhancement and Segmentation by Region Growing with an Experimental StudyDavid A. Bader, Joseph JáJá, David Harwood, Larry S. Davis. 414-423 [doi]
- The Chessboard Distance Transform and the Medial Axis Transform are InterchangeableYu-Hua Lee, Shi-Jinn Horng. 424-428 [doi]
- Parallel Algorithms for Image Processing: Practical Algorithms with ExperimentsArmin Bäumker, Wolfgang Dittrich. 429-433 [doi]
- Study of Scalable Declustering Algorithms for Parallel Grid FilesBongki Moon, Anurag Acharya, Joel H. Saltz. 434-440 [doi]
- A Parallel Algorithm for Text InferenceSanda M. Harabagiu, Dan I. Moldovan. 441-445 [doi]
- Efficient Execution of Parallel Applications in Multiprogrammed Multiprocessor SystemsKelvin K. Yue, David J. Lilja. 448-456 [doi]
- The Relation of Scalability and Execution TimeXian-He Sun. 457-462 [doi]
- Maximizing Speedup through Self-Tuning of Processor AllocationThu D. Nguyen, Raj Vaswani, John Zahorjan. 463-468 [doi]
- Profiling Optimized Code: A Profiling System for an HPF CompilerShaun Kaneshiro, Tatsuya Shindo. 469-473 [doi]
- Toward Symbolic Performance Prediction of Parallel ProgramsThomas Fahringer. 474-478 [doi]
- Performance Prediction with BenchmapsSivan Toledo. 479-485 [doi]
- IBM System/390 Division: Overview of IBM System/390 Parallel Sysplex - A Commercial Parallel Processing SystemJeffrey M. Nick, Jen-Yao Chung, Nicholas S. Bowen. 488-495 [doi]
- Litton Guidance and Control Systems, Inc.: Implementing Parallel Processing in a Rugged Embeddable EnvironmentAlan L. Smeyne. 496-501 [doi]
- Mercury Computer Systems, Inc.: Planned Direct Transfers: A Programming Model for Real-Time ApplicationsGerard Vichniac, Barry Isenstein, Craig Lund, Arlan Pool. 502-505 [doi]
- Centre for Development of Advanced Computing: DS-Link over Fiber: A High-Speed Interconnect for Cluster ComputingYogindra Abhyankar, Anil Degwekar, Abhay Karandikar. 507-511 [doi]
- Electronics and Telecommunications Research Institute: A Multiprocessor Server with a New Highly Pipelined BusWoo-Jong Hahn, Ando Ki, Kee-Wook Rim, Soo-Won Kim. 512-517 [doi]
- Tandem Computers Incorporated: Performance Modeling of ServerNet:::TM::: TopologiesRobert W. Horst, Doug Jewett, William J. Watson, L. Young, Dimiter R. Avresky, R. Wilkinson, Chris M. Cunningham. 518-523 [doi]
- CoCheck: Checkpointing and Process Migration for MPIGeorg Stellner. 526-531 [doi]
- Tulip: A Portable Run-Time System for Object-Parallel SystemsPeter H. Beckman, Dennis Gannon. 532-536 [doi]
- A Virtual Memory Model for Parallel SupercomputersVeronica L. M. Reis, Isaac D. Scherson. 537-543 [doi]
- A Partitioning Programming Environment for a Novel Parallel ArchitectureReiner W. Hartenstein, Jürgen Becker, Michael Herz, Rainer Kress, Ulrich Nageldinger. 544-548 [doi]
- An Integrated Synchronization and Consistency Protocol for the Implementation of a High-Level Parallel Programming LanguageMartin C. Rinard. 549-553 [doi]
- Implementation and Evaluation of Prefetching in the Intel Paragon Parallel File SystemMeenakshi Arunachalam, Alok N. Choudhary, Brad Rullman. 554-559 [doi]
- Routing a Permutation in the Hypercube by Two Sets of Edge-Disjoint PathsQian-Ping Gu, Hisao Tamaki. 561-567 [doi]
- Determining Asynchronous Acyclic Pipeline Execution TimesVal Donaldson, Jeanne Ferrante. 568-572 [doi]
- Distributing Tokens on a Hypercube without Error AccumulationBogdan S. Chlebus, José D. P. Rolim, Giora Slutzki. 573-578 [doi]
- On Some Global Operations in Faulty SIMD HypercubesAmit Sengupta, C. S. Raghavendra. 579-583 [doi]
- An Improved Approximation Algorithm for Scheduling Task Trees on Linear ArraysHari Krishna Tadepalli, Errol L. Lloyd. 584-590 [doi]
- Jacobi-like Algorithms for Eigenvalue Decomposition of a Real Normal Matrix Using Real ArithmeticBing Bing Zhou, Richard P. Brent. 593-600 [doi]
- An Element-Based Concurrent Partitioner for Unstructured Finite Element MeshesHong Q. Ding, Robert D. Ferraro. 601-605 [doi]
- Analysis of the Numerical Effects of Parallelism on a Parallel Genetic AlgorithmWilliam E. Hart, Scott B. Baden, Richard K. Belew, Scott R. Kohn. 606-612 [doi]
- Compiling MATLAB Programs to ScaLAPACK: Exploiting Task and Data ParallelismShankar Ramaswamy, Eugene W. Hodges IV, Prithviraj Banerjee. 613-619 [doi]
- Mapping Techniques for Parallel Evaluation of Chains of RecurrencesEugene V. Zima, Karthi R. Vadivelu, Thomas L. Casavant. 620-624 [doi]
- Performance of Asynchronous Linear Iterations with Random DelaysAdrian Moga, Michel Dubois. 625-629 [doi]
- For a Massive Number of Massively Parallel Machines: What are the Target Applications, Who are the Target Users, and What New R&D is Needed to Hit the Target?William M. Farmer, Richard F. Freund, Mark Furtney, Paul Messina, Lionel M. Ni, Charles L. Seitz, Marc Snir. 631-634 [doi]
- Clusters for Commercial Computing: An Invisible ArchitectureGregory F. Pfister. 636 [doi]
- Generic Methodologies for Deadlock-Free RoutingHyunmin Park, Dharma P. Agrawal. 638-643 [doi]
- Partitionability of the Multistage Interconnection NetworksYeimkuan Chang. 644-649 [doi]
- On Embedding Various Networks into the Hypercube Using Matrix TransformationsMounir Hamdi, Siang W. Song. 650-654 [doi]
- Optimal Subcube Fault Tolerance in a Circuit-Switched HypercubeBaback A. Izadi, Füsun Özgüner. 655-659 [doi]
- Fault-Tolerant Ring Embedding in Star GraphsYu-Chee Tseng, Shu-Hui Chang, Jang-Ping Sheu. 660-665 [doi]
- An Optical Interconnect Model for k-ary n-cube Wormhole NetworksMongkol Raksapatcharawong, Timothy Mark Pinkston. 666-672 [doi]
- Fault-Tolerant Multiple Bus Networks for Fan-In AlgorithmsRamachandran Vaidyanathan, Sudharani Nadella. 674-681 [doi]
- Coping with Sparse Inputs on Enhanced Meshes - Semigroup Computation with COMMON CRCW BusesPeter Damaschke. 682-686 [doi]
- An Optimal Algorithm for the Angle-Restricted All Nearest Neighbor Problem on the ReconfigurableKoji Nakano, Stephan Olariu. 687-691 [doi]
- Parallel Algorithms Using Unreliable BroadcastsJohn Matthews, Charles U. Martel. 692-696 [doi]
- Efficient Algorithms for the Hough Transform on Arrays with Reconfigurable Optical BusesSandy Pavel, Selim G. Akl. 697-701 [doi]
- Integer and Floating Point Matrix-Vector Multiplication on the Reconfigurable MeshJerry L. Trahan, Chun-ming Lu, Ramachandran Vaidyanathan. 702-706 [doi]
- Some Image Processing Algorithms on a RAP with Wider Bus NetworksShung-Shing Lee, Shi-Jinn Horng, Horng-Ren Tsai, Yu-Hua Lee. 708-715 [doi]
- Parallel Synthetic Aperture Radar Processing on Workstation NetworksPeter G. Meisl, Mabo Robert Ito, Ian G. Cumming. 716-723 [doi]
- The Evolution of a Massively Parallel Vision System for Real-Time Automotive Image ProcessingAlberto Broggi. 724-728 [doi]
- 2D Object Recognition on a Reconfigurable MeshConcettina Guerra. 729-733 [doi]
- Space-Time Adaptive Processing on the Mesh Synchronous ProcessorJanice S. McMahon, Ken Teitelbaum. 734-740 [doi]
- An Experimental Study of Input/Output Characteristics of NASA Earth and Space Sciences ApplicationsMichael R. Berry, Tarek A. El-Ghazawi. 741-747 [doi]
- Bitonic Sorting on Bene NetworksBeverly Gocal. 749-753 [doi]
- Designing Adaptable Real-Time Fault-Tolerant Parallel SystemsCélio Estevan Morón. 754-758 [doi]
- Improving Memory Performance for Indirect Accesses on SIMD ComputersJames D. Allen, David E. Schimmel. 759-765 [doi]
- A New Approach to Pipeline FFT ProcessorShousheng He, Mats Torkelson. 766-770 [doi]
- Implementation of a SliM Array ProcessorHyun M. Chang, Myung Hoon Sunwoo, Tai-Hoon Cho. 771-775 [doi]
- Temporal Characterization of Demands for Data Movement on Parallel ProgramsBernardo Rodriguez, Harry F. Jordan, Gita Alaghband. 776-779 [doi]
- Broadcasting Multiple Messages in the Multiport ModelAmotz Bar-Noy, Ching-Tien Ho. 781-788 [doi]
- The Necessary Conditions for Clos-Type Nonblocking Multicast NetworksYuanyuan Yang, Gerald M. Masson. 789-795 [doi]
- A Class of Interconnection Networks for MulticastingYuanyuan Yang. 796-802 [doi]
- Performance Prediction of PVM ProgramsMichael R. Steed, Mark J. Clement. 803-807 [doi]
- Algorithms for All-to-All Personalized Exchange in 2D and 3D ToriYoung-Joo Suh, Sudhakar Yalamanchili. 808-814 [doi]
- Generalized Theory for Deadlock-Free Adaptive Wormhole Routing and its Application to Disha ConcurrentAnjan K. Venkatramani, Timothy Mark Pinkston, José Duato. 815-821 [doi]
- Efficient Run-Time Support for Irregular Task Computations with Mixed GranularitiesCong Fu, Tao Yang. 823-830 [doi]
- A New Technique for 3-D Domain Decomposition on Multicomputers which Reduces Message-PassingJoseph Gil, Alan S. Wagner. 831-835 [doi]
- Application Load Imbalance on Parallel ProcessorsVasudha Govindan, Mark A. Franklin. 836-842 [doi]
- Native ATM Application Programmer Interface Testbed for Cluster-Based ComputingPatrick W. Dowd, Todd M. Carrozzi, Frank A. Pellegrino, Amy Xin Chen. 843-849 [doi]
- SWEB: Towards a Scalable World Wide Web Server on MulticomputersDaniel Andresen, Tao Yang, Vegard Holmedahl, Oscar H. Ibarra. 850-856 [doi]
- Parallel Implementations of Irregular Problems Using High-Level Actor LanguageR. Panwar, W. Kim, Gul Agha. 857-862 [doi]
- Implementation of an Automatic Semi-Fluid Motion Analysis Algorithm on a Massively Parallel ComputerKannappan Palaniappan, Mohammad Faisal, Chandra Kambhamettu, A. Frederick Haslert. 864-877 [doi]
- NAS Experiences of Porting CM Fortran Codes to on IBM SP2 and SGI Power ChallengeSubhash Saini. 878-880 [doi]
- Random Seeking: A General, Efficient, and Informed Randomized Scheme for Dynamic Load BalancingNihar R. Mahapatra, Shantanu Dutt. 881-885 [doi]
- A Direct Block-Five-Diagonal System Solver for the VLSI Parallel ModelMarián Vajtersic. 886-890 [doi]
- Mapping Linear Recurrences onto Systolic ArraysLadan Kazerouni, Basant Rajan, R. K. Shyamasundar. 891-897 [doi]