Proceedings of IPPS 96, The 10th International Parallel Processing Symposium, April 15-19, 1996, Honolulu, Hawaii, USA

Proceedings of IPPS 96, The 10th International Parallel Processing Symposium, April 15-19, 1996, Honolulu, Hawaii, USA. IEEE Computer Society, 1996.

Conference: ipps1996

Can Multithreaded Programming Save Massively Parallel Computing?Charles E. Leiserson. 2-3 [doi]

Eliminating Stale Data References through Array Data-Flow AnalysisLynn Choi, Pen-Chung Yew. 4-13 [doi]

Commutativity Analysis: A Technique for Automatically Parallelizing Pointer-Based ComputationsMartin C. Rinard, Pedro C. Diniz. 14-22 [doi]

Profiling Dependence Vectors for Loop ParallelizationShaw-Yen Tseng, Chung-Ta King, Chuan Yi Tang. 23-27 [doi]

A Method for Register Allocation to Loops in Multiple Register File ArchitecturesDavid J. Kolson, Alexandru Nicolau, Nikil D. Dutt, Ken Kennedy. 28-33 [doi]

Affine-by-Statement Transformations of Imperfectly Nested LoopsJingling Xue. 34-38 [doi]

The Combined Effectiveness of Unimodular Transformations, Tiling, and Software PrefetchingRafael H. Saavedra-Barrera, Weihua Mao, Daeyeon Park, Jacqueline Chame, Sungdo Moon. 39-45 [doi]

Ocean Circulation on the Intel Paragon: Modeling and ImplementationKa-Cheong Leung, Ishfaq Ahmad, Hsiao-Ming Hsu. 47-54 [doi]

Dynamic Alignment and Distribution of Irregularly Coupled Data Arrays for Scalable Parallelization of Particle-in-Cell ProblemsWei-keng Liao, Chao-Wei Ou, Sanjay Ranka. 57-61 [doi]

A Hierarchical Parallel Processing System for the Multipass-Rendering MethodHiroaki Kobayashi, Hitoshi Yamauchi, Yuichiro Toh, Tadao Nakamura. 62-67 [doi]

Performance Modeling and Composition: A Case Study in Cell SimulationSteve G. Steinberg, Jun Yang, Katherine A. Yelick. 68-74 [doi]

A Study of High-Performance Communication Mechanism for Multicomputer SystemsHideki Murayama, Satoshi Yoshizawa, Takeshi Aimoto, Hidenori Inouchi, Shooichi Murase, Takehisa Hayashi, Hiroshi Iwamoto. 76-83 [doi]

A TeraFLOP Supercomputer in 1996: The ASCI TFLOP SystemTimothy G. Mattson, David Scott, Stephen R. Wheat. 84-93 [doi]

Experience with Parallel Computing on the AN2 NetworkDaniel J. Scales, Michael Burrows, Chandramohan A. Thekkath. 94-103 [doi]

Achieving a Balanced Low-Cost Architecture for Mass Storage Management through Multiple Fast Ethernet Channels on the Beowulf Parallel WorkstationThomas L. Sterling, Donald J. Becker, Chance Reschke, Daniel Savarese, Michael R. Berry. 104-108 [doi]

Exploiting the Capabilities of Communications Co-ProcessorsKlaus E. Schauser, Chris J. Scheiman, J. Mitchell Ferguson, Paul Z. Kolano. 109-115 [doi]

Effects of Multithreading on Data and Workload Distribution for Distributed-Memory MultiprocessorsAndrew Sohn, Mitsuhisa Sato, Namhoon Yoo, Jean-Luc Gaudiot. 116-122 [doi]

Formal Verification of Delayed Consistency ProtocolsFong Pong, Michel Dubois. 124-131 [doi]

Dag-Consistent Distributed Shared MemoryRobert D. Blumofe, Matteo Frigo, Christopher F. Joerg, Charles E. Leiserson, Keith H. Randall. 132-141 [doi]

Categorizing Network Traffic in Update-Based Protocols on Scalable MultiprocessorsRicardo Bianchini, Thomas J. LeBlanc, Jack E. Veenstra. 142-151 [doi]

Implementing the Data Diffusion Machine Using Crossbar RoutersHenk L. Muller, Paul W. A. Stallard, David H. D. Warren. 152-158 [doi]

A Memory Controller for Improved Performance of Streamed Computations on Symmetric MultiprocessorsSally A. McKee, William A. Wulf. 159-165 [doi]

Kiloprocessor Extensions to SCIStefanos Kaxiras. 166-172 [doi]

Approximate Compaction and Padded-Sorting on Exclusive Write PRAMsMiroslaw Kutylowski, Tomasz Wierzbicki. 174-181 [doi]

A Parallel Solution to the Extended Set Union Problem with Unlimited BacktrackingMaria Cristina Pinotti, Vincenzo A. Crupi, Sajal K. Das. 182-186 [doi]

A Parallel Algorithm for Minimization of Finite AutomataBala Ravikumar, X. Xiong. 187-191 [doi]

A Randomized Algorithm for Voronoi Diagram of Line Segments on Coarse-Grained MultiprocessorsXiaotie Deng, Binhai Zhu. 192-198 [doi]

Self-Timed Resynchronization: A Post-Optimization for Static Multiprocessor SchedulesShuvra S. Bhattacharyya, Sundararajan Sriram, Edward A. Lee. 199-205 [doi]

Constructing the Spanners of Graphs in ParallelWeifa Liang, Richard P. Brent. 206-210 [doi]

Converse: An Interoperable Framework for Parallel ProgrammingLaxmikant V. Kalé, Milind A. Bhandarkar, Narain Jagathesan, Sanjeev Krishnan, Josh Yelon. 212-217 [doi]

Dome: Parallel Programming in a Distributed Computing EnvironmentJose Nagib Cotrim Árabe, Adam Beguelin, Bruce Lowekamp, Erik Seligman, Mike Starkey, Peter Stephan. 218-224 [doi]

Nested Parallel Call OptimizationEnrico Pontelli, Gopal Gupta. 225-229 [doi]

The Parallel Break Construct, or How to Kill an Activity TreeYair I. Friedman, Dror G. Feitelson, Iaakov Exman. 230-234 [doi]

Optimizing COOP Languages: Study of a Protein Dynamics ProgramXingbin Zhang, Vijay Karamcheti, Tony Ng, Andrew A. Chien. 235-240 [doi]

Support for Extensibility and Reusability in a Concurrent Object-Oriented Programming LanguageRaju Pandey, James C. Browne. 241-247 [doi]

Modeling the Communication Performance of the IBM SP2Gheith A. Abandah, Edward S. Davidson. 249-257 [doi]

Adaptive Source Routing in Multistage Interconnection NetworksYucel Aydogan, Craig B. Stunkel, Cevdet Aykanat, Bülent Abali. 258-267 [doi]

The Effects of Network Contention on Processor Allocation StrategiesSherry Moore, Lionel M. Ni. 268-273 [doi]

ServerNet Deadlock Avoidance and Fractahedral TopologiesRobert W. Horst. 274-280 [doi]

Analysis of Memory Interference in Buffered Multiprocessor Systems in Presence of Hot Spots and Favorite MemoriesSajal K. Das, Sanjoy K. Sen. 281-285 [doi]

Benefits of Processor Clustering in Designing Large Parallel Systems: When and How?Debashis Basak, Dhabaleswar K. Panda, Mohammad Banikazemi. 286-290 [doi]

Practical Parallel Algorithms for Dynamic Data Redistribution, Median Finding, and SelectionDavid A. Bader, Joseph JáJá. 292-301 [doi]

Parallel Implementation of Borůvka's Minimum Spanning Tree AlgorithmSun Chung, Anne Condon. 302-308 [doi]

Practical Algorithms for Selection on Coarse-Grained Parallel ComputersIbraheem Al-Furaih, Srinivas Aluru, Sanjay Goil, Sanjay Ranka. 309-313 [doi]

Parallel Multilevel Graph PartitioningGeorge Karypis, Vipin Kumar. 314-319 [doi]

PACK/UNPACK on Coarse-Grained Distributed Memory Parallel MachinesSeungjo Bae, Sanjay Ranka. 320-324 [doi]

Resource Placement in Torus-Based NetworksMyung M. Bae, Bella Bose. 327-331 [doi]

Simultaneous Compression of Makespan and Number of Processors Using CRPYiqun Ge, David Y. Y. Yun. 332-338 [doi]

Implementation of Scalable Blocking Locks Using an Adaptive Thread SchedulerBodhisattwa Mukherjee, Karsten Schwan. 339-343 [doi]

Hector: Automated Task Allocation for MPISamuel H. Russ, Brian K. Flachs, Jonathan Robinson, Bjørn Heckel. 344-348 [doi]

An Adaptive Approach to Data PlacementDavid K. Lowenthal, Gregory R. Andrews. 349-353 [doi]

Complete Parallelization of Computations: Integration of Data Partitioning and Functional Parallelism for Dynamic Data StructuresDwip Banerjee, James C. Browne. 354-360 [doi]

MPPs versus ClustersCharles L. Seitz. 362 [doi]

Generating Realignment-Based Communication for HPF ProgramsTsunehiko Kamachi, Kazuhiro Kusano, Kenji Suehiro, Yoshiki Seo. 364-371 [doi]

Software Support for Virtual Memory-Mapped CommunicationCezary Dubnicki, Liviu Iftode, Edward W. Felten, Kai Li. 372-381 [doi]

How to Optimize Residual Communications?Michèle Dion, Cyril Randriamaro, Yves Robert. 382-391 [doi]

A Comparative Study of Methods for Time-Deterministic Message Delivery in a Multiprocessor ArchitectureJan Jonsson, Jonas Vasell. 392-398 [doi]

ECO: Efficient Collective Operations for Communication on Heterogeneous NetworksBruce Lowekamp, Adam Beguelin. 399-405 [doi]

Software Techniques for Improving MPP Bulk-Transfer PerformanceEric A. Brewer, Paul Gauthier, Armando Fox, Angela Schuett. 406-412 [doi]

Parallel Algorithms for Image Enhancement and Segmentation by Region Growing with an Experimental StudyDavid A. Bader, Joseph JáJá, David Harwood, Larry S. Davis. 414-423 [doi]

The Chessboard Distance Transform and the Medial Axis Transform are InterchangeableYu-Hua Lee, Shi-Jinn Horng. 424-428 [doi]

Parallel Algorithms for Image Processing: Practical Algorithms with ExperimentsArmin Bäumker, Wolfgang Dittrich. 429-433 [doi]

Study of Scalable Declustering Algorithms for Parallel Grid FilesBongki Moon, Anurag Acharya, Joel H. Saltz. 434-440 [doi]

A Parallel Algorithm for Text InferenceSanda M. Harabagiu, Dan I. Moldovan. 441-445 [doi]

Efficient Execution of Parallel Applications in Multiprogrammed Multiprocessor SystemsKelvin K. Yue, David J. Lilja. 448-456 [doi]

The Relation of Scalability and Execution TimeXian-He Sun. 457-462 [doi]

Maximizing Speedup through Self-Tuning of Processor AllocationThu D. Nguyen, Raj Vaswani, John Zahorjan. 463-468 [doi]

Profiling Optimized Code: A Profiling System for an HPF CompilerShaun Kaneshiro, Tatsuya Shindo. 469-473 [doi]

Toward Symbolic Performance Prediction of Parallel ProgramsThomas Fahringer. 474-478 [doi]

Performance Prediction with BenchmapsSivan Toledo. 479-485 [doi]

IBM System/390 Division: Overview of IBM System/390 Parallel Sysplex - A Commercial Parallel Processing SystemJeffrey M. Nick, Jen-Yao Chung, Nicholas S. Bowen. 488-495 [doi]

Litton Guidance and Control Systems, Inc.: Implementing Parallel Processing in a Rugged Embeddable EnvironmentAlan L. Smeyne. 496-501 [doi]

Mercury Computer Systems, Inc.: Planned Direct Transfers: A Programming Model for Real-Time ApplicationsGerard Vichniac, Barry Isenstein, Craig Lund, Arlan Pool. 502-505 [doi]

Centre for Development of Advanced Computing: DS-Link over Fiber: A High-Speed Interconnect for Cluster ComputingYogindra Abhyankar, Anil Degwekar, Abhay Karandikar. 507-511 [doi]

Electronics and Telecommunications Research Institute: A Multiprocessor Server with a New Highly Pipelined BusWoo-Jong Hahn, Ando Ki, Kee-Wook Rim, Soo-Won Kim. 512-517 [doi]

Tandem Computers Incorporated: Performance Modeling of ServerNet™ TopologiesRobert W. Horst, Doug Jewett, William J. Watson, L. Young, Dimiter R. Avresky, R. Wilkinson, Chris M. Cunningham. 518-523 [doi]

CoCheck: Checkpointing and Process Migration for MPIGeorg Stellner. 526-531 [doi]

Tulip: A Portable Run-Time System for Object-Parallel SystemsPeter H. Beckman, Dennis Gannon. 532-536 [doi]

A Virtual Memory Model for Parallel SupercomputersVeronica L. M. Reis, Isaac D. Scherson. 537-543 [doi]

A Partitioning Programming Environment for a Novel Parallel ArchitectureReiner W. Hartenstein, Jürgen Becker, Michael Herz, Rainer Kress, Ulrich Nageldinger. 544-548 [doi]

An Integrated Synchronization and Consistency Protocol for the Implementation of a High-Level Parallel Programming LanguageMartin C. Rinard. 549-553 [doi]

Implementation and Evaluation of Prefetching in the Intel Paragon Parallel File SystemMeenakshi Arunachalam, Alok N. Choudhary, Brad Rullman. 554-559 [doi]

Routing a Permutation in the Hypercube by Two Sets of Edge-Disjoint PathsQian-Ping Gu, Hisao Tamaki. 561-567 [doi]

Determining Asynchronous Acyclic Pipeline Execution TimesVal Donaldson, Jeanne Ferrante. 568-572 [doi]

Distributing Tokens on a Hypercube without Error AccumulationBogdan S. Chlebus, José D. P. Rolim, Giora Slutzki. 573-578 [doi]

On Some Global Operations in Faulty SIMD HypercubesAmit Sengupta, C. S. Raghavendra. 579-583 [doi]

An Improved Approximation Algorithm for Scheduling Task Trees on Linear ArraysHari Krishna Tadepalli, Errol L. Lloyd. 584-590 [doi]

Jacobi-like Algorithms for Eigenvalue Decomposition of a Real Normal Matrix Using Real ArithmeticBing Bing Zhou, Richard P. Brent. 593-600 [doi]

An Element-Based Concurrent Partitioner for Unstructured Finite Element MeshesHong Q. Ding, Robert D. Ferraro. 601-605 [doi]

Analysis of the Numerical Effects of Parallelism on a Parallel Genetic AlgorithmWilliam E. Hart, Scott B. Baden, Richard K. Belew, Scott R. Kohn. 606-612 [doi]

Compiling MATLAB Programs to ScaLAPACK: Exploiting Task and Data ParallelismShankar Ramaswamy, Eugene W. Hodges IV, Prithviraj Banerjee. 613-619 [doi]

Mapping Techniques for Parallel Evaluation of Chains of RecurrencesEugene V. Zima, Karthi R. Vadivelu, Thomas L. Casavant. 620-624 [doi]

Performance of Asynchronous Linear Iterations with Random DelaysAdrian Moga, Michel Dubois. 625-629 [doi]

For a Massive Number of Massively Parallel Machines: What are the Target Applications, Who are the Target Users, and What New R&D is Needed to Hit the Target?William M. Farmer, Richard F. Freund, Mark Furtney, Paul Messina, Lionel M. Ni, Charles L. Seitz, Marc Snir. 631-634 [doi]

Clusters for Commercial Computing: An Invisible ArchitectureGregory F. Pfister. 636 [doi]

Generic Methodologies for Deadlock-Free RoutingHyunmin Park, Dharma P. Agrawal. 638-643 [doi]

Partitionability of the Multistage Interconnection NetworksYeimkuan Chang. 644-649 [doi]

On Embedding Various Networks into the Hypercube Using Matrix TransformationsMounir Hamdi, Siang W. Song. 650-654 [doi]

Optimal Subcube Fault Tolerance in a Circuit-Switched HypercubeBaback A. Izadi, Füsun Özgüner. 655-659 [doi]

Fault-Tolerant Ring Embedding in Star GraphsYu-Chee Tseng, Shu-Hui Chang, Jang-Ping Sheu. 660-665 [doi]

An Optical Interconnect Model for k-ary n-cube Wormhole NetworksMongkol Raksapatcharawong, Timothy Mark Pinkston. 666-672 [doi]

Fault-Tolerant Multiple Bus Networks for Fan-In AlgorithmsRamachandran Vaidyanathan, Sudharani Nadella. 674-681 [doi]

Coping with Sparse Inputs on Enhanced Meshes - Semigroup Computation with COMMON CRCW BusesPeter Damaschke. 682-686 [doi]

An Optimal Algorithm for the Angle-Restricted All Nearest Neighbor Problem on the ReconfigurableKoji Nakano, Stephan Olariu. 687-691 [doi]

Parallel Algorithms Using Unreliable BroadcastsJohn Matthews, Charles U. Martel. 692-696 [doi]

Efficient Algorithms for the Hough Transform on Arrays with Reconfigurable Optical BusesSandy Pavel, Selim G. Akl. 697-701 [doi]

Integer and Floating Point Matrix-Vector Multiplication on the Reconfigurable MeshJerry L. Trahan, Chun-ming Lu, Ramachandran Vaidyanathan. 702-706 [doi]

Some Image Processing Algorithms on a RAP with Wider Bus NetworksShung-Shing Lee, Shi-Jinn Horng, Horng-Ren Tsai, Yu-Hua Lee. 708-715 [doi]

Parallel Synthetic Aperture Radar Processing on Workstation NetworksPeter G. Meisl, Mabo Robert Ito, Ian G. Cumming. 716-723 [doi]

The Evolution of a Massively Parallel Vision System for Real-Time Automotive Image ProcessingAlberto Broggi. 724-728 [doi]

2D Object Recognition on a Reconfigurable MeshConcettina Guerra. 729-733 [doi]

Space-Time Adaptive Processing on the Mesh Synchronous ProcessorJanice S. McMahon, Ken Teitelbaum. 734-740 [doi]

An Experimental Study of Input/Output Characteristics of NASA Earth and Space Sciences ApplicationsMichael R. Berry, Tarek A. El-Ghazawi. 741-747 [doi]

Bitonic Sorting on Beneš NetworksBeverly Gocal. 749-753 [doi]

Designing Adaptable Real-Time Fault-Tolerant Parallel SystemsCélio Estevan Morón. 754-758 [doi]

Improving Memory Performance for Indirect Accesses on SIMD ComputersJames D. Allen, David E. Schimmel. 759-765 [doi]

A New Approach to Pipeline FFT ProcessorShousheng He, Mats Torkelson. 766-770 [doi]

Implementation of a SliM Array ProcessorHyun M. Chang, Myung Hoon Sunwoo, Tai-Hoon Cho. 771-775 [doi]

Temporal Characterization of Demands for Data Movement on Parallel ProgramsBernardo Rodriguez, Harry F. Jordan, Gita Alaghband. 776-779 [doi]

Broadcasting Multiple Messages in the Multiport ModelAmotz Bar-Noy, Ching-Tien Ho. 781-788 [doi]

The Necessary Conditions for Clos-Type Nonblocking Multicast NetworksYuanyuan Yang, Gerald M. Masson. 789-795 [doi]

A Class of Interconnection Networks for MulticastingYuanyuan Yang. 796-802 [doi]

Performance Prediction of PVM ProgramsMichael R. Steed, Mark J. Clement. 803-807 [doi]

Algorithms for All-to-All Personalized Exchange in 2D and 3D ToriYoung-Joo Suh, Sudhakar Yalamanchili. 808-814 [doi]

Generalized Theory for Deadlock-Free Adaptive Wormhole Routing and its Application to Disha ConcurrentAnjan K. Venkatramani, Timothy Mark Pinkston, José Duato. 815-821 [doi]

Efficient Run-Time Support for Irregular Task Computations with Mixed GranularitiesCong Fu, Tao Yang. 823-830 [doi]

A New Technique for 3-D Domain Decomposition on Multicomputers which Reduces Message-PassingJoseph Gil, Alan S. Wagner. 831-835 [doi]

Application Load Imbalance on Parallel ProcessorsVasudha Govindan, Mark A. Franklin. 836-842 [doi]

Native ATM Application Programmer Interface Testbed for Cluster-Based ComputingPatrick W. Dowd, Todd M. Carrozzi, Frank A. Pellegrino, Amy Xin Chen. 843-849 [doi]

SWEB: Towards a Scalable World Wide Web Server on MulticomputersDaniel Andresen, Tao Yang, Vegard Holmedahl, Oscar H. Ibarra. 850-856 [doi]

Parallel Implementations of Irregular Problems Using High-Level Actor LanguageR. Panwar, W. Kim, Gul Agha. 857-862 [doi]

Implementation of an Automatic Semi-Fluid Motion Analysis Algorithm on a Massively Parallel ComputerKannappan Palaniappan, Mohammad Faisal, Chandra Kambhamettu, A. Frederick Haslert. 864-877 [doi]

NAS Experiences of Porting CM Fortran Codes to HPF on IBM SP2 and SGI Power ChallengeSubhash Saini. 878-880 [doi]

Random Seeking: A General, Efficient, and Informed Randomized Scheme for Dynamic Load BalancingNihar R. Mahapatra, Shantanu Dutt. 881-885 [doi]

A Direct Block-Five-Diagonal System Solver for the VLSI Parallel ModelMarián Vajtersic. 886-890 [doi]

Mapping Linear Recurrences onto Systolic ArraysLadan Kazerouni, Basant Rajan, R. K. Shyamasundar. 891-897 [doi]
