Abstract is missing.
- An Optimal Offline Permutation Algorithm on the Hierarchical Memory Machine, with the GPU ImplementationAkihiko Kasagi, Koji Nakano, Yasuaki Ito. 1-10 [doi]
- AdELL: An Adaptive Warp-Balancing ELL Format for Efficient Sparse Matrix-Vector Multiplication on GPUsMarco Maggioni, Tanya Y. Berger-Wolf. 11-20 [doi]
- A Push-Relabel-Based Maximum Cardinality Bipartite Matching Algorithm on GPUsMehmet Deveci, Kamer Kaya, Bora Uçar, Ümit V. Çatalyürek. 21-29 [doi]
- Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor ContractionsDavid Ozog, Jeff R. Hammond, James Dinan, Pavan Balaji, Sameer Shende, Allen D. Malony. 30-39 [doi]
- Efficient Data Redistribution Methods for Coupled Parallel Particle CodesMichael Hofmann, Gudula Rünger. 40-49 [doi]
- A Diffusion-Based Processor Reallocation Strategy for Tracking Multiple Dynamically Varying Weather PhenomenaPreeti Malakar, Vijay Natarajan, Sathish S. Vadhiyar, Ravi S. Nanjundiah. 50-59 [doi]
- HAccRG: Hardware-Accelerated Data Race Detection in GPUsAnup Holey, Vineeth Mekkat, Antonia Zhai. 60-69 [doi]
- Adaptive Runtime Selection for GPUJean-François Dollinger, Vincent Loechner. 70-79 [doi]
- Efficient Inter-node MPI Communication Using GPUDirect RDMA for InfiniBand Clusters with NVIDIA GPUsSreeram Potluri, Khaled Hamidouche, Akshay Venkatesh, Devendar Bureddy, Dhabaleswar K. Panda. 80-89 [doi]
- On Scientific Workflow Scheduling in Clouds under Budget ConstraintXiangyu Lin, Chase Qishi Wu. 90-99 [doi]
- On the Merits of Distributed Work-Stealing on Selective Locality-Aware TasksJeeva Paudel, Olivier Tardieu, José Nelson Amaral. 100-109 [doi]
- A Dynamic Moldable Job Scheduling Based Parallel SAT SolverSajjad Asghar, Eric Aubanel, David Bremner. 110-119 [doi]
- BlindDate: A Neighbor Discovery ProtocolKeyu Wang, Xufei Mao, Yunhao Liu. 120-129 [doi]
- Freeweb: P2P-Assisted Collaborative Censorship-Resistant Web BrowsingHaiying Shen, Alex X. Liu, Lianyu Zhao. 130-139 [doi]
- Churn: A Key Effect on Real-World P2P SoftwareCheng-Yuan Ho, Ming-Chen Chung, Li-Hsing Yen, Chien-Chao Tseng. 140-149 [doi]
- Flow Migration on Multicore Network Processors: Load Balancing While Minimizing Packet ReorderingMuhammad Faisal Iqbal, Jim Holt, Jee Ho Ryoo, Lizy K. John, Gustavo de Veciance. 150-159 [doi]
- Prediction of Parallel Speed-Ups for Las Vegas AlgorithmsCharlotte Truchet, Florian Richoux, Philippe Codognet. 160-169 [doi]
- Empirical Analysis of Space-Filling Curves for Scientific Computing ApplicationsDaryl Deford, Ananth Kalyanaraman. 170-179 [doi]
- Engineering High-Performance Community Detection Heuristics for Massive GraphsChristian Staudt, Henning Meyerhenke. 180-189 [doi]
- Hypergraph Sparsification and Its Application to PartitioningMehmet Deveci, Kamer Kaya, Ümit V. Çatalyürek. 200-209 [doi]
- Fast Approximate Subgraph Counting and EnumerationGeorge M. Slota, Kamesh Madduri. 210-219 [doi]
- Simultaneous Finite Automata: An Efficient Data-Parallel Model for Regular Expression MatchingRyoma Sin'ya, Kiminori Matsuzaki, Masataka Sassa. 220-229 [doi]
- Expression Tree Evaluation by Dynamic Code Generation - Are Accelerators Up for the Task?Thomas Muller, Josef Weidendorfer, Andreas Blaszczyk. 230-239 [doi]
- Predicting Execution Readiness of MPI Binaries with FEAM, a Framework for Efficient Application MigrationKarolina Sarnowska-Upton, Andrew S. Grimshaw. 240-249 [doi]
- A NUMA-Aware Runtime Environment for the Actor ModelEmilio Francesquini, Alfredo Goldman, Jean-François Méhaut. 250-259 [doi]
- Integrating Multi-GPU Execution in an OpenACC CompilerToshiya Komoda, Shinobu Miwa, Hiroshi Nakamura, Naoya Maruyama. 260-269 [doi]
- AOmpLib: An Aspect Library for Large-Scale Multi-core Parallel ProgrammingBruno Medeiros, João L. Sobral. 270-279 [doi]
- HyPHI - Task Based Hybrid Execution C++ Library for the Intel Xeon Phi CoprocessorJirí Dokulil, Enes Bajrovic, Siegfried Benkner, Martin Sandrieser, Beverly Bachmayer. 280-289 [doi]
- A Prioritized Distributed Mutual Exclusion Algorithm Balancing Priority Inversions and Response TimeJonathan Lejeune, Luciana Arantes, Julien Sopena, Pierre Sens. 290-299 [doi]
- A Generalized Mutual Exclusion Problem and Its AlgorithmAoxueluo, Weigang Wu, Jiannong Cao, Michel Raynal. 300-309 [doi]
- Efficient Dissemination Algorithm for Scale-Free TopologiesRuijing Hu, Julien Sopena, Luciana Arantes, Pierre Sens, Isabelle M. Demeure. 310-319 [doi]
- Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUsJosé Ignacio Aliaga, Joaquin Perez, Enrique S. Quintana-Ortí, Hartwig Anzt. 320-329 [doi]
- Energy-Efficient Synthetic-Aperture Radar Processing on a Manycore ArchitectureZain-ul-Abdin, Anders Ahlander, Bertil Svensson. 330-338 [doi]
- Parallel Radix Sort on the AMD Fusion Accelerated Processing UnitMichael C. Delorme, Tarek S. Abdelrahman, Chengyan Zhao. 339-348 [doi]
- Sampling-Based Phase Classification and Prediction for Multi-threaded Program Execution on Multi-core ArchitecturesChih-Hao Chang, Pangfeng Liu, Jan-Jan Wu. 349-358 [doi]
- Characterization of Input/Output Bandwidth Performance Models in NUMA Architecture for Data Intensive ApplicationsTan Li, Yufei Ren, Dantong Yu, Shudong Jin, Thomas G. Robertazzi. 369-378 [doi]
- Finite-State Robots in a Warehouse: Achieving Linear Parallel Speedup While Rearranging ObjectsArnold L. Rosenberg. 379-388 [doi]
- Hysteresis Re-chunking Based Metadata Harnessing Deduplication of Disk ImagesBing Zhou, Jiangtao Wen. 389-398 [doi]
- Energy-Efficient Leader Election Protocols for Single-Hop Radio NetworksMarcin Kardas, Marek Klonowski, Dominik Pajak. 399-408 [doi]
- Backing Up Your Data to the Cloud: Want to Pay Less?Yingwu Zhu, Justin Masui. 409-418 [doi]
- Handling Uncertainty: Pareto-Efficient BoT Scheduling on Hybrid CloudsMohammad Reza Hoseinyfarahabady, Hamid R. D. Samani, Luke M. Leslie, Young Choon Lee, Albert Y. Zomaya. 419-428 [doi]
- Parallel Birth and Death Process for Cell Nuclei Extraction in Histopathology ImagesChristophe Avenel, Pierre Fortin, Dominique Béréziat. 429-438 [doi]
- Use of a Mobile Sink for Maximizing Data Collection in Energy Harvesting Sensor NetworksXiaojiang Ren, Weifa Liang, Wenzheng Xu. 439-448 [doi]
- Application-Aware Workload Consolidation to Minimize Both Energy Consumption and Network Load in Cloud EnvironmentsNikos Tziritas, Cheng-Zhong Xu, Thanasis Loukopoulos, Samee Ullah Khan, Zhibin Yu. 449-457 [doi]
- Risk Intelligence: Profiting from Uncertainty in Data Processing SystemSi Zheng, Yunhuai Liu, Shanshan Li, Tian He, Xiangke Liao. 458-467 [doi]
- Characterizing Cloud Applications on a Google Data CenterSheng Di, Derrick Kondo, Franck Cappello. 468-473 [doi]
- Protein Structure Prediction on GPU: A Declarative Approach in a Multi-agent FrameworkFederico Campeotto, Agostino Dovier, Enrico Pontelli. 474-479 [doi]
- Multiple-SPMD Programming Environment Based on PGAS and Workflow toward Post-petascale ComputingMiwako Tsuji, Mitsuhisa Sato, Maxime R. Hugues, Serge G. Petiton. 480-485 [doi]
- An Efficient Deterministic Parallel Algorithm for Adaptive Multidimensional Numerical Integration on GPUsKamesh Arumugam, Alexander Godunov, Desh Ranjan, Bala Terzic, Mohammad Zubair. 486-491 [doi]
- Towards Hardware Realizations of Intelligent Systems: A Cortical Column ApproachAnita Tino, Gul N. Khan, Fei Yuan. 492-497 [doi]
- WormPlanar: Topological Planarization Based Wormhole Detection in Wireless NetworksXiaopei Lu, Dezun Dong, Xiangke Liao. 498-503 [doi]
- Java with Auto-parallelization on Graphics Coprocessing ArchitectureGuodong Han, Chenggang Zhang, King Tin Lam, Cho-Li Wang. 504-509 [doi]
- Symbolic Analysis of Concurrency Errors in OpenMP ProgramsHongyi Ma, Steve Diersen, Liqiang Wang, Chunhua Liao, Daniel J. Quinlan, Zijiang Yang. 510-516 [doi]
- Efficient Forwarding of Producer-Consumer Data in Task-Based ProgramsMadhavan Manivannan, Anurag Negi, Per Stenström. 517-522 [doi]
- Parallelization of Particle-in-Cell Codes for Nonlinear Kinetic Models from Mathematical PhysicsMatthias Korch, Tobias Ramming, Gerhard Rein. 523-529 [doi]
- On the Scalability of Constraint Programming on Hierarchical Multiprocessor SystemsRui Machado, Vasco Pedro, Salvador Abreu. 530-535 [doi]
- Dynamic Server Provisioning for Carbon-Neutral Data CentersA. Hasan Mahmud, Shaolei Ren. 536-541 [doi]
- A Flexible Framework to Enhance RAID-6 Scalability via Exploiting the Similarities among MDS CodesChentao Wu, Xubin He. 542-551 [doi]
- Load-Balanced Recovery Schemes for Single-Disk Failure in Storage Systems with Any Erasure CodeXianghong Luo, Jiwu Shu. 552-561 [doi]
- Temporal-Aware Mechanism to Detect Private Data in Chip MultiprocessorsAlberto Ros, Blas Cuesta, María Engracia Gómez, Antonio Robles, José Duato. 562-571 [doi]
- Distributed Shortcut Networks: Layout-Aware Low-Degree Topologies Exploiting Small-World EffectVan K. Nguyen, Nhat T. X. Le, Ikki Fujiwara, Michihiro Koibuchi. 572-581 [doi]
- Efficient Routing Mechanisms for Dragonfly NetworksMarina García, Enrique Vallejo 0001, Ramón Beivide, Miguel Odriozola, Mateo Valero. 582-592 [doi]
- Protocols for Fully Offloaded Collective Operations on Accelerated Network AdaptersTimo Schneider, Torsten Hoefler, Ryan E. Grant, Brian W. Barrett, Ron Brightwell. 593-602 [doi]
- Efficient Information Dissemination in Dynamic NetworksZhiwei Yang, Wei-Gang Wu, Yishun Chen, Jun Zhang. 603-610 [doi]
- A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-blocking Alltoallv Collective on Multi-core SystemsKrishna Chaitanya Kandalla, Hari Subramoni, Karen Tomko, Dmitry Pekurovsky, Dhabaleswar K. Panda. 611-620 [doi]
- HEUSPEC: A Software Speculation Parallel ModelFan Xu, Li Shen, Zhiying Wang, Hui Guo, Bo Su, Wei Chen 0009. 621-630 [doi]
- Enhancing Performance Portability of MPI Applications through Annotation-Based TransformationsMd. Ziaul Haque, Qing Yi, James Dinan, Pavan Balaji. 631-640 [doi]
- High-Performance Design of Hadoop RPC with RDMA over InfiniBandXiaoyi Lu, Nusrat S. Islam, Md. Wasi-ur-Rahman, Jithin Jose, Hari Subramoni, Hao Wang, Dhabaleswar K. Panda. 641-650 [doi]
- Mixed Model Universal Software Thread-Level SpeculationZhen Cao, Clark Verbrugge. 651-660 [doi]
- A Flexible Approach to Staged EventsTiago Salmito, Ana Lucia de Moura, Noemi Rodriguez. 661-670 [doi]
- ConMR: Concurrent MapReduce Programming Model for Large Scale Shared-Data ApplicationsFan Zhang, Qutaibah M. Malluhi, Tamer M. Elsyed. 671-679 [doi]
- Read-Write Lock Allocation in Software Transactional MemoryAmir Ghanbari Bavarsad, Ehsan Atoofian. 680-687 [doi]
- A Heterogeneous Computing Framework for Computational FinanceGordon Inggs, David B. Thomas, Wayne Luk. 688-697 [doi]
- A Framework for Performance-Aware Composition of Applications for GPU-Based SystemsUsman Dastgeer, Christoph W. Keßler. 698-707 [doi]
- Exploiting Execution Order and Parallelism from Processing Flow Applying Pipeline-Based Programming Method on Manycore AcceleratorsShinichi Yamagiwa, Ryo Jozaki, Shixun Zhang, Ryo Zaizen, Dewen Xu. 708-717 [doi]
- Performance Tuning on Multicore Systems for Feature Matching within Image CollectionsXiaoxin Tang, Steven Mills, David M. Eyers, Zhiyi Huang 0001, Kai-Cheung Leung, Minyi Guo. 718-727 [doi]
- X-kaapi: A Multi Paradigm Runtime for Multicore ArchitecturesThierry Gautier, Fabien Le Mentec, Vincent Faucher, Bruno Raffin. 728-735 [doi]
- Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon PhiArunmoezhi Ramachandran, Jérôme Vienne, Rob F. Van der Wijngaart, Lars Koesterke, Ilya Sharapov. 736-743 [doi]
- Tiled QR Decomposition and Its Optimization on CPU and GPU Computing SystemDongjin Kim, Kyu Ho Park. 744-753 [doi]
- Hierarchical Parallel Matrix Multiplication on Large-Scale Distributed Memory PlatformsJean-Noël Quintin, Khalid Hasanov, Alexey L. Lastovetsky. 754-762 [doi]
- A Scalability Model for Distributed Resource Management in Real-Time Online ApplicationsDominik Meiländer, Sebastian Kottinger, Sergei Gorlatch. 763-772 [doi]
- A Dynamic Resource Management System for Network-Attached Accelerator ClustersSuraj Prabhakaran, Mohsin Iqbal, Sebastian Rinke, Felix Wolf. 773-782 [doi]
- Enhanced Resource Management Enabling Standard Parameter Sweep Jobs for Scientific ApplicationsSonja Holl, M. Shahbaz Memon, Bernd Schuller, Morris Riedel, Yassene Mohammed, Magnus Palmblad, Andrew S. Grimshaw. 783-790 [doi]
- Pipelining/Overlapping Data Transfer for Distributed Data-Intensive Job ExecutionEun Sung Jung, Ketan Maheshwari, Rajkumar Kettimuthu. 791-797 [doi]
- Scheduling Data Parallel Workloads - A Comparative Study of Two Common Algorithmic ApproachesMahadevan Balasubramaniam, Ioana Banicescu, Florina M. Ciorba. 798-807 [doi]
- A Model Based Load-Balancing Method in IaaS CloudZhenzhong Zhang, Limin Xiao, Yuan Tao, Ji Tian, Shouxin Wang, Hua Liu. 808-816 [doi]
- Extending Battery Life of a Multi-buffered, Single-Threaded Processor in a Mobile Computing DeviceRashid Khogali, Olivia Das. 817-825 [doi]
- Effects of Dynamic Voltage and Frequency Scaling on a K20 GPURong Ge, Ryan Vogt, Jahangir Majumder, Arif Alam, Martin Burtscher, Ziliang Zong. 826-833 [doi]
- Revisiting Server Energy ProportionalityChung-Hsing Hsu, Stephen W. Poole. 834-840 [doi]
- Relating Application Memory Activity to Processor PowerSaman Khoshbakht, Nikitas Dimopoulos. 849-857 [doi]
- Power-Aware Multi-data Center Management Using Machine LearningJosep Lluis Berral, Ricard Gavaldà, Jordi Torres. 858-867 [doi]
- Analytical Energy Models for MPI Communications on a Sandy-Bridge ArchitectureFrancisco Almeida, Vicente Blanco Pérez, Isidro Gonzalez, Alberto Cabrera, Domingo Giménez. 868-876 [doi]
- Efficient Offloading of Parallel Kernels Using MPI_Comm_SpawnSebastian Rinke, Suraj Prabhakaran, Felix Wolf. 877-884 [doi]
- The DEEP Project - Pursuing Cluster-Computing in the Many-Core EraNorbert Eicker, Thomas Lippert, Thomas Moschny, Estela Suarez. 885-892 [doi]
- Integration of a Highly Scalable, Multi-FPGA-Based Hardware Accelerator in Common Cluster InfrastructuresOliver Knodel, Andy Georgi, Patrick Lehmann, Wolfgang E. Nagel, Rainer G. Spallek. 893-900 [doi]
- GPU Powered ROSA AnalyzerRaúl Pardo, Fernando L. Pelayo, Pedro Valero-Lara. 901-908 [doi]
- Achieving Speedup in Aggregate Risk Analysis Using Multiple GPUsA. K. Bahl, Oliver Baltzer, Andrew Rau-Chaplin, Blesson Varghese, A. Whiteway. 909-916 [doi]
- iTraffic: A Smartphone-based Traffic Information SystemYi-Ta Chuang, Chih-Wei Yi, Yin-Chih Lu, Pei-Chuan Tsai. 917-922 [doi]
- An Indoor Collaborative Pedestrian Dead Reckoning SystemYi-Ting Li, Guaning Chen, Min-Te Sun. 923-930 [doi]
- Development of Emergency Rescue Evacuation Support System (ERESS) in Panic-Type Disasters: Disaster Detection by Positioning Area of TerminalsTakahumi Nakamura, Katsunori Kogo, Jun Fujimura, Kentaro Tsudaka, Tomotaka Wada, Kazuhiro Ohtsuki, Hiromi Okada. 931-936 [doi]
- Secure Homomorphic and Searchable Encryption in Ad Hoc NetworksScott C.-H. Huang, Qiao-Wei Lin, Chih-Kai Chang. 937-942 [doi]
- Dynamic Content Adjustment in Mobile Ad Hoc NetworksShih-Rong Yang, Guaning Chen, Min-Te Sun. 943-949 [doi]
- Automatic Extraction of Task-Level Parallelism for Heterogeneous MPSoCsDaniel Cordes, Olaf Neugebauer, Michael Engel, Peter Marwedel. 950-959 [doi]
- Toward a Performance/Resilience Tool for Hardware/Software Co-design of High-Performance Computing SystemsChristian Engelmann, Thomas Naughton. 960-969 [doi]
- Hierarchical Memory Buffering Techniques for an In-Memory Event Tracing Extension to the Open Trace Format 2Michael Wagner, Andreas Knüpfer, Wolfgang E. Nagel. 970-976 [doi]
- Is Source-Code Isolation Viable for Performance Characterization?Chadi Akel, Yuriy Kashnikov, Pablo de Oliveira Castro, William Jalby. 977-984 [doi]
- Event Streaming for Online Performance Measurements ReductionJean-Baptiste Besnard, Marc Pérache, William Jalby. 985-994 [doi]
- Intralayer Communication for Tree-Based Overlay NetworksTobias Hilbrich, Joachim Protze, Bronis R. de Supinski, Martin Schulz, Matthias S. Müller, Wolfgang E. Nagel. 995-1003 [doi]
- Discovery of Potential Parallelism in Sequential ProgramsZhen Li, Ali Jannesari, Felix Wolf. 1004-1013 [doi]
- StreamMine3G OneClick - Deploy and Monitor ESP Applications with a Single ClickAndrey Brito, Andre Martin, Christof Fetzer, Isabelly Rocha, Telles Nobrega. 1014-1019 [doi]
- A Server Model for Reliable Communication on Cell/B.ERui Zhou 0005, Huaming Chen, Qun Liu, Yong Sheng, Qingguo Zhou, Xuan Wang, Kuan-Ching Li. 1020-1027 [doi]
- A Power-Aware Study of Iris Matching Algorithms on Intel's SCCGildo Torres, Jed Kao-Tung Chang, Fang Hua, Chen Liu, Stephanie A. C. Schuckers. 1028-1037 [doi]
- Hardware-Specific Bare-Metal Microhypervisor PrototypeIvan Kolchin, Maxim Nikolaev, Stanislav Parfenov, Oleg Popkov, Sergey Sobolev. 1038-1043 [doi]
- Thermal-Aware Scheduling Collaborating with OS and ArchitectureCheng-Yu Lee, Shuang-Jhu Yang, Rong-Guey Chang. 1044-1051 [doi]
- Compilers for Low Power with Design Patterns on Embedded Multicore SystemsCheng-Yen Lin, Chi-Bang Kuan, Jenq Kuen Lee. 1052-1060 [doi]
- Mechanism of Automatic Deployment for Virtual Network EnvironmentMin-Xiou Chen, Kuo-Le Mei. 1061-1066 [doi]
- Secure PHR Access Control Scheme for Healthcare Application CloudsChia-Hui Liu, Fong-Qi Lin, Dai-Lun Chiang, Tzer-Long Chen, Chin-Sheng Chen, Han-Yu Lin, Yu-Fang Chung, Tzer-Shyong Chen. 1067-1076 [doi]
- Cycles Embedding of Twisted CubesPao-Lien Lai, Kao-Lin Hu, Hong-Chun Hsu. 1077-1081 [doi]
- A Secure Cloud-Based Payment Model for M-CommerceTao-Ku Chang. 1082-1086 [doi]