Abstract is missing.
- Balancing context switch penalty and response time with elastic time slicingNagakishore Jammula, Moinuddin Qureshi, Ada Gavrilovska, Jongman Kim. 1-10 [doi]
- Design and evaluation of parallel hashing over large-scale dataLong Cheng, Spyros Kotoulas, Tomas E. Ward, Georgios Theodoropoulos. 1-10 [doi]
- Optimization of scan algorithms on multi- and many-core processorsQiao Sun, Chao Yang. 1-10 [doi]
- GPU parallelization of the stochastic on-time arrival problemMaleen Abeydeera, Samitha Samaranayake. 1-8 [doi]
- Optimizing the performance of parallel applications on a 5D torus via task mappingAbhinav Bhatele, Nikhil Jain, Katherine E. Isaacs, Ronak Buch, Todd Gamblin, Steven H. Langer, Laxmikant V. Kalé. 1-10 [doi]
- Matrix-matrix multiplication on a large register file architecture with indirectionDheeraj Sreedhar, Jeff H. Derby, Robert K. Montoye, C. L. Johnson. 1-10 [doi]
- Cache-conscious scheduling of streaming pipelines on parallel machines with private cachesKunal Agrawal, Jordyn Maglalang, Jeremy T. Fineman. 1-12 [doi]
- A flexible scheduling framework for heterogeneous CPU-GPU clustersKittisak Sajjapongse, Tejaswi Agarwal, Michela Becchi. 1-11 [doi]
- A fast implementation of MLR-MCL algorithm on multi-core processorsQingpeng Niu, Pai-Wei Lai, S. M. Faisal, Srinivasan Parthasarathy, P. Sadayappan. 1-10 [doi]
- Smart multi-task scheduling for OpenCL programs on CPU/GPU heterogeneous platformsYuan Wen, Zheng Wang, Michael F. P. O'Boyle. 1-10 [doi]
- RADIR: Lock-free and wait-free bandwidth allocation models for solid state drivesPooja Aggarwal, Giridhar Yasa, Smruti R. Sarangi. 1-10 [doi]
- An improved recursive graph bipartitioning algorithm for well balanced domain decompositionAstrid Casadei, Pierre Ramet, Jean Roman. 1-10 [doi]
- Online failure prediction for HPC resources using decentralized clusteringAlejandro Pelaez, Andres Quiroz, James C. Browne, Edward Chuah, Manish Parashar. 1-9 [doi]
- Particle advection performance over varied architectures and workloadsHank Childs, Scott Biersdorff, David Poliakoff, David Camp, Allen D. Malony. 1-10 [doi]
- Coupling-aware graph partitioning algorithms: Preliminary studyMaria Predari, Aurélien Esnard. 1-10 [doi]
- A proactive approach for coping with uncertain resource availabilities on desktop gridsLouis-Claude Canon, Adel Essafi, Denis Trystram. 1-9 [doi]
- A multilevel compressed sparse row format for efficient sparse computations on multicore processorsHumayun Kabir, Joshua Dennis Booth, Padma Raghavan. 1-10 [doi]
- Optimizing shared data accesses in distributed-memory X10 systemsJeeva Paudel, Olivier Tardieu, José Nelson Amarai. 1-10 [doi]
- Simple parallel biconnectivity algorithms for multicore platformsGeorge M. Slota, Kamesh Madduri. 1-10 [doi]
- CQA: A code quality analyzer tool at binary levelAndres Charif Rubial, Emmanuel Oseret, Jose Noudohouenou, William Jalby, Ghislain Lartigue. 1-10 [doi]
- Premonition of storage response class using Skyline ranked Ensemble methodKumar Dheenadayalan, V. N. Muralidhara, Pushpa Datla, G. Srinivasaraghavan, Maulik Shah. 1-10 [doi]
- GpuTejas: A parallel simulator for GPU architecturesGeetika Malhotra, Seep Goel, Smruti R. Sarangi. 1-10 [doi]
- DRIVE: Using implicit caching hints to achieve disk I/O reduction in virtualized environmentsSujesha Sudevalayam, Purushottam Kulkarni. 1-10 [doi]
- Software based ultrasound B-mode/beamforming optimization on GPU and its performance predictionThi Yen Phuong, Jeong-Gun Lee. 1-10 [doi]
- Improving Multi-dimensional query processing with data migration in distributed cache infrastructureYoungmoon Eom, Jinwoong Kim, Deukyeon Hwang, Jaewon Kwak, Minho Shin, Beomseok Nam. 1-10 [doi]
- High performance MPI library over SR-IOV enabled infiniband clustersJie Zhang, Xiaoyi Lu, Jithin Jose, Mingzhe Li, Rong Shi, Dhabaleswar K. Panda. 1-10 [doi]
- Distance threshold similarity searches on spatiotemporal trajectories using GPGPUMichael G. Gowanlock, Henri Casanova. 1-10 [doi]
- Analysis and tuning of libtensor framework on multicore architecturesKhaled Z. Ibrahim, Samuel W. Williams, Evgeny Epifanovsky, Anna I. Krylov. 1-10 [doi]
- Interface for heterogeneous kernels: A framework to enable hybrid OS designs targeting high performance computing on manycore architecturesTaku Shimosawa, Balazs Gerofi, Masamichi Takagi, Gou Nakamura, Tomoki Shirasawa, Yuji Saeki, Masaaki Shimizu, Atsushi Hori, Yutaka Ishikawa. 1-10 [doi]
- Efficient and robust allocation algorithms in clouds under memory constraintsOlivier Beaumont, Juan Angel Lorenzo del Lorenzo, Lionel Eyraud-Dubois, Paul Renaud-Goud. 1-10 [doi]
- Parallel AMG solver for three dimensional unstructured grids using GPUK. Ravi Tej, Naveen Sivadasan, Vatsalya Sharma, Raja Banerjee. 1-10 [doi]
- Towards realizing the potential of malleable jobsAbhishek Gupta, Bilge Acun, Osman Sarood, Laxmikant V. Kalé. 1-10 [doi]
- Mixed-precision models for calculation of high-order virial coefficients on GPUsChao Feng, Andrew Schultz, Vipin Chaudhary, David Kofke. 1-10 [doi]
- TriKon: A hypervisor aware manycore processorRohan Bhalla, Prathmesh Kallurkar, Nitin Gupta, Smruti R. Sarangi. 1-10 [doi]
- Optical overlay NUCA: A high speed substrate for shared L2 cachesEldhose Peter, Anuj Arora, Akriti Bagaria, Smruti R. Sarangi. 1-10 [doi]
- On the suitability of MPI as a PGAS runtimeJeff Daily, Abhinav Vishnu, Bruce J. Palmer, Hubertus van Dam, Darren J. Kerbyson. 1-10 [doi]
- Queueing-based storage performance modeling and placement in OpenStack environmentsYang Song, Rakesh Jain, Ramani Routray. 1-10 [doi]
- Combining HoL-blocking avoidance and differentiated services in high-speed interconnectsPedro Yebenes, Jesús Escudero-Sahuquillo, Crispín Gómez Requena, Pedro Javier García, Francisco J. Alfaro, Francisco J. Quiles, José Duato. 1-10 [doi]
- A high performance broadcast design with hardware multicast and GPUDirect RDMA for streaming applications on Infiniband clustersAkshay Venkatesh, Hari Subramoni, Khaled Hamidouche, Dhabaleswar K. Panda. 1-10 [doi]
- Algorithms for power-aware resource activationSonika Arora, Archita Agarwal, Venkatesan T. Chakaravarthy, Yogish Sabharwal. 1-10 [doi]
- Scaling graph community detection on the Tilera many-core architectureDaniel G. Chavarría-Miranda, Mahantesh Halappanavar, Ananth Kalyanaraman. 1-11 [doi]
- Fine-grained GPU parallelization of pairwise local sequence alignmentChirag Jain, Subodh Kumar. 1-10 [doi]
- Performance evaluation of multi core systems for high throughput medical applications involving model predictive controlMadhurima Pore, Ayan Banerjee, Sandeep K. S. Gupta. 1-10 [doi]
- Xevolver: An XML-based code translation framework for supporting HPC application migrationHiroyuki Takizawa, Shoichi Hirasawa, Yasuharu Hayashi, Ryusuke Egawa, Hiroaki Kobayashi. 1-11 [doi]
- Relax-Miracle: GPU parallelization of semi-analytic fourier-domain solvers for earthquake modelingSagar Shrishailappa Masuti, Sylvain Barbot, Nachiket Kapre. 1-10 [doi]
- An early experience of regional ocean modelling on intel many integrated core architectureSrikanth Yalavarthi, Akshara Kaginalkar. 1-6 [doi]
- Designing efficient small message transfer mechanism for inter-node MPI communication on InfiniBand GPU clustersRong Shi, Sreeram Potluri, Khaled Hamidouche, Jonathan L. Perkins, Mingzhe Li, Davide Rossetti, Dhabaleswar K. Panda. 1-10 [doi]
- Saving energy by exploiting residual imbalances on iterative applicationsEdson L. Padoin, Márcio Bastos Castro, Laércio Lima Pilla, Philippe Olivier Alexandre Navaux, Jean-François Mehaut. 1-10 [doi]
- Reducing elimination tree height for parallel LU factorization of sparse unsymmetric matricesEnver Kayaaslan, Bora Uçar. 1-10 [doi]