Abstract is missing.
- Advances in Incremental PCA AlgorithmsTal Halpern, Sivan Toledo. 3-13 [doi]
- Algorithms for Forward and Backward Solution of the Fokker-Planck Equation in the Heliospheric Transport of Cosmic RaysAnna Wawrzynczak, Renata Modzelewska, Agnieszka Gil. 14-23 [doi]
- Efficient Evaluation of Matrix PolynomialsNiv Hoffman, Oded Schwartz, Sivan Toledo. 24-35 [doi]
- A Comparison of Soft-Fault Error Models in the Parallel Preconditioned Flexible GMRESEvan Coleman, Aygul Jamal, Marc Baboulin, Amal Khabou, Masha Sosonkina. 36-46 [doi]
- Multilayer Approach for Joint Direct and Transposed Sparse Matrix Vector Multiplication for Multithreaded CPUsIvan Simecek, Daniel Langr, Ivan Kotenkov. 47-56 [doi]
- Comparison of Parallel Time-Periodic Navier-Stokes SolversPeter Arbenz, Daniel Hupp, Dominik Obrist. 57-67 [doi]
- Blocked Algorithms for Robust Solution of Triangular Linear SystemsCarl Christian Kjelgaard Mikkelsen, Lars Karlsson. 68-78 [doi]
- A Comparison of Accuracy and Efficiency of Parallel Solvers for Fractional Power Diffusion ProblemsRaimondas Ciegis, Vadimas Starikovicius, Svetozar Margenov, Rima Kriauziene. 79-89 [doi]
- Efficient Cross Section Reconstruction on Modern Multi and Many Core ArchitecturesYunsong Wang, François-Xavier Hugot, Emeric Brun, Fausto Malvagi, Christophe Calvin. 90-100 [doi]
- Parallel Assembly of ACA BEM Matrices on Xeon Phi ClustersMichal Kravcenko, Lukás Malý, Michal Merta, Jan Zapletal. 101-110 [doi]
- Stochastic Bounds for Markov Chains on Intel Xeon Phi CoprocessorJaroslaw Bylina. 111-120 [doi]
- Fast DEM Collision Checks on Multicore NodesKonstantinos Krestenitis, Tobias Weinzierl, Tomasz Koziara. 123-132 [doi]
- A Space and Bandwidth Efficient Multicore Algorithm for the Particle-in-Cell MethodYann Barsamian, Arthur Charguéraud, Alain Ketterlin. 133-144 [doi]
- Load Balancing for Particle-in-Cell Plasma Simulation on Multicore SystemsAnton Larin, Sergey Bastrakov, Alexei Bashinov, Evgeny Efimenko, Igor Surmin, Arkady Gonoskov, Iosif Meyerov. 145-155 [doi]
- The Impact of Particle Sorting on Particle-In-Cell Simulation PerformanceAndrzej Dorobisz, Michal Kotwica, Jacek Niemiec, Oleh Kobzar, Artem Bohdan, Kazimierz Wiatr. 156-165 [doi]
- TaskUniVerse: A Task-Based Unified Interface for Versatile Parallel ExecutionAfshin Zafari. 169-184 [doi]
- Comparison of Time and Energy Oriented Scheduling for Task-Based ProgramsThomas Rauber, Gudula Rünger. 185-196 [doi]
- Experiments with Sparse Cholesky Using a Parametrized Task Graph ImplementationIain S. Duff, Florent Lopez. 197-206 [doi]
- A Task-Based Algorithm for Reordering the Eigenvalues of a Matrix in Real Schur FormMirko Myllykoski. 207-216 [doi]
- Radix Tree for Binary Sequences on GPUKrzysztof Kaczmarski, Albert Wolant. 219-231 [doi]
- A Comparison of Performance Tuning Process for Different Generations of NVIDIA GPUs and an Example Scientific Computing AlgorithmKrzysztof Banas, Filip Kruzel, Jan Bielanski, Kazimierz Chlon. 232-242 [doi]
- NVIDIA GPUs Scalability to Solve Multiple (Batch) Tridiagonal Systems Implementation of cuThomasBatchPedro Valero-Lara, Ivan Martínez-Perez, Raúl Sirvent, Xavier Martorell, Antonio J. Peña. 243-253 [doi]
- Two-Echelon System Stochastic Optimization with R and CUDAWitold Andrzejewski, Maciej Drozdowski, Gang Mu, Yong-Chao Sun. 254-264 [doi]
- Parallel Hierarchical Agglomerative Clustering for fMRI DataMélodie Angeletti, Jean-Marie Bonny, Franck Durif, Jonas Koko. 265-275 [doi]
- Two Parallelization Schemes for the Induction of Nondeterministic Finite Automata on PCsTomasz Jastrzab. 279-289 [doi]
- Approximating Personalized Katz Centrality in Dynamic GraphsEisha Nathan, David A. Bader. 290-302 [doi]
- Graph-Based Speculative Query Execution for RDBMSAnna Sasak-Okon, Marek Tudruj. 303-313 [doi]
- A GPU Implementation of Bulk Execution of the Dynamic Programming for the Optimal Polygon TriangulationKohei Yamashita, Yasuaki Ito, Koji Nakano. 314-323 [doi]
- Early Performance Evaluation of the Hybrid Cluster with Torus Interconnect Aimed at Molecular-Dynamics SimulationsVladimir V. Stegailov, Alexander Agarkov, Sergey Biryukov, Timur Z. Ismagilov, Mikhail Khalilov, Nikolay Kondratyuk, Evgeny Kushtanov, Dmitry Makagon, Anatoly Mukosey, Alexander Semenov, Alexey Simonov, Alexey Timofeev, Vyacheslav S. Vecher. 327-336 [doi]
- Load Balancing for CPU-GPU Coupling in Computational Fluid DynamicsImmo Huismann, Matthias Lieber, Jörg Stiller, Jochen Fröhlich. 337-347 [doi]
- Implementation and Performance Analysis of 2.5D-PDGEMM on the K ComputerDaichi Mukunoki, Toshiyuki Imamura. 348-358 [doi]
- An Approach for Detecting Abnormal Parallel Applications Based on Time Series Analysis MethodsDenis Shaykhislamov, Vadim Voevodin. 359-369 [doi]
- Prediction of the Inter-Node Communication Costs of a New Gyrokinetic Code with Toroidal DomainAndreas Jocksch, Noé Ohana, Emmanuel Lanti, Aaron Scheinberg, Stephan Brunner, Claudio Gheller, Laurent Villard. 370-380 [doi]
- D-Spline Performance Tuning Method Flexibly Responsive to Execution Time PerturbationGuning Fan, Masayoshi Mochizuki, Akihiro Fujii, Teruo Tanaka, Takahiro Katagiri. 381-391 [doi]
- Dfuntest: A Testing Framework for Distributed ApplicationsGrzegorz Milka, Krzysztof Rzadca. 395-405 [doi]
- Security Monitoring and Analytics in the Context of HPC Processing ModelMikolaj Dobski, Gerard Frankowski, Norbert Meyer, Maciej Milostan, Michal Pilc. 406-416 [doi]
- Multidimensional Performance and Scalability Analysis for Diverse Applications Based on System Monitoring DataMaya Neytcheva, Sverker Holmgren, Jonathan Bull, Ali Dorostkar, Anastasia Kruchinina, Dmitry A. Nikitenko, Nina Popova, Pavel Shvets, Alexey Teplov, Vadim Voevodin, Vladimir V. Voevodin. 417-431 [doi]
- Bridging the Gap Between HPC and Cloud Using HyperFlow and PaaSageDennis Hoppe, Yosandra Sandoval, Anthony Sulistio, Maciej Malawski, Bartosz Balis, Maciej Pawlik, Kamil Figiela, Dariusz Król 0002, Michal Orzechowski, Jacek Kitowski, Marian Bubak. 432-442 [doi]
- A Memory Efficient Parallel All-Pairs Computation Framework: Computation - Communication OverlapVenkata Kasi Viswanath Yeleswarapu, Arun K. Somani. 443-458 [doi]
- Automatic Parallelization of ANSI C to CUDA C ProgramsJan Kwiatkowski, Dzanan Bajgoric. 459-470 [doi]
- Consistency Models for Global Scalable Data Access ServicesMichal Wrzeszcz, Darin Nikolow, Tomasz Lichon, Rafal Slota, Lukasz Dutka, Renata Slota, Jacek Kitowski. 471-480 [doi]
- Global State Monitoring in Optimization of Parallel Event-Driven SimulationLukasz Masko, Marek Tudruj. 483-494 [doi]
- High Performance Optimization of Independent Component Analysis Algorithm for EEG DataAnna Gajos-Balinska, Grzegorz M. Wójcik, Przemyslaw Stpiczynski. 495-504 [doi]
- Continuous and Discrete Models of Melanoma Progression Simulated in Multi-GPU EnvironmentWitold Dzwinel, Adrian Klusek, Rafal Wcislo, Marta Panuszewska, Pawel Topa. 505-518 [doi]
- Early Experience on Using Knights Landing Processors for Lattice Boltzmann ApplicationsEnrico Calore, Alessandro Gabbana, Sebastiano Fabio Schifano, Raffaele Tripiccione. 519-530 [doi]
- Towards a Model of Semi-supervised Learning for the Syntactic Pattern Recognition-Based Electrical Load Prediction SystemJanusz Jurek. 533-543 [doi]
- Parallel Processing of Color Digital Images for Linguistic Description of Their ContentKrzysztof Wiaderek, Danuta Rutkowska, Elisabeth Rakus-Andersson. 544-554 [doi]
- Co-evolution of Fitness Predictors and Deep Neural NetworksWlodzimierz Funika, Pawel Koperek. 555-564 [doi]
- Performance Evaluation of DBN Learning on Intel Multi- and Manycore ArchitecturesTomasz Olas, Wojciech K. Mleczko, Marcin Wozniak, Robert K. Nowicki, Pawel Gepner. 565-575 [doi]
- On the Tunability of a New Hessenberg Reduction Algorithm Using Parallel Cache AssignmentMahmoud Eljammaly, Lars Karlsson, Bo Kågström. 579-589 [doi]
- New Preconditioning for the One-Sided Block-Jacobi SVD AlgorithmMartin Becka, Gabriel Oksa, Eva Vidlicková. 590-599 [doi]
- Structure-Preserving Technique in the Block SS-Hankel Method for Solving Hermitian Generalized Eigenvalue ProblemsAkira Imakura, Yasunori Futamura, Tetsuya Sakurai. 600-611 [doi]
- On Using the Cholesky QR Method in the Full-Blocked One-Sided Jacobi AlgorithmShuhei Kudo, Yusaku Yamamoto. 612-622 [doi]
- Parallel Divide-and-Conquer Algorithm for Solving Tridiagonal Eigenvalue Problems on Manycore SystemsYusuke Hirota, Toshiyuki Imamura. 623-633 [doi]
- Partial Inverses of Complex Block Tridiagonal MatricesLouise Spellacy, Darach Golden. 634-645 [doi]
- Parallel Nonnegative Matrix Factorization Based on Newton Iteration with Improved Convergence BehaviorRade Kutil, Markus Flatz, Marián Vajtersic. 646-655 [doi]