


default search action
25th HiPC 2018: Bengaluru, India
- 25th IEEE International Conference on High Performance Computing, HiPC 2018, Bengaluru, India, December 17-20, 2018. IEEE 2018, ISBN 978-1-5386-8386-6

Keynote 1
- Balaraman Ravindran:

Looking Under the Hood of Deep Neural Networks. 1
Technical Session 1: Learning
- Rajarshi Biswas

, Xiaoyi Lu, Dhabaleswar K. Panda:
Accelerating TensorFlow with Adaptive RDMA-Based gRPC. 2-11 - Saurav Basu, Vaibhav Saxena, Rintu Panja, Ashish Verma:

Balancing Stragglers Against Staleness in Distributed Deep Learning. 12-21 - Grey Ballard

, Koby Hayashi
, Ramakrishnan Kannan:
Parallel Nonnegative CP Decomposition of Dense Tensors. 22-31 - Israt Nisa, Aravind Sukumaran-Rajam

, Süreyya Emre Kurt, Changwan Hong, P. Sadayappan
:
Sampled Dense Matrix Multiplication for High-Performance Machine Learning. 32-41 - Prasanna Balaprakash

, Michael Salim, Thomas D. Uram, Venkat Vishwanath, Stefan M. Wild
:
DeepHyper: Asynchronous Hyperparameter Search for Deep Neural Networks. 42-51
Technical Session 2: Graph Algorithms
- Jesun Sahariar Firoz, Marcin Zalewski, Thejaka Amila Kanewala, Andrew Lumsdaine

:
Synchronization-Avoiding Graph Algorithms. 52-61 - Apurba Das, Seyed-Vahid Sanei-Mehri, Srikanta Tirthapura:

Shared-Memory Parallel Maximal Clique Enumeration. 62-71 - Kishore Kothapalli, Mihir Wadwekar:

Expediting Parallel Graph Connectivity Algorithms. 72-81 - Jesun Sahariar Firoz, Marcin Zalewski, Joshua Suetterlein

, Andrew Lumsdaine
:
Adaptive Runtime Features for Distributed Graph Algorithms. 82-91 - Hiroki Kanezashi, Toyotaro Suzumura, Dario Garcia-Gasulla

, Min-hwan Oh, Satoshi Matsuoka:
Adaptive Pattern Matching with Reinforcement Learning for Dynamic Graphs. 92-101 - Priyanka Singla, Shubhankar Suman Singh, K. Gopinath, Smruti Sarangi:

Probabilistic Sequential Consistency in Social Networks. 102-111
Technical Session 3: GPUs
- Kramer Straube, Jason Lowe-Power

, Christopher Nitta
, Matthew K. Farrens, Venkatesh Akella:
Improving Provisioned Power Efficiency in HPC Systems with GPU-CAPP. 112-122 - Hancheng Wu, John Ravi, Michela Becchi

:
Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA. 123-132 - Karthikeyan Natarajan

, Nitin Chandrachoodan
:
Lossless Parallel Implementation of a Turbo Decoder on GPU. 133-142 - Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Xiaoyi Lu, Dhabaleswar K. Panda:

OC-DNN: Exploiting Advanced Unified Memory Capabilities in CUDA 9 and Volta GPUs for Out-of-Core DNN Training. 143-152 - Harichand M. V, Bharatkumar Sharma, G. Sudhakaran, V. Ashok:

Acceleration of an Adaptive Cartesian Mesh CFD Solver in the Current Generation Processor Architectures. 153-161 - Sofia Vallecorsa, Diana Moise, Federico Carminati, Gul Rukh Khattak

:
Data-Parallel Training of Generative Adversarial Networks on HPC Systems for HEP Simulations. 162-171
Keynote 2
- Marc Snir:

The Future of Supercomputing. 172
Technical Session 4: Linear Algebra and Fault Tolerance
- Himeshi De Silva, John L. Gustafson, Weng-Fai Wong

:
Making Strassen Matrix Multiplication Safe. 173-182 - Omer Subasi, Ramakrishna Tipireddy, Sriram Krishnamoorthy:

Quantification, Trade-off Analysis, and Optimal Checkpoint Placement for Reliability and Availability. 183-192 - Muhammed Emin Ozturk, Marissa Renardy, Yukun Li, Gagan Agrawal, Ching-Shan Chou:

A Novel Approach for Handling Soft Error in Conjugate Gradients. 193-202 - Burcu Ozcelik Mutlu

, Gokcen Kestor, Joseph B. Manzano
, Osman S. Unsal, Samrat Chatterjee, Sriram Krishnamoorthy:
Characterization of the Impact of Soft Errors on Iterative Methods. 203-214
Technical Session 5: Algorithms and Data Analysis
- Vasilios I. Kelefouras

, Karim Djemame
:
Workflow Simulation Aware and Multi-threading Effective Task Scheduling for Heterogeneous Computing. 215-224 - Xiaobo Zhu, Guangjun Wu, Hong Zhang, Shupeng Wang, Bingnan Ma:

Dynamic Count-Min Sketch for Analytical Queries Over Continuous Data Streams. 225-234 - Hao Lu, Sudip K. Seal, Jonathan D. Poplawsky

:
Scalable Proximity-Based Methods for Large-Scale Analysis of Atom Probe Data. 235-244 - Sriram Srinivasan, Sara Riazi, Boyana Norris, Sajal K. Das, Sanjukta Bhowmick:

A Shared-Memory Parallel Algorithm for Updating Single-Source Shortest Paths in Large Dynamic Networks. 245-254 - Hariharan Devarajan

, Anthony Kougkas, Prajwal Challa, Xian-He Sun:
Vidya: Performing Code-Block I/O Characterization for Data Access Optimization. 255-264 - Chao Li

, Balaji Palanisamy:
Decentralized Privacy-Preserving Timed Execution in Blockchain-Based Smart Contract Platforms. 265-274
Keynote 3
- Srini Devadas:

Secure High-Performance Computer Architectures: Challenges and Opportunities. 275
Technical Session 6: Applications and System Tools
- Venkatesh-Prasad Ranganath, Daniel Andresen:

Why do Users Kill HPC Jobs? 276-283 - Damon Fenacci, Hans Vandierendonck, Dimitrios S. Nikolopoulos

:
Code and Data Transformations to Address Garbage Collector Performance in Big Data Processing. 284-293 - Shaleen Garg, Kishore Kothapalli, Suresh Purini:

Share-a-GPU: Providing Simple and Effective Time-Sharing on GPUs. 294-303 - Gangyi Zhu, Gagan Agrawal:

A Performance Prediction Framework for Irregular Applications. 304-313 - Jia Guo, Gagan Agrawal:

Achieving Performance and Programmability for MapReduce(-Like) Frameworks. 314-323 - Vasudevan Rengasamy, Mahmut T. Kandemir, Paul Medvedev, Kamesh Madduri:

Parallel Read Partitioning for Concurrent Assembly of Metagenomic Data. 324-333

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














