GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs

ArXiv, 2020

  • Propose and implement an efficient runtime system for accelerating GNN on GPU;
  • The proposed system can effectively leverage the input-level information of GNNs for guiding system-level optimizations on GPUs;
  • Rigorous experiments and comparisons with existing GNN frameworks, such as DGL, demonstrate the effectiveness of our system.

Download here

Boosting Deep Neural Network Efficiency with Dual-Module Inference

ICML, 2020

  • Develop a light-weighted auxiliary “little” module with random projection and weight quantization for probing Neural Network (NN) layerwise output sparsity to facilitate NN inference acceleration;
  • The proposed scheme can be easily applied to various types of neural networks, such as CNN, and LSTM. (e.g., on ResNet-18, and it outperforms the state-of-the-art solutions with much higher FLOPs reduction, memory saving and model accuracy);
  • The proposed scheme can also be applied to various tasks, such as object detection (SSD: Single Shot MultiBox Detector).

Download here