Skip to content

Optimize inference performance of ERNIE on P40 GPU #165

Description

@Xreki

负责人

@Xreki @zhaoyuchen2018

初始性能

  • 测试时间:2019年8月14日
  • 测试者:@Xreki
  • GPU平台信息:Tesla P40
  • 软件信息:
    • Driver Version,418.39
    • CUDA 9.0
    • cuDNN 7.5
  • Paddle commit:
commit 744279fe685dd0b8b426a686d84ad449da02366e
Author: Kevin <liujiezhangbupt@gmail.com>
Date:   Mon Aug 12 10:13:12 2019 +0800

    Refine embedding Api doc (#18820)
  • 测试代码:Add inference benchmark of ernie #164
  • 编译Paddle使用Docker镜像:paddlepaddle/paddle_manylinux_devel:cuda9.0_cudnn7
  • 编译测试程序,测试使用Docker镜像:paddlepaddle/paddle:latest-gpu-cuda9.0-cudnn7-dev
  • 测试结果:
    • GPU ratio,96%
    • Runtime,8.3554 ms/sample

NVIDIA BERT推理解决方案Faster Transformer开源了

Metadata

Metadata

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions