计算机科学与探索 ›› 2021, Vol. 15 ›› Issue (11): 2093-2104.DOI: 10.3778/j.issn.1673-9418.2104012

• 综述·探索 • 上一篇    下一篇

FPGA加速深度学习综述

刘腾达,朱君文,张一闻   

  1. 1. 武警工程大学 研究生大队,西安 710086
    2. 武警工程大学 信息工程学院,西安 710086
  • 出版日期:2021-11-01 发布日期:2021-11-09

Review on FPGA-Based Accelerators in Deep Learning

LIU Tengda, ZHU Junwen, ZHANG Yiwen   

  1. 1. Postgraduate Group, Engineering University of PAP, Xi'an 710086, China
    2. School of Information Engineering, Engineering University of PAP, Xi'an 710086, China
  • Online:2021-11-01 Published:2021-11-09

摘要:

近年来,由于互联网的高速发展和大数据时代的来临,人工智能随之大热,而推动人工智能迅猛发展的正是深度学习的崛起。大数据时代需要迫切解决的问题是如何将极为复杂繁多的数据进行有效的分析使用,进而充分挖掘利用数据的价值并造福人类。深度学习作为一种实现机器学习的技术,正是解决这一问题的重要法宝,它在处理数据过程中发挥着重要作用并且改变了传统的机器学习方法,已被广泛应用于语音识别、图像识别和自然语言处理等研究领域。如何有效加速深度学习的计算能力一直是科研研究的重点。FPGA凭借其强大的并行计算能力和低功耗等优势成为GPU在加速深度学习领域的有力竞争者。从深度学习的几种典型模型出发,在FPGA加速技术现有特点的基础上从针对神经网络模型的加速器、针对具体问题的加速器、针对优化策略的加速器和针对硬件模板的加速器四方面概括总结了FPGA加速深度学习的研究现状,然后对比了不同加速技术和模型的性能,最后对未来可能发展的方向进行了展望。

关键词: 深度学习, 神经网络, 现场可编程逻辑门阵列(FPGA), 硬件加速

Abstract:

For the past few years, with rapid development of Internet and big data,  artificial intelligence has become popular, and it is the rise of deep learning that promotes the rapid development of AI. The problem that needs to be solved urgently in the era of big data is how to effectively analyze and use extremely complex and diverse data, and then make full use of the value of data and benefit mankind. As a technology of machine learning, deep learning which has been widely used in speech recognition, image recognition, natural language processing and many other fields is an important magic weapon to solve this problem. It plays an increasingly important role in data processing and changes traditional machine learning methods. How to effectively accelerate the computing power of deep learning has always been the focus of scientific research. With strong parallel computing power and low power consumption, FPGA has become a strong competitor of GPU in the field of deep learning acceleration. Starting from the typical models of deep learning, on the basis of the existing characteristics of FPGA acceleration technology, the research status of various accelerators is summarized from four aspects: accelerators for neural network models, accelerators for a specific application, accelerators for optimization strategies, and general accelerator frameworks with hardware templates. Then, the performance of different acceleration technologies in different models is compared. Finally, the possible development direction in the future is prospected.

Key words: deep learning, neural networks, field programmable gate array (FPGA), hardware accelerator