Invention Grant
- Patent Title: Data compaction and memory bandwidth reduction for sparse neural networks
- Application No.: US15422359
- Application Date: 2017-02-01
- Publication No.: US10096134B2
- Publication Date: 2018-10-09
- Inventor: Zhou Yan, Franciscus Wilhelmus Sijstermans, Yuanzhi Hua, Xiaojun Wang, Jeffrey Michael Pool, William J. Dally, Liang Chen
- Applicant: NVIDIA Corporation
- Applicant Address: Santa Clara, CA, US
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: Santa Clara, CA, US
- Agency: Zilka-Kotab, P.C.
- Main IPC: H03M7/00
- IPC: H03M7/00; G06T9/00; G06T1/20; G06T1/60

Abstract:
A method, computer program product, and system for improving the efficiency of sparse convolutional neural networks are described. Multi-bit data for input to a processing element is received at a compaction engine. When the multi-bit data is determined to equal zero, a single-bit signal is transmitted from the memory interface to the processing element in lieu of the multi-bit data, where the single-bit signal indicates that the multi-bit data equals zero. A compacted data sequence for input to a processing element is received by a memory interface and transmitted from the memory interface to an expansion engine. The expansion engine extracts the non-zero values from the compacted data sequence and inserts zeros between them to generate an expanded data sequence, which is output to the processing element.
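The compaction and expansion steps the abstract describes can be sketched in Python. This is a hypothetical illustration, not the patented hardware design: it assumes a per-element zero-flag bitmask as the "single bit signal" encoding, and the function names (`compact`, `expand`) are inventions for the example.

```python
def compact(seq):
    """Replace each element with a single zero/non-zero flag bit and
    keep only the non-zero values, reducing the data transferred
    for sparse sequences. Returns (mask, values)."""
    mask = [1 if v != 0 else 0 for v in seq]      # 1 bit per element
    values = [v for v in seq if v != 0]           # only non-zero payloads
    return mask, values

def expand(mask, values):
    """Reverse of compact: walk the mask and reinsert zeros between
    the non-zero values to rebuild the original sequence."""
    it = iter(values)
    return [next(it) if bit else 0 for bit in mask]
```

For a sparse activation sequence such as `[0, 3, 0, 0, 7, 1, 0]`, only three multi-bit values plus seven flag bits cross the memory interface instead of seven multi-bit values, and `expand` reconstructs the original sequence exactly for the processing element.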
Public/Granted literature
- US20180218518A1 DATA COMPACTION AND MEMORY BANDWIDTH REDUCTION FOR SPARSE NEURAL NETWORKS, Public/Granted day: 2018-08-02