Method, electronic device and computer readable medium for information processing for accelerating neural network training

Invention Grant

US11640528B2 Method, electronic device and computer readable medium for information processing for accelerating neural network training 有权

Please log in to see more content

Patent Title: Method, electronic device and computer readable medium for information processing for accelerating neural network training
Application No.: US16660259

Application Date: 2019-10-22
Publication No.: US11640528B2

Publication Date: 2023-05-02
Inventor: Zhiyu Cheng , Baopu Li , Yingze Bao
Applicant: Baidu USA LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA LLC
Current Assignee: Baidu USA LLC
Current Assignee Address: US CA Sunnyvale
Agency: Nixon Peabody LLP
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N20/00 ; G06N3/048

Method, electronic device and computer readable medium for information processing for accelerating neural network training

Abstract:

A method for information processing for accelerating neural network training. The method includes: acquiring a neural network corresponding to a deep learning task; and performing iterations of iterative training on the neural network based on a training data set. The training data set includes task data corresponding to the deep learning task. The iterative training includes: processing the task data in the training data set using a current neural network, and determining, based on a processing result of the neural network on the task data in a current iterative training, prediction loss of the current iterative training; determining a learning rate and a momentum in the current iterative training; and updating weight parameters of the current neural network by gradient descent based on a preset weight decay, and the learning rate, the momentum, and the prediction loss in the current iterative training. This method achieves efficient and low-cost deep learning-based neural network training.

Public/Granted literature

US20210117776A1 METHOD, ELECTRONIC DEVICE AND COMPUTER READABLE MEDIUM FOR INFORMATION PROCESSING FOR ACCELERATING NEURAL NETWORK TRAINING Public/Granted day:2021-04-22

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法