Joint learning of geometry and motion with three-dimensional holistic understanding

Invention Grant

US10970856B2 Joint learning of geometry and motion with three-dimensional holistic understanding 有权

Please log in to see more content

Patent Title: Joint learning of geometry and motion with three-dimensional holistic understanding
Application No.: US16233622

Application Date: 2018-12-27
Publication No.: US10970856B2

Publication Date: 2021-04-06
Inventor: Peng Wang , Chenxu Luo , Yang Wang
Applicant: Baidu USA, LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA, LLC
Current Assignee: Baidu USA, LLC
Current Assignee Address: US CA Sunnyvale
Agency: North Weber & Baugh LLP
Main IPC: G06T7/254
IPC: G06T7/254 ; G06T7/579 ; G06K9/74 ; G06N3/08 ; G06N20/00 ; G06T7/62

Joint learning of geometry and motion with three-dimensional holistic understanding

Abstract:

Described herein are systems and methods for jointly learning geometry and motion with three-dimensional holistic understanding. In embodiments, such approaches enforce the inherent geometrical consistency during the learning process, yielding improved results for both tasks. In embodiments, three parallel networks are adopted to predict the camera motion (e.g., MotionNet), dense depth map (e.g., DepthNet), and per-pixel optical flow between consecutive frames (e.g., FlowNet), respectively. The information of 2D flow, camera pose, and depth maps, are fed into a holistic 3D motion parser (HMP) to disentangle and recover per-pixel 3D motion of both rigid background and moving objects. Various loss terms are formulated to jointly supervise the three networks. Embodiments of an efficient iterative training strategy are disclosed for better performance and more efficient convergence. Performance on depth estimation, optical flow estimation, odometry, moving object segmentation, and scene flow estimation demonstrates the effectiveness of the disclosed systems and methods.

Public/Granted literature

US20200211206A1 JOINT LEARNING OF GEOMETRY AND MOTION WITH THREE-DIMENSIONAL HOLISTIC UNDERSTANDING Public/Granted day:2020-07-02

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06T	一般的图像数据处理或产生
G06T7/00	图像分析
G06T7/20	.运动分析（数字视频信号的编解码或解压缩的运动估计入H04N19/43,H04N19/51）
G06T7/254	..涉及图片相减