-
公开(公告)号:US10832374B2
公开(公告)日:2020-11-10
申请号:US14997448
申请日:2016-01-15
Inventor: Henning Zimmer , Olga Sorkine Hornung , Oliver Wang , Alexander Sorkine Hornung , Wenzel Jakob , Fabrice Pierre Armand Rousselle , Wojciech Krzysztof Jarosz , David M. Adler
Abstract: Particular embodiments perform a light path analysis of an image comprising a scene, wherein the scene comprises at least one refractive or reflective object. The image may be decomposed based on the light path analysis into a plurality of components, each of the components representing a contribution to lighting in the scene by a different type of light interaction. For each of the components, one or more motion vectors are extracted for each of the components in order to capture motion in the scene. Finally, a final contribution of each of the components to the image is computed based on the motion vectors.
-
公开(公告)号:US10270945B2
公开(公告)日:2019-04-23
申请号:US14308952
申请日:2014-06-19
Applicant: Disney Enterprises, Inc.
Inventor: Oliver Wang , Christopher Schroers , Henning Zimmer , Alexander Sorkine Hornung , Markus Gross
Abstract: There are provided systems and methods for an interactive synchronization of multiple videos. An example system includes a memory storing a first video and a second video, the first video including first video clips and the second video including second video clips. The system further includes a processor configured to calculate a histogram based on a number of features that are similar between the first video clips and the second video clips, generate a cost matrix based on the histogram, generate a first graph that includes first nodes based on the cost matrix, compute a path through the graph using the nodes, and align the first video with the second video using the path, where the path corresponds to playback speeds for the first video and the second video.
-
3.
公开(公告)号:US09900505B2
公开(公告)日:2018-02-20
申请号:US14339253
申请日:2014-07-23
Inventor: Federico Perazzi , Alexander Sorkine Hornung , Henning Zimmer , Oliver Wang , Peter Kaufmann , Scott Watson
CPC classification number: H04N5/23238 , G06T3/4038 , H04N5/2258 , H04N5/247 , H04N5/2628 , H04N5/265
Abstract: Systems and methods for generating a panoramic video from unstructured camera arrays. The systems and methods are configured to statically align corresponding image-frames of respective input video streams, warp the aligned image-frames according to a warping-order, and relax the warped image-frames thereby generating a temporally coherent panoramic video. Methods according to embodiments this invention utilize a new parallax-warping-error metric that is devised to capture structural differences created by parallax artifacts. The parallax-warping-error metric is effective in finding an optimal warping-order and in driving the warping process, resulting in a panoramic video with minimal parallax artifacts.
-
公开(公告)号:US20170345151A1
公开(公告)日:2017-11-30
申请号:US15165056
申请日:2016-05-26
Applicant: DISNEY ENTERPRISES, INC. , ETH Zürich
Inventor: Alexander Sorkine Hornung , Federico Perazzi , Oliver Wang , Nicolas Märki
CPC classification number: G06T7/10 , G06T2207/10016 , G06T2207/20092 , G06T2207/20112 , G11B27/28 , G11B27/3081 , G11B27/34
Abstract: This disclosure relates to system and methods for segmenting a video in a higher order dimensional space. A video may be segmented by obtaining visual information defining an image of the video. The visual information may include pixels of the image and may be represented in a display space having a first dimensionality. A designation of a subset of the visual information represented in the display space as a part of an object portrayed in the image may be obtained. The visual information and the designation may be represented in the higher order dimensional space having a second dimensionality greater than the first dimensionality. An association of the visual information represented in the higher order dimensional space with the object may be obtained. The association may be correlated with the visual information represented in the display space. The correlation may define a location of the object in the image.
-
公开(公告)号:US20170236290A1
公开(公告)日:2017-08-17
申请号:US15045102
申请日:2016-02-16
Applicant: Disney Enterprises, Inc.
Inventor: Alexander Sorkine Hornung , Federico Perazzi , Oliver Wang
CPC classification number: G06T7/11 , G06K9/00765 , G06K9/469 , G06K9/6273 , G06K9/6297 , G06T7/143 , G06T7/162 , G06T7/194 , G06T7/215 , G06T7/269 , G06T7/90 , G06T2207/10016 , G06T2207/10024 , G06T2207/20076 , G06T2207/20081 , G06T2207/20152 , G06T2207/30241
Abstract: Techniques and systems are described for performing video segmentation using fully connected object proposals. For example, a number of object proposals for a video sequence are generated. A pruning step can be performed to retain high quality proposals that have sufficient discriminative power. A classifier can be used to provide a rough classification and subsampling of the data to reduce the size of the proposal space, while preserving a large pool of candidate proposals. A final labeling of the candidate proposals can then be determined, such as a foreground or background designation for each object proposal, by solving for a posteriori probability of a fully connected conditional random field, over which an energy function can be defined and minimized.
-
公开(公告)号:US09571786B1
公开(公告)日:2017-02-14
申请号:US14884046
申请日:2015-10-15
Applicant: Disney Enterprises, Inc. , ETH Zürich
Inventor: Henning Zimmer , Alexander Sorkine Hornung , Simone Meyer , Max Grosse , Oliver Wang
CPC classification number: H04N7/0135 , G06T3/4084 , H04N5/783 , H04N7/0132
Abstract: Interpolating frames of a video may provide a technique for one or more of frame rate conversion, temporal upsampling for generating slow motion video, image morphing, virtual view synthesis, and/or other video applications. A system may be configured to interpolated frames of a video by leveraging frequency domain representations of individual frames. The frequency domain representations may be decomposed into set of discrete functions that make up the frequency domain representations. Corresponding functions from sets of functions associated with frames with which an interpolated frame is to be determined may be identified. Phase differences between corresponding functions may be determined. Interpolated functions between the corresponding functions may be determined based on the determined phased differences. Information describing spatial domain representations of interpolated frames may be determined based on the interpolated functions.
Abstract translation: 视频的插值帧可以提供用于帧速率转换,用于产生慢动作视频的时间上采样,图像变形,虚拟视图合成和/或其他视频应用中的一个或多个的技术。 系统可以被配置为通过利用各个帧的频域表示来内插视频的帧。 频域表示可以被分解为组成频域表示的离散函数集合。 可以识别与要被确定内插帧的帧相关联的函数集的相应函数。 可以确定相应功能之间的相位差。 可以基于确定的相位差来确定相应功能之间的内插函数。 可以基于内插函数来确定描述插值帧的空间域表示的信息。
-
公开(公告)号:US20160373717A1
公开(公告)日:2016-12-22
申请号:US14743618
申请日:2015-06-18
Applicant: Disney Enterprises, Inc.
Inventor: Oliver Wang , Marcus Magnor , Felix Klose , Jean-Charles Bazin , Alexander Sorkine Hornung
CPC classification number: G06T7/90 , G06T2207/10016 , G06T2207/10028 , H04N13/111 , H04N13/15
Abstract: There is provided a video processing system for use with a video having frames including a first frame and neighboring frames of the first frame. The system includes a memory storing a video processing application, and a processor. The processor is configured to execute the video processing application to sample scene points corresponding to an output pixel of the first frame of the frames of the video, the scene points including alternate observations of a same scene point from the neighboring frames of the first frame of the video, and filter the scene points corresponding to the output pixel to determine a color of the output pixel by calculating a weighted combination of the scene points corresponding to the output pixel.
Abstract translation: 提供了一种视频处理系统,用于具有包括第一帧和第一帧的相邻帧的帧的视频。 该系统包括存储视频处理应用的存储器和处理器。 处理器被配置为执行视频处理应用以对与视频的帧的第一帧的输出像素相对应的场景点,场景点包括来自第一帧的相邻帧的相同场景点的交替观察 视频,并且对与输出像素相对应的场景点进行滤波,以通过计算与输出像素对应的场景点的加权组合来确定输出像素的颜色。
-
公开(公告)号:US10832375B2
公开(公告)日:2020-11-10
申请号:US14997453
申请日:2016-01-15
Inventor: Henning Zimmer , Olga Sorkine Hornung , Oliver Wang , Alexander Sorkine Hornung , Wenzel Jakob , Fabrice Pierre Armand Rousselle , Wojciech Krzysztof Jarosz , David M. Adler
Abstract: Particular embodiments decompose an image comprising a scene into a diffuse component and a specular component. Each of the components represent a contribution to lighting in the scene. A set of motion vectors may be extracted in order to capture motion in the scene. Finally, a final contribution of each of the components to the image may be computed based on the motion vectors.
-
公开(公告)号:US20190096094A1
公开(公告)日:2019-03-28
申请号:US15715935
申请日:2017-09-26
Applicant: DISNEY ENTERPRISES, INC.
Inventor: Alex Sorkine-Hornung , Simone Meier , Jean-Charles Bazin , Sasha Schriber , Markus Gross , Oliver Wang
IPC: G06T7/00 , H04N19/583 , G06T7/207
Abstract: The present disclosure relates to an apparatus, system and method for processing transmedia content data. More specifically, the disclosure provides for identifying and inserting one item of media content within another item of media content, e.g. inserting a video within a video, such that the first item of media content appears as part of the second item. The invention involves analysing a first visual media item to identify one or more spatial locations to insert the second visual media item within the image data of the first visual media item, detecting characteristics of the one or more identified spatial locations, transforming the second visual media item according to the detected characteristics and combining the first visual media item and second visual media item by inserting the transformed second visual media item into the first visual media item at the one or more identified spatial locations.
-
10.
公开(公告)号:US10057561B2
公开(公告)日:2018-08-21
申请号:US15583772
申请日:2017-05-01
Applicant: Disney Enterprises, Inc. , ETH Zurich
Inventor: Benjamin Resch , Hendrik Lensch , Marc Pollefeys , Oliver Wang , Alexander Sorkine Hornung
CPC classification number: H04N13/264 , G06K9/00664 , G06T7/246 , G06T7/73 , G06T2207/10021 , G06T2207/30244
Abstract: Scenes reconstruction may be performed using videos that capture the scenes at high resolution and frame rate. Scene reconstruction may be associated with determining camera orientation and/or location (“camera pose”) throughout the video, three-dimensional coordinates of feature points detected in frames of the video, and/or other information. Individual videos may have multiple frames. Feature points may be detected in, and tracked over, the frames. Estimations of camera pose may be made for individual subsets of frames. One or more estimations of camera pose may be determined as fixed estimations. The estimated camera poses for the frames included in the subsets of frames may be updated based on the fixed estimations. Camera pose for frames not included in the subsets of frames may be determined to provide globally consistent camera poses and three-dimensional coordinates for feature points of the video.
-
-
-
-
-
-
-
-
-