Method and device for Quasi-Gibbs structure sampling by deep permutation for person identity inference

    公开(公告)号:US10339408B2

    公开(公告)日:2019-07-02

    申请号:US15388039

    申请日:2016-12-22

    Abstract: The present disclosure provides a method and device for visual appearance based person identity inference. The method may include obtaining a plurality of input images. The input images include a gallery set of images containing, persons-of-interest and a probe set of images containing person detections, and one input image corresponds to one person. The method may further include extracting N feature maps from the input images using a Deep Neural Network, N being a natural number; constructing N structure samples of the N feature maps using conditional random field (CRF) graphical models; learning the N structure samples from an implicit common latent feature space embedded in the N structure samples; and according to the learned structures, identifying one or more images from the probe set containing a same person-of-interest as an image in the gallery set.

    Scalable user intent mining using a multimodal restricted boltzmann machine

    公开(公告)号:US09910930B2

    公开(公告)日:2018-03-06

    申请号:US14587727

    申请日:2014-12-31

    CPC classification number: G06F17/30967

    Abstract: A method for scalable user intent mining is provided. The method includes detecting named entities from a plurality of query logs in a public query log dataset and generating features of the plurality of query logs based on the detected named entities. The method also includes applying a multimodal restricted boltzmann machine (RBM) on the generated features of the plurality of query logs to train a public multimodal RBM and generating a plurality of public query representations. Further, the method includes receiving a search query from a user, determining whether there are a plurality of history queries of the user. When there is no history query, user intent is predicted using the public multimodal RBM. When there are the history queries, the public multimodal RBM is applied on the plurality of history queries to train a personalized multimodal RBM, and the user intent is predicted using the personalized multimodal RBM.

    System and method for sharing information among multiple devices

    公开(公告)号:US09866631B2

    公开(公告)日:2018-01-09

    申请号:US14585296

    申请日:2014-12-30

    Inventor: Haohong Wang

    Abstract: A method for sharing information among multiple devices is provided. The method includes a sensing device sensing signals of at least one object associated with a targeting device and extracting at least one feature of the object from the sensed signals. The method also includes the sensing device broadcasting the extracted feature of the object on a determined network containing a plurality of targeting devices and receiving feedbacks from the plurality of targeting devices on the network in response to the broadcasting. Further, the method includes the sensing device automatically identifying one of the plurality of targeting devices based on the received feedbacks, synchronizing information with the identified targeting device and displaying the information to a user of the sensing device.

    Function-based action sequence derivation for personal assistant system

    公开(公告)号:US09619283B2

    公开(公告)日:2017-04-11

    申请号:US14810778

    申请日:2015-07-28

    CPC classification number: G06F9/4881 G06F9/453 G06N5/02 G06N5/022 G06N7/005

    Abstract: A method is provided for recommending a desired func sequence to a user. The method includes obtaining a user intention list including at least one user intention; separating the user intention into a plurality of tasks; and creating a task flow graph for the plurality of tasks based on user usage data. Each vertex in the task flow graph represents a task and indicating an importance of the task. The method also includes creating a func flow graph based on the user usage data and temporal sequences of the tasks and funcs, and each vertex in the func flow graph represents a func and indicating an importance of the func. Further, the method includes determining a desired func sequence to fulfill the user intention based on the user usage data, the task flow graph, and the func flow graph; and recommending the desired func sequence to the user.

    Intelligent TV system and method
    8.
    发明授权
    Intelligent TV system and method 有权
    智能电视系统及方法

    公开(公告)号:US08811673B1

    公开(公告)日:2014-08-19

    申请号:US13865329

    申请日:2013-04-18

    CPC classification number: H04N21/44008 G06F17/30811 G06K9/00771

    Abstract: A method is provided for an intelligent user-interaction system based on object detection. The method includes receiving an input video sequence corresponding to a video program, and dividing the input video sequence into a plurality of video shots, each containing one or more video frames. The method also includes detecting possible object occurrences in each of the plurality of video shots, and analyzing possible paths of an object in a video shot using a multimodal-cue approach. Further, the method includes aggregating the path-based selected object occurrences across the plurality of video shots to detect objects, and generating a complete list of the object occurrences across the plurality of video shots.

    Abstract translation: 提供了一种基于对象检测的智能用户交互系统的方法。 该方法包括接收对应于视频节目的输入视频序列,并且将输入视频序列划分成多个视频镜头,每个视频片段包含一个或多个视频帧。 该方法还包括检测多个视频镜头中的每一个中的可能的对象事件,以及使用多模式提示方法分析视频拍摄中的对象的可能路径。 此外,该方法包括在多个视频镜头之间聚合基于路径的所选择的对象出现以检测对象,以及在多个视频镜头中生成对象出现的完整列表。

    ONE-CLICK FILMMAKING
    9.
    发明公开

    公开(公告)号:US20230237268A1

    公开(公告)日:2023-07-27

    申请号:US17732167

    申请日:2022-04-28

    Inventor: Haohong Wang

    CPC classification number: G06F40/289 G10L15/26 G06F40/205 H04N21/85 H04N21/816

    Abstract: A method and device for one-click filmmaking are provided. The method includes: obtaining a script from a user, detecting a single user operation, in response to the single user operation, obtaining a plurality of shots and estimating information of the plurality of shots based on the script, and automatically generating a film based on an auto-cinematography algorithm and the estimated information of the plurality of shots. The estimated information of one of the plurality of shots comprises at least one of a character of a shot, a scene of the shot, one or more positions of the character in the shot, a duration of the shot, or a shot type.

Patent Agency Ranking