Invention Grant
- Patent Title: Systems and methods for manipulating electronic content based on speech recognition
-
Application No.: US16014178Application Date: 2018-06-21
-
Publication No.: US10657985B2Publication Date: 2020-05-19
- Inventor: Peter F. Kocks , Guoning Hu , Ping-Hao Wu
- Applicant: Oath Inc.
- Applicant Address: US VA Dulles
- Assignee: Oath Inc.
- Current Assignee: Oath Inc.
- Current Assignee Address: US VA Dulles
- Agency: Bookoff McAndrews, PLLC
- Main IPC: G10L25/57
- IPC: G10L25/57 ; G10L15/06 ; G06F16/783 ; G10L17/00 ; G10L15/08 ; H04N21/439 ; G06F16/432 ; H04N21/466

Abstract:
Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.
Public/Granted literature
- US20180301161A1 SYSTEMS AND METHODS FOR MANIPULATING ELECTRONIC CONTENT BASED ON SPEECH RECOGNITION Public/Granted day:2018-10-18
Information query