Invention Grant
- Patent Title: Speaker diarization with early-stop clustering
- Application No.: US17432454
- Application Date: 2019-03-29
- Publication No.: US12112759B2
- Publication Date: 2024-10-08
- Inventor: Liping Chen, Kao-Ping Soong
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Schwegman Lundberg & Woessner, P.A.
- International Application: PCT/CN2019/080617, filed 2019-03-29
- International Publication: WO2020/199013A, published 2020-10-08
- National Phase Entry Date: 2021-08-19
- Main IPC: G10L17/16
- IPC: G10L17/16; G10L17/02; G10L17/06; G10L17/18; G10L21/028

Abstract:
A method and apparatus for speaker diarization with early-stop clustering. The method comprises: segmenting an audio stream into at least one speech segment (710), the audio stream comprising speech from at least one speaker; clustering the at least one speech segment into a plurality of clusters (720), the number of clusters being greater than the number of speakers; selecting, from the plurality of clusters, at least one cluster of the highest similarity (730), the number of selected clusters being equal to the number of speakers; establishing a speaker classification model based on the selected at least one cluster (740); and aligning, through the speaker classification model, speech frames in the audio stream to the at least one speaker (750).
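The five steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several parts are assumptions made only for illustration: segments and frames are taken to be precomputed embedding vectors (segmentation step 710 is not shown), "similarity" is read as mean pairwise cosine similarity within a cluster, singleton clusters are ranked last, and a nearest-centroid rule stands in for the speaker classification model. The names `early_stop_clustering`, `cohesion`, and `diarize` are hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def centroid(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def early_stop_clustering(segments, stop_at):
    """Step 720: agglomerative merging that stops while `stop_at` clusters
    remain; `stop_at` is chosen larger than the number of speakers."""
    clusters = [[s] for s in segments]
    while len(clusters) > stop_at:
        # Merge the pair of clusters whose centroids are most similar.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: cosine(centroid(clusters[p[0]]), centroid(clusters[p[1]])),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # safe because i < j
    return clusters


def cohesion(cluster):
    """Assumed similarity score: mean pairwise cosine inside the cluster.
    Singletons are ranked last because they carry no pairwise evidence."""
    if len(cluster) < 2:
        return float("-inf")
    pairs = list(combinations(cluster, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def diarize(segments, frames, num_speakers, stop_at):
    """Steps 720 to 750 on precomputed embeddings; returns one label per frame."""
    clusters = early_stop_clustering(segments, stop_at)
    # Step 730: keep as many clusters as there are speakers, most cohesive first.
    selected = sorted(clusters, key=cohesion, reverse=True)[:num_speakers]
    # Step 740: nearest-centroid stand-in for the speaker classification model.
    models = [centroid(c) for c in selected]
    # Step 750: align each frame to the most similar speaker model.
    return [max(range(len(models)), key=lambda k: cosine(f, models[k])) for f in frames]
```

Stopping the merge early leaves impure or outlier segments in their own small clusters, so they are discarded at the selection step and do not contaminate the speaker models.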
Public/Granted literature
- US20220122615A1 SPEAKER DIARIZATION WITH EARLY-STOP CLUSTERING, Public/Granted day: 2022-04-21