1. WO2020117505 - SWITCHING BETWEEN SPEECH RECOGNITION SYSTEMS

Publication Number WO/2020/117505
Publication Date 11.06.2020
International Application No. PCT/US2019/062867
International Filing Date 22.11.2019
IPC
G10L 15/26 2006.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
26 Speech to text systems
G10L 15/22 2006.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
22 Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/28 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
28 Constructional details of speech recognition systems
H04M 3/42 2006.01
H ELECTRICITY
04 ELECTRIC COMMUNICATION TECHNIQUE
M TELEPHONIC COMMUNICATION
3 Automatic or semi-automatic exchanges
42 Systems providing special services or facilities to subscribers
CPC
G10L 15/22
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
22 Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 15/26
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
26 Speech to text systems
G10L 15/28
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
28 Constructional details of speech recognition systems
H04M 2201/39
H ELECTRICITY
04 ELECTRIC COMMUNICATION TECHNIQUE
M TELEPHONIC COMMUNICATION
2201 Electronic components, circuits, software, systems or apparatus used in telephone systems
39 using speech synthesis
H04M 2201/40
H ELECTRICITY
04 ELECTRIC COMMUNICATION TECHNIQUE
M TELEPHONIC COMMUNICATION
2201 Electronic components, circuits, software, systems or apparatus used in telephone systems
40 using speech recognition
H04M 2203/552
H ELECTRICITY
04 ELECTRIC COMMUNICATION TECHNIQUE
M TELEPHONIC COMMUNICATION
2203 Aspects of automatic or semi-automatic exchanges
55 related to network data storage and management
552 Call annotations
Applicants
  • SORENSON IP HOLDINGS, LLC [US]/[US]
Inventors
  • THOMSON, David
  • BLACK, David
  • SKAGGS, Jonathan
  • BOEHME, Kenneth
  • ROYLANCE, Shane
Agents
  • PARKE, Brian
  • BARBER, Daniel, R.
  • BENNS, Jonathan, M.
  • BRAITHWALTE, Jared, J
  • BROOKS, Michelle
Priority Data
16/209,594 04.12.2018 US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) SWITCHING BETWEEN SPEECH RECOGNITION SYSTEMS
(FR) COMMUTATION ENTRE DES SYSTÈMES DE RECONNAISSANCE VOCALE
Abstract
(EN)
A method may include obtaining first audio data originating at a first device during a communication session between the first device and a second device. The method may also include obtaining an availability of revoiced transcription units in a transcription system and, in response to establishment of the communication session, selecting, based on the availability of revoiced transcription units, a revoiced transcription unit instead of a non-revoiced transcription unit to generate a transcript of the first audio data. The method may also include obtaining revoiced audio generated by a revoicing of the first audio data by a captioning assistant and generating a transcription of the revoiced audio using an automatic speech recognition system. The method may further include, in response to selecting the revoiced transcription unit, directing the transcription of the revoiced audio to the second device as the transcript of the first audio data.
(FR)
L'invention concerne un procédé qui peut consister à obtenir des premières données audio provenant d'un premier dispositif durant une session de communication entre le premier dispositif et un second dispositif. Le procédé peut également consister à obtenir une disponibilité d'unités de transcription répétée dans un système de transcription et, en réponse à l'établissement de la session de communication, sélectionner, sur la base de la disponibilité d'unités de transcription répétée, une unité de transcription répétée au lieu d'une unité de transcription non-répétée pour générer une transcription des premières données audio. Le procédé peut également consister à obtenir un audio répété généré par une répétition des premières données audio par un assistant de sous-titrage, et générer une transcription de l'audio répété à l'aide d'un système de reconnaissance vocale automatique. Le procédé peut en outre consister, en réponse à la sélection de l'unité de transcription répétée, à diriger la transcription de l'audio répété vers le second dispositif en tant que transcription des premières données audio.
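The selection step in the abstract can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The abstract does not say how availability is measured, what threshold applies, or what happens when no revoiced unit is free, so the idle-unit count, the `min_available` parameter, and the fallback to a non-revoiced unit are all assumptions. Every name below (`UnitKind`, `TranscriptionSystem`, `select_unit`, `transcribe`) is hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class UnitKind(Enum):
    REVOICED = "revoiced"          # captioning assistant revoices the audio, then ASR runs
    NON_REVOICED = "non_revoiced"  # ASR runs directly on the original audio


@dataclass
class TranscriptionSystem:
    # Assumption: availability is modeled as a count of idle revoiced units.
    available_revoiced_units: int


def select_unit(system: TranscriptionSystem, min_available: int = 1) -> UnitKind:
    """Choose a unit kind when a communication session is established.

    Assumption: a revoiced unit is chosen whenever at least
    `min_available` are idle; otherwise fall back to a non-revoiced unit.
    """
    if system.available_revoiced_units >= min_available:
        return UnitKind.REVOICED
    return UnitKind.NON_REVOICED


def transcribe(
    audio: str,
    kind: UnitKind,
    revoice: Callable[[str], str],
    asr: Callable[[str], str],
) -> str:
    """Produce the transcript that would be directed to the second device."""
    if kind is UnitKind.REVOICED:
        # ASR is applied to the revoiced audio, not the original audio.
        return asr(revoice(audio))
    return asr(audio)
```

Here `revoice` and `asr` are stand-in callables supplied by the caller; in a real system they would wrap the captioning assistant's audio stream and an automatic speech recognition engine.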