Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020117506 - TRANSCRIPTION GENERATION FROM MULTIPLE SPEECH RECOGNITION SYSTEMS

Publication Number WO/2020/117506
Publication Date 11.06.2020
International Application No. PCT/US2019/062870
International Filing Date 22.11.2019
IPC
G10L 15/26 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
H04M 3/42 2006.01
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
MTELEPHONIC COMMUNICATION
3Automatic or semi-automatic exchanges
42Systems providing special services or facilities to subscribers
H04M 1/247 2006.01
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
MTELEPHONIC COMMUNICATION
1Substation equipment, e.g. for use by subscribers
247Telephone sets including user guidance or feature selection means facilitating their use
G10L 15/32 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
CPC
G10L 15/187
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
183using context dependencies, e.g. language models
187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 15/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
G10L 15/30
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
G10L 15/32
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
H04M 1/2475
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
MTELEPHONIC COMMUNICATION
1Substation equipment, e.g. for use by subscribers; Analogous equipment at exchanges
247Telephone sets including user guidance or features selection means facilitating their use; ; Fixed telephone terminals for accessing a variety of communication services via the PSTN network
2474Telephone terminals specially adapted for disabled people
2475for a hearing impaired user
Applicants
  • SORENSON IP HOLDINGS, LLC [US]/[US]
Inventors
  • THOMSON, David
  • ADAMS, Jadie
  • SKAGGS, Jonathan
  • MCCLELLAN, Joshua
  • ROYLANCE, Shane
Agents
  • PARKE, Brian
  • BARBER, Daniel, R.
  • BENNS, Jonathan, M.
  • BRAITHWAITE, Jared, J.
  • BROOKS, Michelle
Priority Data
16/209,62304.12.2018US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) TRANSCRIPTION GENERATION FROM MULTIPLE SPEECH RECOGNITION SYSTEMS
(FR) GÉNÉRATION DE TRANSCRIPTION À PARTIR DE MULTIPLES SYSTÈMES DE RECONNAISSANCE VOCALE
Abstract
(EN)
A method may include obtaining first audio data originating at a first device during a communication session between the first device and a second device. The method may also include obtaining a first text string that is a transcription of the first audio data, where the first text string may be generated using automatic speech recognition technology using the first audio data. The method may also include obtaining a second text string that is a transcription of second audio data, where the second audio data may include a revoicing of the first audio data by a captioning assistant and the second text string may be generated by the automatic speech recognition technology using the second audio data. The method may further include generating an output text string from the first text string and the second text string and using the output text string as a transcription of the speech.
(FR)
L'invention concerne un procédé qui peut consister à obtenir des premières données audio provenant d'un premier dispositif durant une session de communication entre le premier dispositif et un second dispositif. Le procédé peut également consister à obtenir une première chaîne de texte qui est une transcription des premières données audio, la première chaîne de texte pouvant être générée à l'aide d'une technologie de reconnaissance vocale automatique en utilisant les premières données audio. Le procédé peut également consister à obtenir une seconde chaîne de texte qui est une transcription de secondes données audio, les secondes données audio pouvant comprendre une reformulation des premières données audio par un assistant de sous-titrage et la seconde chaîne de texte pouvant être générée par la technologie de reconnaissance vocale automatique en utilisant les secondes données audio. Le procédé peut en outre consister à générer une chaîne de texte de sortie à partir de la première chaîne de texte et de la seconde chaîne de texte et à utiliser la chaîne de texte de sortie en tant que transcription de la parole.
Also published as
Latest bibliographic data on file with the International Bureau