Search International and National Patent Collections
Some content of this application is unavailable at the moment.
If this situation persists, please contact us atFeedback&Contact
1. (WO2017136016) RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES
Latest bibliographic data on file with the International Bureau

Pub. No.: WO/2017/136016 International Application No.: PCT/US2016/062753
Publication Date: 10.08.2017 International Filing Date: 18.11.2016
IPC:
G10L 15/19 (2013.01)
G PHYSICS
10
MUSICAL INSTRUMENTS; ACOUSTICS
L
SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15
Speech recognition
08
Speech classification or search
18
using natural language modelling
183
using context dependencies, e.g. language models
19
Grammatical context, e.g. disambiguation of recognition hypotheses based on word sequence rules
Applicants:
GOOGLE LLC [US/US]; 1600 Amphitheatre Parkway Mountain View, California 94043, US
Inventors:
STROHMAN, Trevor D.; US
SCHALKWYK, Johan; US
SKOBELTSYN, Gleb; US
Agent:
GROSVENOR, Stephanie D.; US
JORDAN, David E. A.; US
Priority Data:
15/016,60905.02.2016US
Title (EN) RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES
(FR) NOUVELLE RECONNAISSANCE DE LA PAROLE AVEC DES SOURCES DE DONNÉES EXTERNES
Abstract:
(EN) Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions. Described features may enable data from an external data source to be used in generating more accurate transcriptions without modifying an existing automated speech recognizer, or may avoid re-compiling of an automated speech recognizer.
(FR) La présente invention concerne des procédés, y compris des programmes informatiques codés sur un support de stockage informatique, permettant d'améliorer la reconnaissance de la parole en se basant sur des sources de données externes. Selon un aspect, un procédé consiste à obtenir une transcription candidate initiale d'un énoncé à l'aide d'un dispositif de reconnaissance vocale automatique et à identifier, sur la base d'un modèle linguistique qui n'est pas utilisé par le dispositif de reconnaissance vocale automatique lors de la génération de la transcription candidate initiale, un ou plusieurs termes qui sont phonétiquement similaires à un ou plusieurs termes qui surviennent dans la transcription candidate initiale. Des actions supplémentaires consistent à générer une ou plusieurs transcriptions candidates supplémentaires en se basant sur un ou plusieurs termes identifiés et à sélectionner une transcription parmi les transcriptions candidates. Les caractéristiques décrites peuvent permettre l'utilisation de données provenant d'une source de données externe lors de la génération de transcriptions plus précises sans modifier un dispositif de reconnaissance vocale automatique ou peuvent éviter une nouvelle compilation d'un dispositif de reconnaissance vocale automatique.
front page image
Designated States: AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JP, KE, KG, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW
African Regional Intellectual Property Organization (ARIPO) (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW)
Eurasian Patent Organization (AM, AZ, BY, KG, KZ, RU, TJ, TM)
European Patent Office (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR)
African Intellectual Property Organization (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG)
Publication Language: English (EN)
Filing Language: English (EN)
Also published as:
DE202016008230DE102016125954CN107045871IN201847017093KR1020180066216EP3360129
RU0002688277