Processing

Please wait...

Settings

Settings

Goto Application

1. WO2018151768 - LANGUAGE MODEL BIASING SYSTEM

Publication Number WO/2018/151768
Publication Date 23.08.2018
International Application No. PCT/US2017/057369
International Filing Date 19.10.2017
Chapter 2 Demand Filed 06.06.2018
IPC
G10L 15/18 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
G10L 15/197 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
183using context dependencies, e.g. language models
19Grammatical context, e.g. disambiguation of recognition hypotheses based on word sequence rules
197Probabilistic grammars, e.g. word n-grams
CPC
G10L 15/01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
01Assessment or evaluation of speech recognition systems
G10L 15/07
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
065Adaptation
07to the speaker
G10L 15/1815
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
G10L 15/187
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
183using context dependencies, e.g. language models
187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
G10L 15/197
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
183using context dependencies, e.g. language models
19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
197Probabilistic grammars, e.g. word n-grams
G10L 15/30
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Applicants
  • GOOGLE LLC [US]/[US]
Inventors
  • ALEKSIC, Petar
  • MENGIBAR, Pedro J. Moreno
Agents
  • MOSTELLER, Matthew P.
  • JEPSEN, Nicholas
  • MARKS & CLERK LLP
Priority Data
15/432,62014.02.2017US
Publication Language English (en)
Filing Language English (EN)
Designated States
Title
(EN) LANGUAGE MODEL BIASING SYSTEM
(FR) SYSTÈME D'ORIENTATION DE MODÈLE LINGUISTIQUE
Abstract
(EN) Methods, systems, and apparatus for receiving audio data corresponding to a user utterance and context data, identifying an initial set of one or more n-grams from the context data, generating an expanded set of one or more n-grams based on the initial set of n-grams, adjusting a language model based at least on the expanded set of n-grams, determining one or more speech recognition candidates for at least a portion of the user utterance using the adjusted language model, adjusting a score for a particular speech recognition candidate determined to be included in the expanded set of n-grams, determining a transcription of user utterance that includes at least one of the one or more speech recognition candidates, and providing the transcription of the user utterance for output.
(FR) L'invention concerne des procédés, des systèmes et un appareil permettant de recevoir des données audio correspondant à un énoncé d'utilisateur et des données de contexte, d'identifier un ensemble initial d'un ou plusieurs n-grammes à partir des données de contexte, de générer un ensemble étendu d'un ou de plusieurs n-grammes sur la base de l'ensemble initial de n-grammes, de régler un modèle linguistique sur la base au moins de l'ensemble étendu de n-grammes, de déterminer un ou plusieurs candidats de reconnaissance vocale pour au moins une partie de l'énoncé d'utilisateur à l'aide du modèle linguistique réglé, de régler un score pour un candidat de reconnaissance vocale particulier déterminé comme étant inclus dans l'ensemble étendu de n-grammes, de déterminer une transcription d'un énoncé d'utilisateur qui comprend au moins un candidat de reconnaissance vocale parmi le ou les candidats de reconnaissance vocale, et de fournir la transcription de l'énoncé d'utilisateur à émettre en sortie.
Latest bibliographic data on file with the International Bureau