Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020068909 - SYSTEMS AND METHODS FOR SELECTIVE WAKE WORD DETECTION USING NEURAL NETWORK MODELS

Publication Number WO/2020/068909
Publication Date 02.04.2020
International Application No. PCT/US2019/052841
International Filing Date 25.09.2019
IPC
G10L 15/30 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
CPC
G10L 15/14
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
14using statistical models, e.g. Hidden Markov Models [HMMs]
G10L 15/16
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
16using artificial neural networks
G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 15/30
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
G10L 15/32
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
32Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
G10L 2015/088
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
088Word spotting
Applicants
  • SONOS, INC. [US]/[US]
Inventors
  • FAINBERG, Joachim
  • GIACOBELLO, Daniele
  • HARTUNG, Klaus
Agents
  • LINCICUM, Matt
  • FOX, Mary, L.
  • KUMAR, Vijay, S.
Priority Data
16/145,27528.09.2018US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) SYSTEMS AND METHODS FOR SELECTIVE WAKE WORD DETECTION USING NEURAL NETWORK MODELS
(FR) SYSTÈMES ET PROCÉDÉS DE DÉTECTION SÉLECTIVE DE MOT D'ACTIVATION À L'AIDE DE MODÈLES DE RÉSEAU NEURONAL
Abstract
(EN)
Systems and methods for media playback via a media playback system include capturing sound data via a network microphone device and identifying a candidate wake word in the sound data. Based on identification of the candidate wake word in the sound data, the system selects a first wake- word engine from a plurality of wake-word engines. Via the first wake-word engine, the system analyzes the sound data to detect a confirmed wake word, and, in response to detecting the confirmed wake word, transmits a voice utterance of the sound data to one or more remote computing devices associated with a voice assistant service.
(FR)
Des systèmes et des procédés de lecture multimédia au moyen d'un système de lecture multimédia consistent à capturer des données sonores au moyen d'un dispositif de microphone réseau, ainsi qu’à identifier un mot d’activation candidat dans les données sonores. D’après l'identification du mot d’activation candidat dans les données sonores, le système sélectionne un premier moteur de mots d’activation parmi une pluralité de moteurs de mots d’activation. Au moyen du premier moteur de mots d’activation, le système analyse les données sonores afin de détecter un mot d’activation confirmé et, en réponse à la détection du mot d’activation confirmé, transmet un énoncé vocal des données sonores à un ou plusieurs dispositifs informatiques à distance associés à un service d'assistant vocal.
Also published as
CA3067776
EP2019783874
KRKR1020207003504
Latest bibliographic data on file with the International Bureau