WO2020112462 - SYSTEMS AND METHODS FOR TRAINING A CONTROL SYSTEM BASED ON PRIOR AUDIO INPUTS

Publication Number WO/2020/112462
Publication Date 04.06.2020
International Application No. PCT/US2019/062496
International Filing Date 20.11.2019
IPC
G10L 25/51 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48 specially adapted for particular use
51 for comparison or discrimination
G06N 20/00 2019.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20 Machine learning
G06F 3/16 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
3 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
16 Sound input; Sound output
CPC
G06F 3/167
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
3 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
16 Sound input; Sound output
167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
G06N 20/00
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20 Machine learning
G10L 15/063
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
063 Training
G10L 15/1815
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
18 using natural language modelling
1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
G10L 15/1822
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
18 using natural language modelling
1822 Parsing for meaning understanding
G10L 15/183
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15 Speech recognition
08 Speech classification or search
18 using natural language modelling
183 using context dependencies, e.g. language models
Applicants
  • ROVI GUIDES, INC. [US]/[US]
Inventors
  • JAMES, Bryan
  • MALHOTRA, Manik
Agents
  • GUILIANO, Joseph M.
  • BOIANO, Anthony, A.
  • DEREVJANIK, Mario
  • FEUSTEL, Richard, M.
  • FLINDERS, P., Matthew
  • GARG, Rohini, K.
Priority Data
16/201,679 27.11.2018 US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) SYSTEMS AND METHODS FOR TRAINING A CONTROL SYSTEM BASED ON PRIOR AUDIO INPUTS
(FR) SYSTÈMES ET PROCÉDÉS D'APPRENTISSAGE D'UN SYSTÈME DE COMMANDE SUR LA BASE D'ENTRÉES AUDIO ANTÉRIEURES
Abstract
(EN)
Systems and methods are disclosed herein for training a control system based on prior audio inputs. The disclosed systems and methods receive a non-lexical or interjectional audio input. State change indications are also received and stored by the system within a predefined period of time starting from the time the system received the audio input. The system then receives a subsequent audio input. If the audio inputs of both the audio input and the subsequent audio input match, and contextual information for the audio input and the subsequent audio input match, the system stores a match association, comprising a confidence factor, for the subsequent audio input to the audio input in the associative data structure. If the confidence factor is greater than a preconfigured confidence level, the system executes one or more functions based on stored state change indications.
(FR)
L'invention concerne des systèmes et des procédés permettant d'entraîner un système de commande sur la base d'entrées audio antérieures. Les systèmes et les procédés selon l'invention reçoivent une entrée audio non lexicale ou interjectionelle. Des indications de changement d'état sont également reçus et stockés par le système dans un laps de temps prédéfini à partir de l'instant où le système a reçu l'entrée audio. Le système reçoit ensuite une entrée audio subséquente. Si les entrées audio à la fois de l'entrée audio et de l'entrée audio ultérieure correspondent, et si des informations contextuelles de l'entrée audio et l'entrée audio ultérieure correspondent, le système stocke une association de correspondances, comportant un facteur de confiance, pour l'entrée audio ultérieure jusqu'à l'entrée audio dans la structure de données associative. Si le facteur de confiance est supérieur à un niveau de confiance préconfiguré, le système exécute une ou plusieurs fonctions sur la base d'indications de changement d'état stockées.
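
Illustrative sketch (not part of the published record): the flow described in the abstract could be modelled roughly as follows. This is a minimal Python sketch under stated assumptions, not the applicant's implementation. The class and method names (`ControlTrainer`, `Association`, `on_audio`, `on_state_change`), the string labels standing in for audio inputs and contextual information, and the fixed-step confidence update are all hypothetical; the abstract does not say how audio inputs are matched or how the confidence factor is computed.

```python
from dataclasses import dataclass, field


@dataclass
class Association:
    """One entry in the associative data structure (field names are hypothetical)."""
    state_changes: list = field(default_factory=list)
    confidence: float = 0.0


class ControlTrainer:
    def __init__(self, window_s=30.0, confidence_level=0.4, step=0.25):
        self.window_s = window_s                  # predefined period after an audio input
        self.confidence_level = confidence_level  # preconfigured confidence level
        self.step = step                          # ASSUMPTION: fixed increment per match
        self.table = {}                           # (sound, context) -> Association
        self._last = None                         # (key, time) of the latest audio input

    def on_state_change(self, change, t):
        """Store a state change only if it falls inside the time window."""
        if self._last is None:
            return
        key, t_audio = self._last
        if 0.0 <= t - t_audio <= self.window_s:
            changes = self.table[key].state_changes
            if change not in changes:
                changes.append(change)

    def on_audio(self, sound, context, t):
        """Return the functions to execute for this audio input (possibly none)."""
        key = (sound, context)
        self._last = (key, t)
        entry = self.table.get(key)
        if entry is None:
            # First occurrence: start learning, execute nothing yet.
            self.table[key] = Association()
            return []
        # Audio and context both match a prior input: raise the confidence factor.
        entry.confidence = min(1.0, entry.confidence + self.step)
        if entry.confidence > self.confidence_level:
            return list(entry.state_changes)
        return []


trainer = ControlTrainer()
trainer.on_audio("sigh", "evening", t=0.0)
trainer.on_state_change("lights_off", t=5.0)    # inside the window: stored
trainer.on_state_change("tv_off", t=100.0)      # outside the window: ignored
first_repeat = trainer.on_audio("sigh", "evening", t=200.0)
second_repeat = trainer.on_audio("sigh", "evening", t=400.0)
```

With these assumed defaults, the first repeat raises the confidence factor to 0.25, which stays below the 0.4 level, so nothing is executed; the second repeat raises it to 0.5, and the stored state change is returned for execution.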