Processing

Please wait...

Settings

Settings

Goto Application

1. WO2022164646 - DETERMINATION OF TASK URGENCY BASED ON ACOUSTIC FEATURES OF AUDIO DATA

Publication Number WO/2022/164646
Publication Date 04.08.2022
International Application No. PCT/US2022/012388
International Filing Date 13.01.2022
IPC
G10L 25/51 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48specially adapted for particular use
51for comparison or discrimination
CPC
G06N 3/08
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
08Learning methods
G10L 15/02
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
02Feature extraction for speech recognition; Selection of recognition unit
G10L 15/063
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
063Training
G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 2015/223
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
223Execution procedure of a spoken command
G10L 25/51
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
48specially adapted for particular use
51for comparison or discrimination
Applicants
  • MICROSOFT TECHNOLOGY LICENSING, LLC [US]/[US]
Inventors
  • NOURI, Elnaz
Agents
  • CHATTERJEE, Aaron C.
  • BARKER, Doug
  • CHEN, Wei-Chen Nicholas
  • CHOI, Daniel
  • CHURNA, Timothy
  • DINH, Phong
  • EVANS, Patrick
  • GABRYJELSKI, Henry
  • GUPTA, Anand
  • HWANG, William C.
  • JARDINE, John S.
  • LEE, Sunah
  • LEMMON, Marcus
  • MARQUIS, Thomas
  • MEYERS, Jessica
  • SPELLMAN, Steven
  • SULLIVAN, Kevin
  • WALKER, Matt
  • WIGHT, Stephen A.
  • WISDOM, Gregg
  • WONG, Thomas S.
  • ZHANG, Hannah
  • AKHTER, Julia
  • KADOURA, Judy M.
  • NIU, Bo
  • BROWN, Renee
  • TRAN, Kimberly
Priority Data
17/163,17629.01.2021US
Publication Language English (en)
Filing Language English (EN)
Designated States
Title
(EN) DETERMINATION OF TASK URGENCY BASED ON ACOUSTIC FEATURES OF AUDIO DATA
(FR) DÉTERMINATION D'URGENCE DE TÂCHE SUR LA BASE DE CARACTÉRISTIQUES ACOUSTIQUES DE DONNÉES AUDIO
Abstract
(EN) Systems and methods are provided for determining importance and urgency of a task based on acoustic features of audio input associated with the task. The determining includes classifying the task into one or more classes associated with importance, urgency, and priority of the task. The classification may use a trained machine learning model of acoustic features and embedding for a neural network. The task classifier uses feature acoustics of either or both the foreground and background audio. The feature acoustics include a pitch, a tone, and a volume over a time duration of the audio input. A combination of the acoustic features determines a class associated with the task. The machine learning model includes a regression model of acoustic features over time and a model with embedding for a neural network.
(FR) L'invention concerne des systèmes et des procédés pour la détermination de l'importance et de l'urgence d'une tâche sur la base de caractéristiques acoustiques d'entrée audio associées à la tâche. La détermination comprend la classification de la tâche en une ou plusieurs classes associées à l'importance, à l'urgence et à la priorité de la tâche. La classification peut faire appel à un modèle d'apprentissage machine entraîné de caractéristiques acoustiques et à un plongement pour un réseau neuronal. Le classificateur de tâche utilise l'acoustique de caractéristique de l'audio d'avant-plan et/ou d'arrière-plan. L'acoustique de caractéristique comprend une hauteur, une tonalité et un volume sur une durée de l'entrée audio. Une combinaison des caractéristiques acoustiques détermine une classe associée à la tâche. Le modèle d'apprentissage machine comprend un modèle de régression de caractéristiques acoustiques dans le temps et un modèle avec plongement pour un réseau neuronal.
Related patent documents
Latest bibliographic data on file with the International Bureau