Processing

Please wait...

Settings

Settings

Goto Application

1. US20210142792 - Systems and Methods for Identifying and Providing Information About Semantic Entities in Audio Signals

Office
United States of America
Application Number 17047472
Application Date 30.07.2018
Publication Number 20210142792
Publication Date 13.05.2021
Publication Kind A1
IPC
G10L 15/18
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
G06F 16/683
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
60of audio data
68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
683using metadata automatically derived from the content
G06N 20/00
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20Machine learning
G06F 3/0481
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
3Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
01Input arrangements or combined input and output arrangements for interaction between user and computer
048Interaction techniques based on graphical user interfaces
0481based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
G10L 15/30
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
28Constructional details of speech recognition systems
30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
H04R 1/10
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
1Details of transducers
10Earpieces; Attachments therefor
CPC
G06F 16/685
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
60of audio data
68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
683using metadata automatically derived from the content
685using automatically derived transcript of audio data, e.g. lyrics
G10L 15/1815
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
G06N 20/00
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20Machine learning
G06F 3/0481
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
3Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
01Input arrangements or combined input and output arrangements for interaction between user and computer
048Interaction techniques based on graphical user interfaces [GUI]
0481based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
H04R 1/08
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
1Details of transducers, ; loudspeakers or microphones
08Mouthpieces; Microphones;; Attachments therefor
H04R 1/1016
HELECTRICITY
04ELECTRIC COMMUNICATION TECHNIQUE
RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
1Details of transducers, ; loudspeakers or microphones
10Earpieces; Attachments therefor ; ; Earphones; Monophonic headphones
1016Earpieces of the intra-aural type
Applicants Google LLC
Inventors Tim Wantland
Brandon Barbello
Title
(EN) Systems and Methods for Identifying and Providing Information About Semantic Entities in Audio Signals
Abstract
(EN)

Systems and methods for determining identifying semantic entities in audio signals are provided. A method can include obtaining, by a computing device comprising one or more processors and one or more memory devices, an audio signal concurrently heard by a user. The method can further include analyzing, by a machine-learned model stored on the computing device, at least a portion of the audio signal in a background of the computing device to determine one or more semantic entities. The method can further include displaying the one or more semantic entities on a display screen of the computing device.


Related patent documents