Processing

Please wait...

Settings

Settings

Goto Application

1. CN112166424 - SYSTEMS AND METHODS FOR IDENTIFYING AND PROVIDING INFORMATION ABOUT SEMANTIC ENTITIES IN AUDIO SIGNALS

Office
China
Application Number 201880093529.9
Application Date 30.07.2018
Publication Number 112166424
Publication Date 01.01.2021
Publication Kind A
IPC
G06F 16/683
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
60of audio data
68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
683using metadata automatically derived from the content
CPC
G06F 16/685
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
60of audio data
68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
683using metadata automatically derived from the content
685using automatically derived transcript of audio data, e.g. lyrics
G06F 3/017
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
3Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
01Input arrangements or combined input and output arrangements for interaction between user and computer
017Gesture based interaction, e.g. based on a set of recognized hand gestures
G06F 3/0481
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
3Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
01Input arrangements or combined input and output arrangements for interaction between user and computer
048Interaction techniques based on graphical user interfaces [GUI]
0481based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
G06F 3/167
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
3Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
16Sound input; Sound output
167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
G10L 15/1815
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
18using natural language modelling
1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
Applicants GOOGLE INC.
谷歌有限责任公司
Inventors WANTLAND TIM
T.万特兰
BARBELLO BRANDON
B.巴贝洛
Agents 北京市柳沈律师事务所 11105
Title
(EN) SYSTEMS AND METHODS FOR IDENTIFYING AND PROVIDING INFORMATION ABOUT SEMANTIC ENTITIES IN AUDIO SIGNALS
(ZH) 用于识别和提供关于音频信号中的语义实体的信息的系统和方法
Abstract
(EN) Systems and methods for determining identifying semantic entities in audio signals are provided. A method can include the step of obtaining, by a computing device comprising one or more processors andone or more memory devices, an audio signal concurrently heard by a user. The method can further include the steps: analyzing, by a machine-learned model stored on the computing device, at least a portion of the audio signal in a background of the computing device to determine one or more semantic entities. The method can further include the step of displaying the one or more semantic entities ona display screen of the computing device.
(ZH) 提供了用于确定识别音频信号中的语义实体的系统和方法。一种方法可以包括由包括一个或多个处理器和一个或多个存储器设备的计算设备获得用户同时听到的音频信号。该方法还可以包括由存储在计算设备上的机器学习模型在计算设备的后台中分析音频信号的至少一部分,以确定一个或多个语义实体。该方法还可以包括在计算设备的显示屏上显示所述一个或多个语义实体。
Related patent documents