Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020135324 - AUDIO SIGNAL PROCESSING

Publication Number WO/2020/135324
Publication Date 02.07.2020
International Application No. PCT/CN2019/127397
International Filing Date 23.12.2019
IPC
G10L 17/00 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
17Speaker identification or verification
G10L 15/16 2006.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
16using artificial neural networks
CPC
G06N 20/10
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20Machine learning
10using kernel methods, e.g. support vector machines [SVM]
G06N 3/0445
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architectures, e.g. interconnection topology
0445Feedback networks, e.g. hopfield nets, associative networks
G06N 3/0454
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architectures, e.g. interconnection topology
0454using a combination of multiple neural nets
G06N 3/0481
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architectures, e.g. interconnection topology
0481Non-linear activation functions, e.g. sigmoids, thresholds
G06N 3/08
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
08Learning methods
G06N 7/005
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
7Computer systems based on specific mathematical models
005Probabilistic networks
Applicants
  • ALIBABA GROUP HOLDING LIMITED
Inventors
  • ZHAO, Yan
  • LIU, Gang
  • LEI, Yun
Agents
  • BEIJING SANYOU INTELLECTUAL PROPERTY AGENCY LTD.
Priority Data
16/236,20828.12.2018US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) AUDIO SIGNAL PROCESSING
(FR) TRAITEMENT DE SIGNAUX AUDIO
Abstract
(EN)
Systems and methods are provided for improving audio signal processing by receiving an audio signal; obtaining a plurality of multi-dimensional features based on the audio signal; obtaining a plurality of segment-level representations based on the plurality of multi-dimensional features; obtaining an utterance-level representation based on the plurality of segment-level representations; and recognizing a speaker from the audio signal based on the utterance-level representation.
(FR)
La présente invention concerne des systèmes et des procédés pour améliorer un traitement de signaux audio, comprenant : la réception d’un signal audio ; l’obtention d’une pluralité de caractéristiques multidimensionnelles sur la base du signal audio ; l’obtention d’une pluralité de représentations de niveau de segment sur la base de la pluralité des caractéristiques multidimensionnelles ; l’obtention d’une représentation de niveau d’énoncé sur la base de la pluralité des représentations de niveau de segment ; et la reconnaissance d’un locuteur à partir du signal audio sur la base de la représentation de niveau d’énoncé.
Related patent documents
Latest bibliographic data on file with the International Bureau