Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020231449 - SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)

Publication Number WO/2020/231449
Publication Date 19.11.2020
International Application No. PCT/US2019/033104
International Filing Date 20.05.2019
IPC
G10L 13/02 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
13Speech synthesis; Text to speech systems
02Methods for producing synthetic speech; Speech synthesisers
G06N 3/04 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architecture, e.g. interconnection topology
CPC
G06N 3/006
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
004Artificial life, i.e. computers simulating life
006based on simulated virtual individual or collective life forms, e.g. single "avatar", social simulations, virtual worlds or particle swarm optimisation
G06N 3/0445
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architectures, e.g. interconnection topology
0445Feedback networks, e.g. hopfield nets, associative networks
G06N 3/0454
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
04Architectures, e.g. interconnection topology
0454using a combination of multiple neural nets
G06N 3/084
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3Computer systems based on biological models
02using neural network models
08Learning methods
084Back-propagation
G06N 7/005
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
7Computer systems based on specific mathematical models
005Probabilistic networks
G10L 13/02
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
13Speech synthesis; Text to speech systems
02Methods for producing synthetic speech; Speech synthesisers
Applicants
  • DEEPMIND TECHNOLOGIES LIMITED [GB]/[GB]
Inventors
  • COBO RUS, Luis Carlos
  • KALCHBRENNER, Nal
  • ELSEN, Erich
  • GU, Chenjie
Agents
  • PORTNOV, Michael
Priority Data
62/848,31415.05.2019US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) SPEECH SYNTHESIS UTILIZING AUDIO WAVEFORM DIFFERENCE SIGNAL(S)
(FR) SYNTHÈSE VOCALE UTILISANT UN OU PLUSIEURS SIGNAUX DE DIFFÉRENCE DE FORME D'ONDE AUDIO
Abstract
(EN)
Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.
(FR)
La présente invention concerne des techniques qui permettent de générer une forme d'onde audio représentant un langage synthétisé, sur la base d'un signal de différence déterminé en utilisant un modèle autorégressif. Selon divers modes de réalisation, la présente invention consiste à utiliser une distribution des valeurs de signal de différence pour représenter des sons trouvés dans le langage humain avec un niveau de granularité plus élevé que des sons qui ne sont pas fréquemment trouvés dans le langage humain. Selon des modes de réalisation supplémentaires ou alternatifs, la présente invention consiste à utiliser un ou plusieurs haut-parleurs d'un dispositif client pour restituer la forme d'onde audio générée.
Latest bibliographic data on file with the International Bureau