Search International and National Patent Collections

1. (WO2018183650) END-TO-END TEXT-TO-SPEECH CONVERSION

Pub. No.:    WO/2018/183650    International Application No.:    PCT/US2018/025101
Publication Date: Fri Oct 05 01:59:59 CEST 2018 International Filing Date: Fri Mar 30 01:59:59 CEST 2018
IPC: G10L 13/04
G10L 15/16
G06N 3/08
Applicants: GOOGLE LLC
Inventors: BENGIO, Samuel
WANG, Yuxuan
YANG, Zongheng
CHEN, Zhifeng
WU, Yonghui
AGIOMYRGIANNAKIS, Ioannis
WEISS, Ron J.
JAITLY, Navdeep
RIFKIN, Ryan M.
CLARK, Robert Andrew James
LE, Quoc V.
RYAN, Russell J.
XIAO, Ying
Title: END-TO-END TEXT-TO-SPEECH CONVERSION
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.