Processing

Please wait...

Settings

Settings

Goto Application

1. WO2018132187 - CHARACTERISTIC-BASED SPEECH CODEBOOK SELECTION

Publication Number WO/2018/132187
Publication Date 19.07.2018
International Application No. PCT/US2017/063438
International Filing Date 28.11.2017
IPC
G10L 19/00 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 25/51 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48specially adapted for particular use
51for comparison or discrimination
G10L 25/30 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
27characterised by the analysis technique
30using neural networks
G10L 25/63 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48specially adapted for particular use
51for comparison or discrimination
63for estimating an emotional state
CPC
G10L 15/02
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
02Feature extraction for speech recognition; Selection of recognition unit
G10L 15/16
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
16using artificial neural networks
G10L 19/00
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
16Vocoder architecture
18Vocoders using multiple modes
22Mode decision, i.e. based on audio signal content versus external parameters
G10L 2019/0001
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
0001Codebooks
G10L 25/30
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
27characterised by the analysis technique
30using neural networks
Applicants
  • QUALCOMM INCORPORATED [US]/[US]
Inventors
  • GUO, Yinyi
  • VISSER, Erik
Agents
  • TOLER, Jeffrey G.
Priority Data
15/405,15912.01.2017US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) CHARACTERISTIC-BASED SPEECH CODEBOOK SELECTION
(FR) SÉLECTION DE LIVRE DE CODES DE PAROLE BASÉ SUR DES CARACTÉRISTIQUES
Abstract
(EN)
An apparatus includes a speech processing engine configured to receive data corresponding to speech and to determine whether a first characteristic associated with the speech differs from a reference characteristic by at least a threshold amount. The apparatus further includes a selection circuit responsive to the speech processing engine. The selection circuit is configured to select a particular speech codebook from among a plurality of speech codebooks based on the first characteristic differing from the reference characteristic by at least the threshold amount. The particular speech codebook is associated with the first characteristic. This first characteristic is based on an emotion of the user, an environement of the user, and estimated age of the user or an estimated distance of the user from a microphone.
(FR)
L’invention concerne un appareil comprenant un moteur de traitement de la parole configuré pour recevoir des données correspondant à la parole et pour déterminer si une première caractéristique associée à la parole diffère d'une caractéristique de référence par au moins un seuil. L'appareil comprend en outre un circuit de sélection sensible au moteur de traitement de la parole. Le circuit de sélection est configuré pour sélectionner un livre de codes de parole particulier parmi une pluralité de livres de codes de parole sur la base de la première caractéristique différente de la caractéristique de référence par au moins le seuil. Le livre de codes de parole particulier est associé à la première caractéristique. Cette première caractéristique est basée sur une émotion de l'utilisateur, un environnement de l'utilisateur, et l'âge estimé de l'utilisateur ou une distance estimée de l'utilisateur à partir d'un microphone.
Latest bibliographic data on file with the International Bureau