Processing

Please wait...

Settings

Settings

Goto Application

1. WO2002019319 - SPEECH PROCESSING DEVICE AND SPEECH PROCESSING METHOD

Publication Number WO/2002/019319
Publication Date 07.03.2002
International Application No. PCT/JP2001/007518
International Filing Date 31.08.2001
IPC
G10L 11/02 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
11Determination or detection of speech or audio characteristics not restricted to a single one of groups G10L15/-G10L21/155
02Detection of presence or absence of speech signals
G10L 19/00 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/14 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
14Details not provided for in groups G10L19/06-G10L19/1287
CPC
G10L 19/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
26Pre-filtering or post-filtering
G10L 2019/0011
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
0001Codebooks
0011Long term prediction filters, i.e. pitch estimation
G10L 2025/783
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
78Detection of presence or absence of voice signals
783based on threshold decision
G10L 25/78
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
78Detection of presence or absence of voice signals
Applicants
  • MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. [JP]/[JP] (AllExceptUS)
  • WANG, Youhua [CN]/[JP] (UsOnly)
  • YOSHIDA, Koji [JP]/[JP] (UsOnly)
Inventors
  • WANG, Youhua
  • YOSHIDA, Koji
Agents
  • WASHIDA, Kimihito
Priority Data
2000-26419731.08.2000JP
2001-25947329.08.2001JP
Publication Language Japanese (JA)
Filing Language Japanese (JA)
Designated States
Title
(EN) SPEECH PROCESSING DEVICE AND SPEECH PROCESSING METHOD
(FR) DISPOSITIF DE TRAITEMENT VOCAL ET PROCEDE DE TRAITEMENT VOCAL
Abstract
(EN)
A voice/nonvoice judging section (106) judges that a section of the voice spectrum is a voice section containing a voice component if the difference between the voice spectrum signal and the value of a noise base is a predetermined threshold or more and otherwise judges that the section is a nonvoice section containing no voice components and containing only noise. A comb filter generating section (107) generates a comb filter for enhancing the voice pitch according to whether or not a voice component is contained in each frequency bin. A damping coefficient calculating section (108) multiplies the comb filter by a damping coefficient based on a frequency characteristic, determines the damping coefficient of the input signal for each frequency bin, and outputs the damping coefficient of each frequency bin to a multiplying section (109). The multiplying section (109) multiplies the voice spectrum by the damping coefficient for each frequency bin unit. A frequency synthesizing section (110) combines the spectra of the frequency bin units determined by the multiplication to synthesize a voice spectrum continuous in a frequency range in units of a predetermined processing time.
(FR)
L'invention concerne une section (106) d'évaluation vocal/non vocal, qui permet d'établir qu'une partie du spectre vocal est une partie vocale contenant une composante vocale lorsque la différence entre le signal du spectre vocal et la valeur d'une base comprenant du bruit est égale ou supérieure à un seuil prédéterminé, et d'établir dans le cas contraire que cette partie est une partie non vocale ne contenant pas de composantes vocales et ne contenant que du bruit. Une section génératrice (107) de filtre en peigne génère un filtre en peigne permettant d'augmenter la hauteur de voix dans la mesure où une composante vocale est contenue dans chaque canal de fréquence. Une section (108) de calcul de coefficient d'amortissement multiplie le filtre en peigne par un coefficient d'amortissement basé sur une caractéristique de fréquence, détermine le coefficient d'amortissement du signal d'entrée dans chaque canal de fréquence et envoie le coefficient d'amortissement de chaque canal de fréquence à une section (109) de multiplication. Cette section (109) de multiplication multiplie le spectre vocal avec le coefficient d'amortissement pour chaque unité de canal de fréquence. Une section (110) de synthèse de fréquence combine les spectres des unités de canaux de fréquence déterminées par la multiplication afin de synthétiser un spectre vocal continu dans un intervalle de fréquence, sous forme d'unités d'un temps de traitement prédéterminé.
Latest bibliographic data on file with the International Bureau