Processing

Please wait...

Settings

Settings

Goto Application

1. CN113841198 - SIGNAL COMPONENT ESTIMATION USING COHERENCE

Office
China
Application Number 202080036549.X
Application Date 30.04.2020
Publication Number 113841198
Publication Date 24.12.2021
Publication Kind A
IPC
G10L 21/0232
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0216characterised by the method used for estimating noise
0232Processing in the frequency domain
G10L 21/0264
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0264characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
G10L 21/0208
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
G10L 19/012
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
012Comfort noise or silence coding
CPC
G10L 21/0232
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0216characterised by the method used for estimating noise
0232Processing in the frequency domain
G10L 21/0264
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0264characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
G10L 21/0208
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
G10L 19/012
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
012Comfort noise or silence coding
G10L 2021/02082
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
02082the noise being echo, reverberation of the speech
G10L 25/21
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
03characterised by the type of extracted parameters
21the extracted parameters being power information
Applicants BOSE CORPORATION
伯斯有限公司
Inventors CHEUNG SHIUFUN
张绍勋
SONG ZUKUI
宋祖揆
HERA CRISTIAN MARIUS
C·M·赫拉
PAN DAVIS Y
D·Y·潘
Agents 北京市金杜律师事务所 11256
Priority Data 62/841,608 01.05.2019 US
Title
(EN) SIGNAL COMPONENT ESTIMATION USING COHERENCE
(ZH) 使用相干性的信号分量估计
Abstract
(EN) Systems, methods, and machine-readable storage devices that receive an input signal representing audio captured using a microphone. The input signal includes portions that represent acoustic output from one or more audio sources, and a portion that represents other acoustic energy in the environment. A frequency domain representation of the input signal is iteratively modified to substantially reduce effects due to all but a selected one of the portions, from which an estimate of the power spectral density, PSD, of the selected portion is determined. Based upon the estimated PSD a noise or echo component is reduced, or a replacement noise is provided.The iterative modification involves a diagonalization of the cross-spectral density matrix to remove content coherent with a first audio input from the auto and cross-spectra of other signals.
(ZH) 本发明公开了接收表示使用麦克风捕获的音频的输入信号的系统、方法和机器可读存储设备。该输入信号包括表示来自一个或多个音频源的声学输出的多个部分,以及表示环境中的其他声能的一部分。迭代地修改该输入信号的频域表示,以显著减小由于除了该多个部分中的所选择的一者之外的所有部分而导致的影响,由此表示来确定对所选择的一部分的功率谱密度(PSD)的估计。基于所估计的PSD,减小噪声或回声分量,或者提供替换噪声。该迭代修改涉及交叉频谱密度矩阵的对角化,以移除与其他信号的自动频谱和交叉频谱中的第一音频输入相干的内容。