1. WO2021011814 - ADAPTING SIBILANCE DETECTION BASED ON DETECTING SPECIFIC SOUNDS IN AN AUDIO SIGNAL

Publication Number WO/2021/011814
Publication Date 21.01.2021
International Application No. PCT/US2020/042400
International Filing Date 16.07.2020
IPC
G10L 25/60 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
48 specially adapted for particular use
51 for comparison or discrimination
60 for measuring the quality of voice signals
G10L 25/18 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
03 characterised by the type of extracted parameters
18 the extracted parameters being spectral information of each sub-band
G10L 25/30 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
27 characterised by the analysis technique
30 using neural networks
G10L 25/78 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/-G10L21/129
78 Detection of presence or absence of voice signals
G10L 21/0232 2013.01
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02 Speech enhancement, e.g. noise reduction or echo cancellation
0208 Noise filtering
0216 characterised by the method used for estimating noise
0232 Processing in the frequency domain
CPC
G10L 21/0232
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02 Speech enhancement, e.g. noise reduction or echo cancellation
0208 Noise filtering
0216 characterised by the method used for estimating noise
0232 Processing in the frequency domain
G10L 25/18
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
03 characterised by the type of extracted parameters
18 the extracted parameters being spectral information of each sub-band
G10L 25/30
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
27 characterised by the analysis technique
30 using neural networks
G10L 25/60
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
48 specially adapted for particular use
51 for comparison or discrimination
60 for measuring the quality of voice signals
G10L 25/78
G PHYSICS
10 MUSICAL INSTRUMENTS; ACOUSTICS
L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
25 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
78 Detection of presence or absence of voice signals
Applicants
  • DOLBY LABORATORIES LICENSING CORPORATION [US]/[US]
Inventors
  • MA, Yuanxing
  • LI, Kai
  • FANG, Qianqian
Agents
  • DOLBY LABORATORIES, INC.
  • ANDERSEN, Robert L.
  • BROWN, Tyrome Y.
  • ESTES, Ernest L.
  • HOGLUND, Heath W.
  • KONSTANTINIDES, Konstantinos
  • MA, Xin
  • PURTILL, Elizabeth
  • TANASE, Iuliana
  • YON, Sanghyok
  • ZHANG, Yiming
  • DOLBY INTERNATIONAL AB PATENT GROUP EUROPE
Priority Data
62/884,320 08.08.2019 US
PCT/CN2019/096399 17.07.2019 CN
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) ADAPTING SIBILANCE DETECTION BASED ON DETECTING SPECIFIC SOUNDS IN AN AUDIO SIGNAL
(FR) ADAPTATION DE DÉTECTION DE SIFFLEMENT SUR LA BASE DE LA DÉTECTION DE SONS SPÉCIFIQUES DANS UN SIGNAL AUDIO
Abstract
(EN)
A method is disclosed herein for adapting parameters of a sibilance detector. Time-frequency features are extracted from an audio signal being received. Based on those time-frequency features, a determination is made of whether the audio signal includes a short-term feature or a long-term feature. In accordance with determining that the audio signal includes the short-term feature or the long-term feature, one or more parameters of a sibilance detector for detecting sibilance in the audio signal are adapted. Sibilance in the audio signal is detected using the sibilance detector with the one or more adapted parameters.
(FR)
L'invention concerne un procédé d'adaptation de paramètres d'un détecteur de sifflement. Des caractéristiques temps-fréquence sont extraites d'un signal audio reçu et, sur la base de ces caractéristiques temps-fréquence, il est déterminé si le signal audio comprend une caractéristique à court terme ou une caractéristique à long terme. Conformément à la détermination du fait que le signal audio comprend la caractéristique à court terme ou la caractéristique à long terme, un ou plusieurs paramètres d'un détecteur de sifflement servant à détecter un sifflement dans le signal audio sont adaptés. Le sifflement dans le signal audio est détecté à l'aide du détecteur de sifflement avec le ou les paramètres adaptés.
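Illustrative sketch (not part of the publication): the four steps named in the abstract (extract time-frequency features, check for a short-term or long-term feature, adapt a detector parameter, detect sibilance) might be arranged as below. Everything specific here is an assumption made for illustration only: the two frequency bands, the energy-jump test standing in for a "short-term feature", the run-length test standing in for a "long-term feature", the single adapted parameter (a ratio threshold), and all numeric constants and function names. The abstract does not specify any of them.

```python
import cmath
import math


def tone(freq_hz, amplitude=1.0, n=32, sample_rate=16000):
    """Helper for the example: one frame of a sine wave."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * t / sample_rate)
            for t in range(n)]


def band_energies(frame, sample_rate, bands):
    """Time-frequency features for one frame: naive DFT power summed per band."""
    n = len(frame)
    energies = [0.0] * len(bands)
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame))
        power = abs(coeff) ** 2
        for i, (lo, hi) in enumerate(bands):
            if lo <= freq < hi:
                energies[i] += power
    return energies


def detect_sibilance(frames, sample_rate=16000, base_threshold=0.6,
                     raised_threshold=0.9, jump_factor=8.0, run_length=5):
    """Return one boolean per frame: True where sibilance is flagged.

    All thresholds and both feature tests are hypothetical placeholders.
    """
    bands = [(0, 4000), (4000, 8001)]  # assumed low band / sibilance band
    flags = []
    prev_total = None
    high_run = 0
    for frame in frames:
        low, high = band_energies(frame, sample_rate, bands)
        total = low + high
        ratio = high / total if total > 0 else 0.0

        # Assumed short-term feature: abrupt energy jump versus previous frame.
        short_term = (prev_total is not None
                      and total > jump_factor * max(prev_total, 1e-12))
        # Assumed long-term feature: high band dominant for many frames in a row.
        high_run = high_run + 1 if ratio > 0.5 else 0
        long_term = high_run > run_length

        # Adapt the detector parameter when either feature is present.
        threshold = raised_threshold if (short_term or long_term) else base_threshold

        # Detect sibilance with the (possibly adapted) parameter.
        flags.append(ratio > threshold)
        prev_total = total
    return flags


# Example input: a frame mixing a strong 6 kHz tone with a weaker 1 kHz tone.
mixed = [h + l for h, l in zip(tone(6000), tone(1000, amplitude=0.7))]
silence = [0.0] * 32
print(detect_sibilance([tone(1000), mixed]))
print(detect_sibilance([silence, mixed]))
```

In this sketch the same mixed frame is judged against a stricter threshold when it follows silence (the assumed short-term feature fires) than when it follows an ordinary frame, which is the kind of parameter adaptation the abstract describes in general terms.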