Processing

Please wait...

Settings

Settings

Goto Application

1. WO2022010189 - APPARATUS AND METHOD FOR AUDIO ENCODING/DECODING ROBUST TO TRANSITION SEGMENT ENCODING DISTORTION

Publication Number WO/2022/010189
Publication Date 13.01.2022
International Application No. PCT/KR2021/008417
International Filing Date 02.07.2021
IPC
G10L 19/00 2006.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/005 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
005Correction of errors induced by the transmission channel, if related to the coding algorithm
G10L 19/16 2013.1
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
16Vocoder architecture
CPC
G10L 19/00
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 19/0017
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
0017Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
G10L 19/005
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
005Correction of errors induced by the transmission channel, if related to the coding algorithm
G10L 19/16
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
16Vocoder architecture
G10L 19/167
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
19Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
04using predictive techniques
16Vocoder architecture
167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
Applicants
  • 한국전자통신연구원 ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE [KR]/[KR]
Inventors
  • 백승권 BEACK, Seung Kwon
  • 성종모 SUNG, Jongmo
  • 이미숙 LEE, Mi Suk
  • 이태진 LEE, Tae Jin
  • 임우택 LIM, Woo-taek
  • 장인선 JANG, Inseon
Agents
  • 특허법인 무한 MUHANN PATENT & LAW FIRM
Priority Data
10-2020-008308606.07.2020KR
10-2020-018662829.12.2020KR
Publication Language Korean (ko)
Filing Language Korean (KO)
Designated States
Title
(EN) APPARATUS AND METHOD FOR AUDIO ENCODING/DECODING ROBUST TO TRANSITION SEGMENT ENCODING DISTORTION
(FR) APPAREIL ET PROCÉDÉ DE CODAGE/DÉCODAGE AUDIO ROBUSTE DE DISTORSION DE CODAGE DE SEGMENT DE TRANSITION
(KO) 천이구간 부호화 왜곡에 강인한 오디오 부호화/복호화 장치 및 방법
Abstract
(EN) Disclosed are an apparatus and method for audio encoding/decoding robust to transition segment encoding distortion. The method for audio encoding may comprise the steps of: outputting a frequency domain signal by performing time-to-frequency (T/F) conversion on an input signal; outputting a frequency domain residual signal obtained by removing a frequency axis envelope from the frequency domain signal, by applying frequency domain noise shaping (FDNS) encoding to the frequency domain signal; outputting a time domain residual signal obtained by removing a time axis envelope, by performing linear prediction coefficient (LPC) analysis on the basis of the frequency domain residual signal; and quantizing and transmitting the time domain residual signal.
(FR) Sont ici divulgués un appareil et un procédé de codage/décodage audio robuste de distorsion de codage de segment de transition. Le procédé de codage audio peut comprendre les étapes consistant : à émettre un signal de domaine fréquentiel en exécutant une conversion temps-fréquence (T/F) sur un signal d'entrée ; à émettre un signal résiduel de domaine fréquentiel obtenu en éliminant une enveloppe d'axe de fréquence du signal de domaine fréquentiel, en exécutant un codage de mise en forme de bruit dans le domaine fréquentiel (FDNS) sur le signal de domaine fréquentiel ; à émettre un signal résiduel de domaine temporel obtenu en éliminant une enveloppe d'axe temporel, en exécutant une analyse de coefficient de prédiction linéaire (LPC) sur la base du signal résiduel du domaine fréquentiel ; et à quantifier et transmettre le signal résiduel du domaine temporel.
(KO) 천이구간 부호화 왜곡에 강인한 오디오 부호화/복호화 장치 및 방법이 개시된다. 오디오 부호화 방법은 입력 신호를 T/F(time-to-frequency) 변환하여 주파수 영역 신호를 출력하는 단계; 상기 주파수 영역 신호에 FDNS(frequency domain noise shaping) 부호화를 적용하여 상기 주파수 영역 신호에서 주파수축 포락선이 제거된 주파수 영역 잔차 신호를 출력하는 단계; 상기 주파수 영역 잔차 신호를 기초로 LPC(linear prediction coefficient) 분석을 수행하여 시간축 포락선이 제거된 시간 영역 잔차 신호를 출력하는 단계; 및 상기 시간 영역 잔차 신호를 양자화하여 전송하는 단계를 포함할 수 있다.
Related patent documents
Latest bibliographic data on file with the International Bureau