Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020113935 - METHOD AND APPARATUS FOR INCREASING VOICE WAKE-UP SUCCESS RATE AND STORAGE MEDIUM

Publication Number WO/2020/113935
Publication Date 11.06.2020
International Application No. PCT/CN2019/091258
International Filing Date 14.06.2019
IPC
G10L 21/0216 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0216characterised by the method used for estimating noise
G10L 17/24 2013.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
17Speaker identification or verification
22Interactive procedures; Man-machine interfaces
24 the user being prompted to utter a password or a predefined phrase
G10L 15/22 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/26 2006.01
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
CPC
G10L 15/22
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
22Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 15/26
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
26Speech to text systems
G10L 17/24
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
17Speaker identification or verification
22Interactive procedures; Man-machine interfaces
24the user being prompted to utter a password or a predefined phrase
G10L 2021/02166
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0216characterised by the method used for estimating noise
02161Number of inputs available containing the signal or the noise to be suppressed
02166Microphone arrays; Beamforming
G10L 21/0216
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
21Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
02Speech enhancement, e.g. noise reduction or echo cancellation
0208Noise filtering
0216characterised by the method used for estimating noise
Applicants
  • 北京云知声信息技术有限公司 BEIJING UNISOUND INFORMATION TECHNOLOGY CO., LTD [CN]/[CN]
Inventors
  • 关海欣 GUAN, Haixin
Agents
  • 北京冠和权律师事务所 BEIJING CROWN & RIGHTS LAW FIRM
Priority Data
201811466502.803.12.2018CN
Publication Language Chinese (ZH)
Filing Language Chinese (ZH)
Designated States
Title
(EN) METHOD AND APPARATUS FOR INCREASING VOICE WAKE-UP SUCCESS RATE AND STORAGE MEDIUM
(FR) PROCÉDÉ ET APPAREIL D’AUGMENTATION DE TAUX DE RÉUSSITE D’ACTIVATION VOCALE ET SUPPORT D'INFORMATIONS
(ZH) 一种提升语音唤醒成功率的方法、装置及存储介质
Abstract
(EN)
A method and apparatus for increasing a voice wake-up success rate and a storage medium. The method is used for increasing the success rate for performing a voice wake-up operation on a terminal device in a dormant state. Original voice wake-up and microphone array signal processing which are relatively independent and are not correlated are organically combined, and a closed-loop feedback loop is constructed by correlating respective information of both; by means of the closed-loop feedback loop, the voice wake-up provides a true and accurate signal data range for the microphone array signal processing, so that the microphone array signal processing obtains accurate statistics information related to signals and noises, voice data of which an interfering noise is removed is transmitted to a wake-up engine so that a precise and quick wake-up result can be obtained.
(FR)
La présente invention concerne un procédé et à un appareil d'augmentation de taux de réussite d'activation vocale et un support d’informations. Le procédé sert à augmenter le taux de réussite afin d'effectuer une opération d'activation vocale sur un dispositif terminal à l'état de veille. Une activation vocale originale et un traitement de signal de réseau de microphones, qui sont relativement indépendants et qui ne sont pas corrélés, sont combinés organiquement et une boucle de rétroaction en boucle fermée est construite par corrélation d'informations respectives des deux ; au moyen de la boucle de rétroaction en boucle fermée, l'activation vocale fournit une plage de données de signal vraie et précise destinée au traitement de signal de réseau de microphones, afin que le traitement de signal de réseau de microphones obtienne des informations de statistiques précises concernant des signaux et des bruits, des données vocales dont un bruit d'interférence est éliminé étant transmises à un moteur d'activation afin de pouvoir obtenir un résultat d'activation précis et rapide.
(ZH)
一种提升语音唤醒成功率的方法、装置及存储介质,方法用于提升对处于休眠状态的终端设备进行语音唤醒操作的成功率,将原有的相对独立且互不联系的语音唤醒和麦克风阵列信号处理这两者进行有机结合,并通过关联这两者各自的信息以构建一个闭环反馈回路,闭环反馈回路使得语音唤醒为麦克风阵列信号处理提供真实准确的信号数据区间,以使麦克风阵列信号处理获得关于信号和噪声的准确统计量信息,将去除干扰噪声的语音数据传送至唤醒引擎后即可得到精准快速的唤醒结果。
Also published as
Latest bibliographic data on file with the International Bureau