PATENTSCOPE

World Intellectual Property Organization
1. (WO2018053077) MICROPHONE SELECTION AND MULTI-TALKER SEGMENTATION WITH AMBIENT AUTOMATED SPEECH RECOGNITION (ASR)
Latest bibliographic data on file with the International Bureau

Pub. No.: WO/2018/053077
International Application No.: PCT/US2017/051480
Publication Date: 22.03.2018
International Filing Date: 14.09.2017
IPC: G10L 17/06 (2013.01), G10L 25/03 (2013.01)
Applicants: NUANCE COMMUNICATIONS, INC. [US/US]; One Wayside Road, Burlington, Massachusetts 01803 (US)
Inventors: PARADA, Pablo Peso (US)
SHARMA, Dushyant (US)
NAYLOR, Patrick (US)
Agent: DANNENBERG, Ross (US)
Priority Data:
62/394,286 14.09.2016 US
15/403,481 11.01.2017 US
Title (EN) MICROPHONE SELECTION AND MULTI-TALKER SEGMENTATION WITH AMBIENT AUTOMATED SPEECH RECOGNITION (ASR)
(FR) SÉLECTION DE MICROPHONES ET SEGMENTATION DE MULTIPLES LOCUTEURS AVEC RECONNAISSANCE VOCALE AUTOMATIQUE (ASR) AMBIANTE
Abstract:
(EN) Disclosed methods and systems are directed to determining a best microphone pair and segmenting sound signals. The methods and systems may include receiving a collection of sound signals comprising speech from one or more audio sources (e.g., meeting participants) and/or background noise. The methods and systems may include calculating a TDOA and determining, based on the TDOA and via robust statistics, the best pair of microphones. The methods and systems may also include segmenting sound signals from multiple sources.
(FR) La présente invention concerne des procédés et des systèmes servant à déterminer la meilleure paire de microphones et à segmenter des signaux sonores. Les procédés et les systèmes peuvent comprendre la réception d’un ensemble collecté de signaux sonores comprenant des paroles provenant d’une ou plusieurs sources audio (par exemple des participants à une réunion) et/ou du bruit de fond. Les procédés et les systèmes peuvent comprendre le calcul d’un retard de temps à l’arrivée (TDOA) et la détermination, sur la base du TDOA et via des statistiques robustes, de la meilleure paire de microphones. Les procédés et les systèmes peuvent également comprendre la segmentation de signaux sonores provenant de multiples sources.
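Illustrative sketch (not part of the bibliographic record): the abstract says a TDOA is calculated and the best microphone pair is chosen from it via robust statistics, but it does not say how either step is done. The Python sketch below shows one possible reading under stated assumptions: per-frame TDOA is estimated with GCC-PHAT, and the pair whose frame-level TDOAs have the smallest median absolute deviation (MAD) is taken as "best". Both choices, and every function and parameter name (gcc_phat_tdoa, frame_tdoas, select_pair, frame_len, max_tau), are hypothetical and are not taken from the application.

```python
from itertools import combinations

import numpy as np


def gcc_phat_tdoa(x, y, fs, max_tau=None):
    """Estimate the delay of y relative to x, in seconds, with GCC-PHAT.

    Assumption: GCC-PHAT is used only for illustration; the abstract does
    not name a TDOA estimator.
    """
    n = len(x) + len(y)                      # zero-pad to limit circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12           # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=n)
    max_shift = n // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)
    # Reorder so that index 0 corresponds to a lag of -max_shift samples.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs


def frame_tdoas(x, y, fs, frame_len, max_tau=None):
    """TDOA estimate for each non-overlapping frame of a microphone pair."""
    starts = range(0, len(x) - frame_len + 1, frame_len)
    return np.array([
        gcc_phat_tdoa(x[s:s + frame_len], y[s:s + frame_len], fs, max_tau)
        for s in starts
    ])


def mad(values):
    """Median absolute deviation, a robust measure of spread."""
    med = np.median(values)
    return float(np.median(np.abs(values - med)))


def select_pair(signals, fs, frame_len=1024, max_tau=None):
    """Return the pair with the most stable frame-level TDOA, plus all scores.

    Assumption: "best" means lowest MAD of the per-frame TDOAs; the
    application may define the selection criterion differently.
    """
    scores = {}
    for i, j in combinations(range(len(signals)), 2):
        tdoas = frame_tdoas(signals[i], signals[j], fs, frame_len, max_tau)
        scores[(i, j)] = mad(tdoas)
    best = min(scores, key=scores.get)
    return best, scores
```

Under these assumptions, select_pair(signals, fs) takes a list of equal-length one-dimensional arrays, one per microphone, and returns the chosen index pair together with the MAD score of every pair. A pair that observes the same talker coherently should yield nearly constant frame TDOAs and therefore a low score.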
Designated States: AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW.
African Regional Intellectual Property Organization (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW)
Eurasian Patent Organization (AM, AZ, BY, KG, KZ, RU, TJ, TM)
European Patent Office (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR)
African Intellectual Property Organization (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG).
Publication Language: English (EN)
Filing Language: English (EN)