PATENTSCOPE

Search International and National Patent Collections
World Intellectual Property Organization
1. (WO2017058584) EXTRACTING FACTS FROM UNSTRUCTURED INFORMATION
Latest bibliographic data on file with the International Bureau   

Pub. No.: WO/2017/058584
International Application No.: PCT/US2016/052732
Publication Date: 06.04.2017
International Filing Date: 21.09.2016
IPC: G06F 17/30 (2006.01)
Applicants: MICROSOFT TECHNOLOGY LICENSING, LLC [US/US]; Attn: Patent Group Docketing (Bldg. 8/1000) One Microsoft Way Redmond, Washington 98052-6399 (US)
Inventors: CHALABI, Achraf Abdel Moneim Tawfik; (US).
ABDELBAKI, Ahmed Mohamed Emad Morsi; (US).
ANDERSON, Brandon Robert; (US).
ABDEL-REHEEM, Eslam Kamal Abdel-Aal; (US).
CHEN, Deqing; (US).
GERGUIS, Michel Naim Naguib; (US).
ABDELAZIZ, Sayed Hassan Sayed; (US).
MARTON, Yuval Yehezkel; (US)
Agent: MINHAS, Sandip; (US).
CHEN, Wei-Chen Nicholas; (US).
DRAKOS, Katherine J.; (US).
KADOURA, Judy M.; (US).
HOLMES, Danielle J.; (US).
SWAIN, Cassandra T.; (US).
WONG, Thomas S.; (US).
CHOI, Daniel; (US)
Priority Data:
14/867,620 28.09.2015 US
15/226,807 02.08.2016 US
Title (EN) EXTRACTING FACTS FROM UNSTRUCTURED INFORMATION
(FR) EXTRACTION DE FAITS D'INFORMATIONS NON STRUCTURÉES
Abstract:
(EN)A computer-implemented technique is described herein for extracting facts from unstructured text documents provided by one or more information sources. The technique uses a pipeline to perform this operation that involves, at least in part, providing a corpus of information items, extracting candidate facts from the information items, merging synonymous argument values associated with the candidate facts, organizing the candidate facts into relation clusters, and assessing the confidence level of the candidate facts within the relation clusters.
(FR)L'invention concerne une technique mise en oeuvre sur ordinateur pour extraire des faits de documents textuels non structurés fournis par une ou plusieurs sources d'informations. La présente technique utilise un pipeline pour exécuter cette opération, celle-ci consistant, au moins en partie, à fournir un corpus d'éléments d'informations, à extraire des faits des éléments d'informations candidats, à fusionner des valeurs d'arguments synonymes associées aux faits candidats, à organiser les faits candidats en groupements de relations, et à évaluer le niveau de confiance des faits candidats au sein des groupements de relations.
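The five stages named in the abstract (provide a corpus, extract candidate facts, merge synonymous argument values, organize facts into relation clusters, assess confidence) can be pictured with a toy sketch. This is not the method claimed in the application, which the abstract does not detail. The corpus, the synonym table, the regular-expression extractor, and the frequency-based confidence score below are all assumptions chosen only to make the stages concrete.

```python
import re
from collections import Counter, defaultdict

# Hypothetical toy corpus of information items (not from the application).
CORPUS = [
    "Microsoft was founded by Bill Gates.",
    "Microsoft Corp. was founded by Bill Gates.",
    "Apple was founded by Steve Jobs.",
    "Apple acquired Beats.",
]

# Hypothetical synonym table mapping argument variants to a canonical value.
SYNONYMS = {"Microsoft Corp.": "Microsoft"}

# Assumed extractor: a fixed pattern "<arg1> <relation> <arg2>."
PATTERN = re.compile(
    r"^(?P<arg1>.+?) (?P<rel>was founded by|acquired) (?P<arg2>.+?)\.$"
)


def extract_candidates(corpus):
    """Extract (arg1, relation, arg2) candidate facts from each item."""
    facts = []
    for item in corpus:
        match = PATTERN.match(item)
        if match:
            facts.append((match["arg1"], match["rel"], match["arg2"]))
    return facts


def merge_synonyms(facts, synonyms):
    """Replace each argument value with its canonical form, if one is known."""
    return [
        (synonyms.get(a1, a1), rel, synonyms.get(a2, a2))
        for a1, rel, a2 in facts
    ]


def cluster_by_relation(facts):
    """Group facts by relation, counting how often each argument pair occurs."""
    clusters = defaultdict(Counter)
    for a1, rel, a2 in facts:
        clusters[rel][(a1, a2)] += 1
    return clusters


def assess_confidence(clusters):
    """Score each fact by its share of the candidate facts in its cluster
    (an assumed heuristic, not the scoring described in the application)."""
    scores = {}
    for rel, pairs in clusters.items():
        total = sum(pairs.values())
        for (a1, a2), count in pairs.items():
            scores[(a1, rel, a2)] = count / total
    return scores


def run_pipeline(corpus, synonyms):
    """Chain the four processing stages over the supplied corpus."""
    facts = extract_candidates(corpus)
    facts = merge_synonyms(facts, synonyms)
    return assess_confidence(cluster_by_relation(facts))


if __name__ == "__main__":
    for fact, confidence in run_pipeline(CORPUS, SYNONYMS).items():
        print(fact, round(confidence, 2))
```

In this sketch, merging synonyms before clustering is what lets the two Microsoft sentences count as support for the same candidate fact rather than as two separate facts.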
Designated States: AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JP, KE, KG, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW.
African Regional Intellectual Property Organization (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW)
Eurasian Patent Organization (AM, AZ, BY, KG, KZ, RU, TJ, TM)
European Patent Office (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR)
African Intellectual Property Organization (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG).
Publication Language: English (EN)
Filing Language: English (EN)