Processing

Please wait...

Settings

Settings

Goto Application

1. WO2022003462 - QUICK DATA EXPLORATION

Publication Number WO/2022/003462
Publication Date 06.01.2022
International Application No. PCT/IB2021/055202
International Filing Date 14.06.2021
IPC
G06F 16/34 2019.1
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
34Browsing; Visualisation therefor
G06N 20/00 2019.1
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20Machine learning
CPC
G06F 16/212
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
20of structured data, e.g. relational data
21Design, administration or maintenance of databases
211Schema design and management
212with details for data modelling support
G06F 16/215
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
20of structured data, e.g. relational data
21Design, administration or maintenance of databases
215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
G06F 16/2462
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
20of structured data, e.g. relational data
24Querying
245Query processing
2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
2462Approximate or statistical queries
G06F 16/254
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
20of structured data, e.g. relational data
25Integrating or interfacing systems involving database management systems
254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
G06K 9/6253
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62Methods or arrangements for recognition using electronic means
6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
6253User interactive design
G06K 9/6256
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62Methods or arrangements for recognition using electronic means
6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
6256Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
Applicants
  • INTERNATIONAL BUSINESS MACHINES CORPORATION [US]/[US]
  • IBM (CHINA) INVESTMENT COMPANY LTD. [CN]/[CN] (MG)
  • IBM DEUTSCHLAND GMBH [DE]/[DE] (MG)
Inventors
  • KANIA, Tomasz
  • GEDLICZKA, Tymoteusz
  • BRANDYS, Szymon
  • PITULA, Krzysztof
  • MADEJ, Maciej
  • GRZYWNA, Piotr
Agents
  • VETTER, Svenja
Priority Data
16/918,27601.07.2020US
Publication Language English (en)
Filing Language English (EN)
Designated States
Title
(EN) QUICK DATA EXPLORATION
(FR) EXPLORATION RAPIDE DE DONNÉES
Abstract
(EN) A computer-implemented method for quick data exploration of data to be uploaded may be provided. The method (100) comprises uploading from a local system a first data set (102), determining that the first data set is not corrupted (104). The method (100) also comprises in parallel to the uploading performing selecting from the first data set a predefined number of records and building a second data set (106), determining statistical data and metadata about the first data set (108), and visualizing the second data set, the statistical data and the metadata (110).
(FR) L'invention concerne un procédé mis en œuvre par ordinateur pour une exploration rapide de données sur des données à téléverser. Le procédé (100) comprend le téléversement, à partir d'un système local, d'un premier ensemble de données (102), et la détermination du fait que le premier ensemble de données n'est pas corrompu (104). Le procédé (100) comprend également, en parallèle avec le téléversement, la réalisation d'une sélection, à partir du premier ensemble de données, d'un nombre prédéfini d'enregistrements et la construction d'un second ensemble de données (106), la détermination de données statistiques et de métadonnées concernant le premier ensemble de données (108), et la visualisation du second ensemble de données, des données statistiques et des métadonnées (110).
Related patent documents
Latest bibliographic data on file with the International Bureau