Search International and National Patent Collections

1. (WO2017153283) METHOD FOR IMPORTING FILES WHICH ARE IN BINARY FORMAT INTO A DISTRIBUTED FILE SYSTEM, AND ASSOCIATED DISTRIBUTED FILE SYSTEM

Pub. No.:    WO/2017/153283    International Application No.:    PCT/EP2017/055042
Publication Date: Fri Sep 15 01:59:59 CEST 2017 International Filing Date: Sat Mar 04 00:59:59 CET 2017
IPC: G06F 17/30
Applicants: SIEMENS AKTIENGESELLSCHAFT
Inventors: BRONNER, Johanna
LALIC, Marko
HAPFELMEIER, Andreas
Title: METHOD FOR IMPORTING FILES WHICH ARE IN BINARY FORMAT INTO A DISTRIBUTED FILE SYSTEM, AND ASSOCIATED DISTRIBUTED FILE SYSTEM
Abstract:
The invention relates to a method for importing files (F1, F2, F3) which are in binary format (BF) into a distributed file system (HDFS), characterized by the following steps: a) scanning a first origin file (F1) and at least one further second origin file (F2; F3) from said files and b) continuously converting the scanned parts of the first origin file and of the at least one further second origin file into a uniformly specifiable data format that is readable for the distributed file system (HDFS), c) introducing the converted parts of the origin files, at least partly in parallel, into a temporary storage area in the distributed file system (HDFS), also called staging area (SG), independently of each other, wherein the converted parts of the first origin file (F1) are temporarily stored in at least one first temporary storage file (C11,..., C1n) and the converted parts of the second origin file (F2; F3) are temporarily stored in at least one further second temporary storage file (C21,...,C2n; C31,...,C3n), and d) providing one or more intermediate files (C11,..., C1n; C21,...,C2n; C31,...,C3n) of the origin files (F1; F2; F3) for the further processing thereof after a specifiable threshold for the maximum degree of filling of the temporary storage file in question has been exceeded.