(WO2019025960) REPLICATING AND MIGRATING FILES TO SECONDARY STORAGE SITES
Note: Text based on automatic Optical Character Recognition processes. Please use the PDF version for legal matters.

CLAIMS

1. A computer program product for managing files in a fileset at a primary storage site and secondary storage sites over a network, wherein the computer program product comprises a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause operations, the operations comprising:

migrating files in a fileset at the primary storage site to a file container, wherein data for the files migrated to the file container is distributed among the secondary storage sites;

updating file metadata for each of the migrated files to include access information on the migrated file in the file container;

deleting data for the migrated files in a local storage; and

replicating the file metadata for the migrated files to the secondary storage sites over the network to store in filesets at the secondary storage sites without replicating file data for the migrated files to store in the filesets at the secondary storage sites.
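
As an illustration only, the four operations of claim 1 can be sketched in Python. The class names, the `container_key` field, and the use of an in-memory dictionary as a stand-in for the distributed file container are all assumptions made for this sketch and do not come from the claims.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class FileMetadata:
    name: str
    size: int
    # Hypothetical "access information": where the file lives in the container.
    container_key: Optional[str] = None


@dataclass
class SecondarySite:
    # The secondary fileset receives metadata only, never the migrated file data.
    fileset_metadata: Dict[str, FileMetadata] = field(default_factory=dict)


@dataclass
class PrimarySite:
    local_data: Dict[str, bytes] = field(default_factory=dict)
    metadata: Dict[str, FileMetadata] = field(default_factory=dict)
    # Stand-in for the file container; a real one would distribute the data
    # among the secondary storage sites.
    file_container: Dict[str, bytes] = field(default_factory=dict)

    def migrate(self, names: List[str], secondaries: List[SecondarySite]) -> None:
        for name in names:
            key = f"container/{name}"
            # 1. Migrate the file data to the file container.
            self.file_container[key] = self.local_data[name]
            # 2. Update the file metadata with the access information.
            self.metadata[name].container_key = key
            # 3. Delete the file data from local storage.
            del self.local_data[name]
            # 4. Replicate only the metadata to each secondary site.
            for site in secondaries:
                site.fileset_metadata[name] = FileMetadata(
                    name, self.metadata[name].size, key
                )
```

After `migrate` returns, the primary site holds metadata but no local data for the file, and each secondary site holds a metadata copy that points into the container.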

2. The computer program product of claim 1, wherein the file container is distributed across the primary and the secondary storage sites, and wherein the distributing the data for the files migrated to the file container causes each migrated file of the migrated files to be encoded into fragments and distributed among the secondary storage sites.

3. The computer program product of claim 2, wherein the operations further comprise:

migrating the file metadata for the files migrated to the file container to a metadata container distributed across the primary and the secondary storage sites; and

copying a full copy of the file metadata to the metadata container at the secondary storage sites in response to the migrating the file metadata to the metadata container.

4. The computer program product of claim 2, wherein a migrated file of the migrated files is recalled by retrieving the fragments in the file container distributed at the secondary storage sites and then decoding the retrieved fragments to reconstruct the migrated file.
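
Claims 2 and 4 recite encoding a file into fragments and decoding retrieved fragments to reconstruct it, without naming a particular code. The sketch below assumes a simple single-parity XOR scheme purely for illustration; the function names `encode` and `decode` and the choice of code are hypothetical, and a production system would more likely use a stronger erasure code.

```python
from typing import List, Optional


def encode(data: bytes, k: int) -> List[bytes]:
    """Split data into k equal-size fragments plus one XOR parity fragment."""
    frag_len = -(-len(data) // k)  # ceiling division
    padded = data.ljust(frag_len * k, b"\x00")
    frags = [padded[i * frag_len:(i + 1) * frag_len] for i in range(k)]
    parity = bytearray(frag_len)
    for frag in frags:
        for i, b in enumerate(frag):
            parity[i] ^= b
    return frags + [bytes(parity)]


def decode(frags: List[Optional[bytes]], length: int) -> bytes:
    """Rebuild the original data; tolerates at most one missing fragment (None)."""
    missing = [i for i, f in enumerate(frags) if f is None]
    if len(missing) > 1:
        raise ValueError("single-parity code cannot recover more than one lost fragment")
    if missing:
        frag_len = len(next(f for f in frags if f is not None))
        rebuilt = bytearray(frag_len)
        # XOR of all surviving fragments (data and parity) yields the lost one.
        for f in frags:
            if f is not None:
                for i, b in enumerate(f):
                    rebuilt[i] ^= b
        frags = list(frags)
        frags[missing[0]] = bytes(rebuilt)
    # Drop the parity fragment and strip the padding.
    return b"".join(frags[:-1])[:length]
```

In this sketch each fragment would be stored at a different storage site, so a recall still succeeds when any one site is unreachable.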

5. The computer program product of claim 1, wherein the operations further comprise:

determining whether files in the fileset comprise files that have not been accessed within a threshold time, wherein the migrated files comprise a first group of files that have not been accessed within the threshold time; and

pre-migrating a second group of files in the fileset that have been accessed within the threshold time to the file container while leaving a copy of the files in the second group in the fileset.
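
The split between migrated and pre-migrated files in claim 5 can be sketched as a partition on last access time. The function name `classify`, the use of seconds as the time unit, and the `last_access` mapping are assumptions for this example.

```python
from typing import Dict, List, Tuple


def classify(
    last_access: Dict[str, float], now: float, threshold: float
) -> Tuple[List[str], List[str]]:
    """Return (files to migrate, files to pre-migrate).

    Files idle for longer than the threshold are migrated (local data removed);
    recently accessed files are pre-migrated (copied, local copy kept).
    """
    migrate: List[str] = []
    premigrate: List[str] = []
    for name, accessed in sorted(last_access.items()):
        if now - accessed > threshold:
            migrate.append(name)
        else:
            premigrate.append(name)
    return migrate, premigrate
```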

6. The computer program product of claim 1, wherein the operations further comprise:

determining whether a condition is satisfied with respect to the files being migrated; and

replicating at least a portion of any of the files satisfying the condition to the secondary storage sites over the network.

7. The computer program product of claim 1, wherein the operations further comprise:

receiving an update to one of the files in the fileset comprising an updated file;

updating the file metadata for the updated file to include information on the updated file; and

replicating the file metadata for the updated file to the secondary storage sites over the network.

8. The computer program product of claim 7, wherein the operations further comprise:

replicating updated data of the updated file to the secondary storage sites over the network, wherein the updated data comprises changes to the updated file that are less than full content of the updated file.

9. The computer program product of claim 8, wherein the replicating the updated data to the secondary storage sites causes a secondary storage site of the secondary storage sites to perform:

updating a previous version of the updated file at the secondary storage site with the updated data when the secondary storage site includes a copy of the updated file;

recalling from the file container the updated file when the secondary storage site does not include a copy of the updated file; and

updating the recalled updated file with the updated data.
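
The behavior of a secondary storage site in claim 9 can be sketched as follows. The delta format (a list of offset and byte-string pairs), the `recall` callback, and the function name are hypothetical; the claims say only that the updated data is less than the full content of the file.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical delta format: (byte offset, replacement bytes).
Delta = List[Tuple[int, bytes]]


def apply_update(
    local_copies: Dict[str, bytes],
    name: str,
    delta: Delta,
    recall: Callable[[str], bytes],
) -> bytes:
    """Apply replicated changes at a secondary site, recalling the file if absent."""
    if name not in local_copies:
        # No local copy: recall the file from the file container first.
        local_copies[name] = recall(name)
    buf = bytearray(local_copies[name])
    for offset, chunk in delta:
        end = offset + len(chunk)
        if end > len(buf):
            # Grow the file when a change extends past the current end.
            buf.extend(b"\x00" * (end - len(buf)))
        buf[offset:end] = chunk
    local_copies[name] = bytes(buf)
    return local_copies[name]
```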

10. The computer program product of claim 1, wherein the operations further comprise:

determining whether a new file in the fileset whose file metadata was replicated to the secondary storage sites has not been migrated to the file container within a replication time threshold; and

replicating the new file to the secondary storage sites over the network in response to determining that the new file has not been migrated to the file container within the replication time threshold.
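
The check in claim 10 can be sketched as a small predicate. The claim does not state the instant from which the replication time threshold is measured; this sketch assumes it runs from the moment the file metadata was replicated, and the function and parameter names are hypothetical.

```python
from typing import Optional


def should_replicate_data(
    metadata_replicated_at: float,
    migrated_at: Optional[float],
    now: float,
    threshold: float,
) -> bool:
    """Return True when a new file's data should be replicated directly.

    Assumption: the threshold is measured from when the metadata was replicated.
    """
    if migrated_at is not None:
        # Already in the file container; secondaries can recall it from there.
        return False
    return now - metadata_replicated_at >= threshold
```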

11. The computer program product of claim 1, wherein the operations further comprise:

receiving a delete request to delete a file in the fileset;

deleting the file and the file metadata for the file to delete in the fileset;

sending a delete request to delete the file from the file container, which causes issuance of commands to delete data of the file distributed at the secondary storage sites; and

sending a delete request to delete the file metadata from a metadata container, which causes issuance of commands to delete the file metadata at the secondary storage sites.
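
The delete flow of claim 11 can be sketched with the containers modeled as per-site dictionaries. The `SiteStore` type, the function name, and the command strings are assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical model of a distributed container: site name -> {file name -> bytes}.
SiteStore = Dict[str, Dict[str, bytes]]


def delete_file(
    name: str,
    fileset: Dict[str, bytes],
    fileset_metadata: Dict[str, dict],
    file_container: SiteStore,
    metadata_container: SiteStore,
) -> List[str]:
    """Delete a file everywhere and return the per-site commands issued."""
    issued: List[str] = []
    # Delete the file and its metadata in the local fileset.
    fileset.pop(name, None)
    fileset_metadata.pop(name, None)
    # The container delete request fans out to each site holding file data.
    for site, store in file_container.items():
        if store.pop(name, None) is not None:
            issued.append(f"delete-data:{site}")
    # The metadata container delete request fans out the same way.
    for site, store in metadata_container.items():
        if store.pop(name, None) is not None:
            issued.append(f"delete-metadata:{site}")
    return issued
```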

12. The computer program product of claim 1, wherein the fileset at the primary storage site comprises a read and write fileset, and wherein the secondary storage sites have read-only filesets, and wherein the secondary storage sites comprise disaster recovery sites to use for fileset operations if the primary storage site is unavailable on the network.

13. A system for managing files in a fileset at a primary storage site in communication with a plurality of secondary storage sites over a network, comprising:

a local storage having filesets;

an object storage having a file container;

a server in communication with the local storage and the object storage to perform operations comprising:

migrating files in a fileset of the filesets to the file container, wherein data for the files migrated to the file container is distributed among the secondary storage sites;

updating file metadata for each of the migrated files to include access information on the migrated file in the file container;

deleting data for the migrated files in a local storage; and

replicating the file metadata for the migrated files to the secondary storage sites over the network to store in filesets at the secondary storage sites without replicating file data for the migrated files to store in the filesets at the secondary storage sites.

14. The system of claim 13, wherein the operations further comprise:

migrating the file metadata for the files migrated to the file container to a metadata container at the object storage distributed across the primary and the secondary storage sites; and

copying a full copy of the file metadata to the metadata container at the secondary storage sites in response to the migrating the file metadata to the metadata container.

15. The system of claim 13, wherein the operations further comprise:

determining whether files in the fileset comprise files that have not been accessed within a threshold time, wherein the migrated files comprise a first group of files that have not been accessed within the threshold time; and

pre-migrating a second group of files in the fileset that have been accessed within the threshold time to the file container while leaving a copy of the files in the second group in the fileset.

16. The system of claim 13, wherein the operations further comprise:

receiving an update to one of the files in the fileset comprising an updated file;

updating the file metadata for the updated file to include information on the updated file; and

replicating the file metadata for the updated file to the secondary storage sites over the network.

17. The system of claim 16, wherein the operations further comprise:

replicating updated data of the updated file to the secondary storage sites over the network, wherein the updated data comprises changes to the updated file that are less than full content of the updated file.

18. A system for managing files, comprising:

a primary storage site;

a plurality of secondary storage sites that communicate with the primary storage site over a network;

wherein the primary storage site includes:

a local storage having filesets;

an object storage having a file container;

a server in communication with the local storage and the object storage to perform operations comprising:

migrating files in a fileset of the filesets to the file container, wherein data for the files migrated to the file container is distributed among the secondary storage sites;

updating file metadata for each of the migrated files to include access information on the migrated file in the file container;

deleting data for the migrated files in a local storage; and

replicating the file metadata for the migrated files to the secondary storage sites over the network to store in filesets at the secondary storage sites without replicating file data for the migrated files to store in the filesets at the secondary storage sites.

19. The system of claim 18, further comprising:

migrating the file metadata for the files migrated to the file container to a metadata container distributed across the primary and the secondary storage sites; and

copying a full copy of the file metadata to the metadata container at the secondary storage sites in response to the migrating the file metadata to the metadata container.

20. The system of claim 18, wherein the operations further comprise:

determining whether files in the fileset have not been accessed within a threshold time, wherein the migrated files comprise a first group of files that have not been accessed within the threshold time; and

pre-migrating a second group of files in the fileset that have been accessed within the threshold time to the file container while leaving a copy of the files in the second group in the fileset.

21. A method for managing files in a fileset at a primary storage site and secondary storage sites over a network, comprising:

migrating files in a fileset at the primary storage site to a file container, wherein data for the files migrated to the file container is distributed among the secondary storage sites;

updating file metadata for each of the migrated files to include access information on the migrated file in the file container;

deleting data for the migrated files in a local storage; and

replicating the file metadata for the migrated files to the secondary storage sites over the network to store in filesets at the secondary storage sites without replicating file data for the migrated files to store in the filesets at the secondary storage sites.

22. The method of claim 21, further comprising:

migrating the file metadata for the files migrated to the file container to a metadata container distributed across the primary and the secondary storage sites; and

copying a full copy of the file metadata to the metadata container at the secondary storage sites in response to the migrating the file metadata to the metadata container.

23. The method of claim 21, further comprising:

determining whether files in the fileset comprise files that have not been accessed within a threshold time, wherein the migrated files comprise a first group of files that have not been accessed within the threshold time; and

pre-migrating a second group of files in the fileset that have been accessed within the threshold time to the file container while leaving a copy of the files in the second group in the fileset.

24. The method of claim 21, further comprising:

receiving an update to one of the files in the fileset comprising an updated file;

updating the file metadata for the updated file to include information on the updated file; and

replicating the file metadata for the updated file to the secondary storage sites over the network.

25. The method of claim 24, further comprising:

replicating updated data of the updated file to the secondary storage sites over the network, wherein the updated data comprises changes to the updated file that are less than full content of the updated file.