Processing

Please wait...

Settings

Settings

Goto Application

1. WO2012059355 - STORAGE MANAGEMENT IN CLUSTERED DATA PROCESSING SYSTEMS

Publication Number WO/2012/059355
Publication Date 10.05.2012
International Application No. PCT/EP2011/068565
International Filing Date 24.10.2011
IPC
G06F 11/14 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
14Error detection or correction of the data by redundancy in operation, e.g. by using different operation sequences leading to the same result
G06F 11/20 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
16Error detection or correction of the data by redundancy in hardware
20using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
CPC
G06F 11/1425
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
14Error detection or correction of the data by redundancy in operation
1402Saving, restoring, recovering or retrying
1415at system level
142Reconfiguring to eliminate the error
1425by reconfiguration of node membership
G06F 11/1482
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
14Error detection or correction of the data by redundancy in operation
1479Generic software techniques for error detection or fault masking
1482by means of middleware or OS functionality
G06F 11/203
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
16Error detection or correction of the data by redundancy in hardware
20using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
202where processing functionality is redundant
2023Failover techniques
203using migration
G06F 11/2046
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
07Responding to the occurrence of a fault, e.g. fault tolerance
16Error detection or correction of the data by redundancy in hardware
20using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
202where processing functionality is redundant
2046where the redundant components share persistent storage
G06F 11/3006
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
30Monitoring
3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
3006where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
G06F 11/3055
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
11Error detection; Error correction; Monitoring
30Monitoring
3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
Applicants
  • INTERNATIONAL BUSINESS MACHINES CORPORATION [US]/[US] (AllExceptUS)
  • IBM UNITED KINGDOM LIMITED [GB]/[GB] (MG)
  • MEWHINNEY, Greg [US]/[US] (UsOnly)
  • PAFUMI, James [US]/[US] (UsOnly)
  • NEVAREZ, David [US]/[US] (UsOnly)
  • ROSALES, Jacob, Jason [US]/[US] (UsOnly)
Inventors
  • MEWHINNEY, Greg
  • PAFUMI, James
  • NEVAREZ, David
  • ROSALES, Jacob, Jason
Agents
  • ROBERTS, Scott
Priority Data
12/940,46805.11.2010US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) STORAGE MANAGEMENT IN CLUSTERED DATA PROCESSING SYSTEMS
(FR) GESTION DE STOCKAGE DANS DES SYSTÈMES DE TRAITEMENT DE DONNÉES EN MODE CLUSTER
Abstract
(EN)
A method, system, and computer program product utilizes cluster-awareness to effectively support a live partition mobility (LPM) event and provide recovery from node failure within a Virtual Input/Output (I/O) Server (VIOS) cluster. An LPM utility creates a monitoring thread on a first VIOS on initiation of a corresponding LPM event. The monitoring thread tracks a status of an LPM and records status information in the mobility table of a database. The LPM utility creates other monitoring threads on other VIOSes running on the (same) source server. If the first VIOS VIOS sustains one of multiple failures, the LPM utility provides notification to other functioning nodes/VIOSes. The LPM utility enables a functioning monitoring thread to update the LPM status. In particular, a last monitoring thread may perform cleanup/update operations within the database based on an indication that there are nodes on the first server that are in failed state.
(FR)
Le procédé, le système et le produit programme d'ordinateur selon l'invention utilisent la capacité de gestion de cluster pour prendre en charge efficacement un événement LPM (technologie Live Partition Mobility) et fournir une reprise après une défaillance de nœud à l'intérieur d'un cluster VIOS (serveur virtuel d'entrée/sortie). Un utilitaire LPM crée une chaîne de contrôle sur un premier VIOS lors de l'initiation d'un événement LPM correspondant. La chaîne de contrôle suit un état d'un LPM et enregistre des informations d'état dans la table de mobilité d'une base de données. L'utilitaire LPM crée d'autres tâches de contrôle sur d'autres VIOS s'exécutant sur le (même) serveur source. Si le premier VIOS subit une parmi de multiples défaillances, l'utilitaire LPM fournit une notification aux autres nœuds/VIOS opérationnels. L'utilitaire LPM permet à une chaîne de contrôle active de mettre à jour l'état LPM. En particulier, une dernière chaîne de contrôle peut exécuter des opérations de nettoyage/mise à jour à l'intérieur de la base de données en fonction d'une indication qu'il existe sur le premier serveur des noeuds qui sont défaillants.
Also published as
GB1306798.8
Latest bibliographic data on file with the International Bureau