It is shown that a parallel checkpointing system can benefit from supports embedded in low-level communication systems in its implementation and to improve its performance.
该系统通过底层通信系统的支持降低了并行检查点的实现复杂度和执行开销,适用于大规模机群应用。
Checkpoint is an important means to implement fault-tolerance in parallel system. Synchronous checkpointing method has been widely used in network of workstation system.
检查点是并行系统中实现容错的重要手段,同步检查点方法已广泛应用在工作站机群系统中。
Checkpoint is an important means to implement fault-tolerance in parallel system. Synchronous checkpointing method has been widely used in network of workstation system.
检查点是并行系统中实现容错的重要手段,同步检查点方法已广泛应用在工作站机群系统中。
应用推荐