TY - JOUR
T1 - Reliable cluster computing with a new checkpointing RAID-x architecture
AU - Hwang, Kai
AU - Jin, Hai
AU - Ho, Roy
AU - Ro, Wonwoo
PY - 2000
Y1 - 2000
N2 - In a serverless cluster of PCs or workstations, the cluster must allow remote file accesses or parallel I/O directly performed over disks distributed to all client nodes. We introduce a new distributed disk array, called the RAID-x, for use in serverless clusters. The RAID-x architecture is based on an orthogonal striping and mirroring (OSM) scheme, which exploits full-bandwidth and protects the system from all single disk failures. The performance of the RAID-x is experimentally proven superior to RAID-1 and NFS in the Linux cluster environment. We propose a new striped checkpointing scheme, leveraging on striped parallelism and pipelined writing of successive disk stripes. This RAID-x architecture greatly enhances the throughput, reliability, and availability of scalable clusters. It appeals especially to I/O-centric cluster applications.
AB - In a serverless cluster of PCs or workstations, the cluster must allow remote file accesses or parallel I/O directly performed over disks distributed to all client nodes. We introduce a new distributed disk array, called the RAID-x, for use in serverless clusters. The RAID-x architecture is based on an orthogonal striping and mirroring (OSM) scheme, which exploits full-bandwidth and protects the system from all single disk failures. The performance of the RAID-x is experimentally proven superior to RAID-1 and NFS in the Linux cluster environment. We propose a new striped checkpointing scheme, leveraging on striped parallelism and pipelined writing of successive disk stripes. This RAID-x architecture greatly enhances the throughput, reliability, and availability of scalable clusters. It appeals especially to I/O-centric cluster applications.
UR - http://www.scopus.com/inward/record.url?scp=0033902746&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033902746&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:0033902746
SP - 171
EP - 184
JO - Proceedings of the Heterogeneous Computing Workshop, HCW
JF - Proceedings of the Heterogeneous Computing Workshop, HCW
ER -