TY - JOUR
T1 - An efficient algorithm-based fault tolerance design using the weighted data-check relationship
AU - Youn, Hee Yong
AU - Oh, Choong Gun
AU - Choo, Hyunseung
AU - Chung, Jin Wook
AU - Lee, Dongman
PY - 2001/4
Y1 - 2001/4
N2 - VLSI-based processor arrays have been widely used for computation intensive applications such as matrix and graph algorithms. Algorithm-based fault tolerance designs employing various encoding/decoding schemes have been proposed for such systems to effectively tolerate operation time fault. In this paper, we propose an efficient algorithm-based fault tolerance design using the weighted data-check relationship, where the checks are obtained from the weighted data. The relationship is systematically defined as a new (n,k,Nw) Hamming checksum code, where n is the size of the code word, k is the number of information elements in the code word, and Nw is the number of weights employed, respectively. The proposed design with various weights is evaluated in terms of time and hardware overhead as well as overflow probability and round-off error. Two different schemes employing the (n,k,2) and (n,k,3) Hamming checksum code are illustrated using important matrix computations. Comparison with other schemes reveals that the (n,k,3) Hamming checksum scheme is very efficient, while the hardware overhead is small.
AB - VLSI-based processor arrays have been widely used for computation intensive applications such as matrix and graph algorithms. Algorithm-based fault tolerance designs employing various encoding/decoding schemes have been proposed for such systems to effectively tolerate operation time fault. In this paper, we propose an efficient algorithm-based fault tolerance design using the weighted data-check relationship, where the checks are obtained from the weighted data. The relationship is systematically defined as a new (n,k,Nw) Hamming checksum code, where n is the size of the code word, k is the number of information elements in the code word, and Nw is the number of weights employed, respectively. The proposed design with various weights is evaluated in terms of time and hardware overhead as well as overflow probability and round-off error. Two different schemes employing the (n,k,2) and (n,k,3) Hamming checksum code are illustrated using important matrix computations. Comparison with other schemes reveals that the (n,k,3) Hamming checksum scheme is very efficient, while the hardware overhead is small.
KW - Algorithm-based fault tolerance
KW - Hamming correcting code
KW - Matrix computations
KW - Overflow
KW - Round-off error
KW - VLSI processor array
UR - https://www.scopus.com/pages/publications/0035308631
U2 - 10.1109/12.919281
DO - 10.1109/12.919281
M3 - Article
AN - SCOPUS:0035308631
SN - 0018-9340
VL - 50
SP - 371
EP - 383
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 4
ER -