Really Big ThingsAug 28, 2008, 09:01 (0 Talkback[s])
(Other stories by Douglas Eadline)
[ Thanks to Bryan Richard for this link. ]
"Long running jobs often write checkpoint files so they can re-start from the check point and thus not have to re-do the entire program run. This situation is what I would call manageable pain and is what makes clusters so attractive."