The Effect of Timeout Prediction and Selection on Wide Area
Collective Operations
James S. Plank,
Rich Wolski,
and
Matthew Allen.
The IEEE International Symposium on Network Computing and Applications,
Cambridge, MA, USA,
February 11-13, 2002.
Available via anonymous ftp to cs.utk.edu in
pub/plank/papers/NCA-01.pdf
and
pub/plank/papers/NCA-01.ps.Z.
Abstract
Failure identification is a fundamental operation concerning
exceptional conditions that
network programs must be able to perform. In this paper, we explore
the use of timeouts to perform failure
identification at the application level. We evaluate the use of
static timeouts, and of dynamic timeouts based on forecasts using
the Network Weather Service. For this evaluation, we perform
experiments on a wide-area collection of 31 machines distributed in
eight institions. Though the conclusions are limited to the collection
of machines used, we observe that a single static timeout is not
reasonable, even for a collection of similar machines over time.
Dynamic timeouts perform roughly as well as the best static
timeouts, and more importantly,
they provide a single methodology for timeout determination
that should be effective for wide-area applications.
Citation Information
- Plain Text:
author J. S. Plank and R. Wolski and M. Allen
title The Effect of Timeout Prediction and Selection on Wide
Area Collective Operations
booktitle IEEE International Symposium on Network Computing
and Applications (NCA-2001)
address Cambridge, MA
month October
year 2001
- Bibtex:
@INPROCEEDINGS{pwa:01:etp,
author = "J. S. Plank and R. Wolski and M. Allen",
title = "The Effect of Timeout Prediction and Selection on Wide
Area Collective Operations",
booktitle = "IEEE International Symposium on Network Computing
and Applications (NCA-2001)",
address = "Cambridge, MA",
month = "October",
year = "2001"
}