Sciweavers

90 search results - page 2 / 18
» System log pre-processing to improve failure prediction
Sort
View
167
Voted
SRDS
2006
IEEE
16 years 13 days ago
Call Availability Prediction in a Telecommunication System: A Data Driven Empirical Approach
Availability prediction in a telecommunication system plays a crucial role in its management, either by alerting the operator to potential failures or by proactively initiating pr...
Günther A. Hoffmann, Miroslaw Malek
IPPS
2006
IEEE
16 years 14 days ago
Evaluating cooperative checkpointing for supercomputing systems
Cooperative checkpointing, in which the system dynamically skips checkpoints requested by applications at runtime, can exploit system-level information to improve performance and ...
Adam J. Oliner, Ramendra K. Sahoo
ICPP
2007
IEEE
16 years 22 days ago
Fault-Driven Re-Scheduling For Improving System-level Fault Resilience
The productivity of HPC system is determined not only by their performance, but also by their reliability. The conventional method to limit the impact of failures is checkpointing...
Yawei Li, Prashasta Gujrati, Zhiling Lan, Xian-He ...
IPPS
2006
IEEE
16 years 14 days ago
Predicting failures of computer systems: a case study for a telecommunication system
The goal of online failure prediction is to forecast imminent failures while the system is running. This paper compares Similar Events Prediction (SEP) with two other well-known t...
Felix Salfner, M. Schieschke, Miroslaw Malek
156
Voted
IPPS
2005
IEEE
16 years 1 days ago
Proactive Fault Handling for System Availability Enhancement
Proactive fault handling combines prevention and repair actions with failure prediction techniques. We extend the standard availability formula by five key measures: (1) precisio...
Felix Salfner, Miroslaw Malek