The importance of complete data sets for job scheduling simulations

Varování

Publikace nespadá pod Pedagogickou fakultu, ale pod Fakultu informatiky. Oficiální stránka publikace je na webu muni.cz.
Autoři

KLUSÁČEK Dalibor RUDOVÁ Hana

Rok publikování 2010
Druh Článek ve sborníku
Konference Job Scheduling Strategies for Parallel Processing, Revised Selected Papers
Fakulta / Pracoviště MU

Fakulta informatiky

Citace
www URL
Doi http://dx.doi.org/10.1007/978-3-642-16505-4_8
Obor Informatika
Klíčová slova Grid; Cluster; Scheduling; MetaCentrum; Workload; Failures; Specific Job Requirements
Popis This paper has been inspired by the study of the complex data set from the Czech National Grid MetaCentrum. Unlike other widely used workloads from Parallel Workloads Archive or Grid Workloads Archive, this data set includes additional information concerning machine failures, job requirements and machine parameters which allows to perform more realistic simulations. We show that large differences in the performance of various scheduling algorithms appear when these additional information are used. Moreover, we studied other publicly available workloads and partially reconstructed information concerning their machine failures and job requirements using statistical and analytical models to demonstrate that similar behavior is also expectable for other workloads. We suggest that additional information about both machines and jobs should be incorporated into the workloads archives to allow proper and more realistic simulations.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.