Agritrop
Accueil

Distributed caching of scientific workflows in multisite cloud

Heidsieck Gaëtan, De Oliveira Daniel, Pacitti Esther, Pradal Christophe, Tardieu François, Valduriez Patrick. 2020. Distributed caching of scientific workflows in multisite cloud. In : Database and expert systems applications. DEXA 2020 (Part II). Hartmann Sven (ed.), Küng Josef (ed.), Kotsis Gabriele (ed.), Tjoa A. Min (ed.), Khalil Ismail (ed.). Cham : Springer, 51-65. (Lecture Notes in Computer Science, 12392, 12392) ISBN 978-3-030-59050-5 International Conference on Database and Expert Systems Application (DEXA 2020). 31, Bratislava, Slovaquie, 14 Septembre 2020/17 Septembre 2020.

Communication avec actes
[img] Version post-print - Anglais
Accès réservé aux personnels Cirad
Utilisation soumise à autorisation de l'auteur ou du Cirad.
DEXA_2020-1.pdf

Télécharger (334kB) | Demander une copie
[img] Version publiée - Anglais
Accès réservé aux personnels Cirad
Utilisation soumise à autorisation de l'auteur ou du Cirad.
ID596649.pdf

Télécharger (1MB) | Demander une copie

Résumé : Many scientific experiments are performed using scientific workflows, which are becoming more and more data-intensive. We consider the efficient execution of such workflows in the cloud, leveraging the heterogeneous resources available at multiple cloud sites (geo-distributed data centers). Since it is common for workflow users to reuse code or data from other workflows, a promising approach for efficient workflow execution is to cache intermediate data in order to avoid re-executing entire workflows. In this paper, we propose a solution for distributed caching of scientific workflows in a multisite cloud. We implemented our solution in the OpenAlea workflow system, together with cache-aware distributed scheduling algorithms. Our experimental evaluation on a three-site cloud with a data-intensive application in plant phenotyping shows that our solution can yield major performance gains, reducing total time up to 42% with 60% of same input data for each new execution.

Mots-clés libres : Adaptive Caching, Scientific workflows, Cloud computing, OpenAlea

Auteurs et affiliations

  • Heidsieck Gaëtan, INRIA (FRA)
  • De Oliveira Daniel, UFF (BRA)
  • Pacitti Esther, Université de Montpellier (FRA)
  • Pradal Christophe, CIRAD-BIOS-UMR AGAP (FRA) ORCID: 0000-0002-2555-761X
  • Tardieu François, INRAE (FRA)
  • Valduriez Patrick, INRIA (FRA)

Autres liens de la publication

Source : Cirad-Agritrop (https://agritrop.cirad.fr/596649/)

Voir la notice (accès réservé à Agritrop) Voir la notice (accès réservé à Agritrop)

[ Page générée et mise en cache le 2024-04-04 ]