Heidsieck Gaëtan, De Oliveira Daniel, Pacitti Esther, Pradal Christophe, Tardieu François, Valduriez Patrick.
2020. Distributed caching of scientific workflows in multisite cloud.
In : Database and expert systems applications. DEXA 2020 (Part II). Hartmann Sven (ed.), Küng Josef (ed.), Kotsis Gabriele (ed.), Tjoa A. Min (ed.), Khalil Ismail (ed.)
Version post-print
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. DEXA_2020-1.pdf Télécharger (334kB) | Demander une copie |
|
Version publiée
- Anglais
Accès réservé aux personnels Cirad Utilisation soumise à autorisation de l'auteur ou du Cirad. ID596649.pdf Télécharger (1MB) | Demander une copie |
Résumé : Many scientific experiments are performed using scientific workflows, which are becoming more and more data-intensive. We consider the efficient execution of such workflows in the cloud, leveraging the heterogeneous resources available at multiple cloud sites (geo-distributed data centers). Since it is common for workflow users to reuse code or data from other workflows, a promising approach for efficient workflow execution is to cache intermediate data in order to avoid re-executing entire workflows. In this paper, we propose a solution for distributed caching of scientific workflows in a multisite cloud. We implemented our solution in the OpenAlea workflow system, together with cache-aware distributed scheduling algorithms. Our experimental evaluation on a three-site cloud with a data-intensive application in plant phenotyping shows that our solution can yield major performance gains, reducing total time up to 42% with 60% of same input data for each new execution.
Mots-clés libres : Adaptive Caching, Scientific workflows, Cloud computing, OpenAlea
Auteurs et affiliations
- Heidsieck Gaëtan, INRIA (FRA)
- De Oliveira Daniel, UFF (BRA)
- Pacitti Esther, Université de Montpellier (FRA)
- Pradal Christophe, CIRAD-BIOS-UMR AGAP (FRA) ORCID: 0000-0002-2555-761X
- Tardieu François, INRAE (FRA)
- Valduriez Patrick, INRIA (FRA)
Autres liens de la publication
Source : Cirad-Agritrop (https://agritrop.cirad.fr/596649/)
[ Page générée et mise en cache le 2024-04-04 ]