Sempere Guilhem, Petel Adrien, Rouard Mathieu, Larmande Pierre, Hueber Yann, Frouin Julien, De Bellis Fabien.
2019. Managing and exploring large genotyping data with Gigwa.
. PAG
|
Version publiée
- Anglais
Utilisation soumise à autorisation de l'auteur ou du Cirad. 600950.pdf Télécharger (191kB) | Prévisualisation |
|
|
Version publiée
- Anglais
Utilisation soumise à autorisation de l'auteur ou du Cirad. paper34105_1.pdf Télécharger (537kB) | Prévisualisation |
Url - éditeur : https://www.intlpag.org/2022/images/pdf/2019/PAGXXVII-abstracts-workshops.pdf
Matériel d'accompagnement : 1 diaporama (12 vues)
Résumé : With the decreasing cost of genome sequencing, many laboratories are increasingly adopting genotyping technologies as a routine component of their analytical workflows, generating large datasets (e.g. VCF files) of genotyping information. Nevertheless, manipulating such large datasets remains a challenge for many scientists. In this context, we developed Gigwa (Genotype Investigator for Genome-Wide Analyses) with the aim of providing a user-friendly system to meet the requirements of scientists who need to filter large datasets and export them into various formats for subsequent analyses. Gigwa is species-agnostic, cross-platform, scalable and easy to deploy. It can be configured to run on a local computer or setup across servers to act as a data portal. It may be used to share data with collaborators while providing means to seek variants of interest based on location, functional annotations or genotype patterns. Based on NoSQL technology, it supports very large datasets (up to tens of millions of genotypes) when configured on suitable hardware. Its most attractive features are: ergonomic interface including user management, numerous import and export formats, powerful filtering engine, interoperability via REST APIs and connection to online or standalone tools. Gigwa now in version 2 (http://gigwa.southgreen.fr), is developed within the scope of South Green bioinformatics, a cross-institute platform and community dedicated to genetics and genomics of tropical and Mediterranean plants, based in Montpellier, France.
Mots-clés libres : NoSQL database, SNP markers, INDELs, VCF, Web tool, Interoperability
Auteurs et affiliations
- Sempere Guilhem, CIRAD-BIOS-UMR INTERTRYP (FRA) ORCID: 0000-0001-7429-2091
- Petel Adrien, IRD (FRA)
- Rouard Mathieu, Bioversity International (FRA)
- Larmande Pierre, IRD (FRA)
- Hueber Yann, Bioversity International (ITA)
- Frouin Julien, CIRAD-BIOS-UMR AGAP (FRA) ORCID: 0000-0003-1591-0755
- De Bellis Fabien, CIRAD-BIOS-UMR AGAP (FRA) ORCID: 0000-0001-7070-7691
Source : Cirad-Agritrop (https://agritrop.cirad.fr/600950/)
[ Page générée et mise en cache le 2024-11-29 ]