Agritrop

DHOEM: a statistical simulation software for simulating new markers in real SNP marker data

Jacquin Laval, Cao Tuong-Vi, Grenier Cécile, Ahmadi Nourollah. 2015. DHOEM: a statistical simulation software for simulating new markers in real SNP marker data. BMC Bioinformatics, 16 (404), 8 p.

Journal article ; Research article ; Article in a journal with an impact factor ; Fully open-access journal
Published version - English
License: Creative Commons.
2015_DHOEM.pdf


Quartile: Q1, Subject: MATHEMATICAL & COMPUTATIONAL BIOLOGY / Quartile: Q2, Subject: BIOTECHNOLOGY & APPLIED MICROBIOLOGY / Quartile: Q3, Subject: BIOCHEMICAL RESEARCH METHODS

Abstract: Background: Numerous simulation tools based on specific assumptions have been proposed to simulate populations. Here we present a simulation tool named DHOEM (densification of haplotypes by loess regression and maximum likelihood), which is free from population assumptions and simulates new markers in real SNP marker data. The main objective of DHOEM is to generate, by statistical learning from an initial population, a new population that incorporates both real and simulated SNPs and matches the realized features of the initial population. Results: To demonstrate DHOEM's abilities, we used a sample of 704 haplotypes for 12 chromosomes with 8336 SNPs from a synthetic population used for breeding upland rice in Latin America. The distributions of allele frequencies, pairwise SNP LD coefficients and data structures of the marker data set, before and after marker densification, were shown to be in relatively good agreement at moderate degrees of marker densification. DHOEM is a user-friendly tool that lets the user specify the desired level of marker density together with a user-defined minor allele frequency (MAF) limit, and it produces the densified data in a reasonable computation time. Conclusions: DHOEM is a user-friendly and useful tool for simulation and methodological studies in quantitative genetics and breeding. (Author's abstract)
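
The abstract compares allele frequencies and pairwise LD coefficients before and after densification, subject to a MAF limit. The short Python sketch below shows how such summary statistics could be computed from phased 0/1 haplotypes. It is not DHOEM's code or algorithm: the function names and the toy data are hypothetical, and using r² as the LD coefficient is an assumption, since the abstract does not say which coefficient was used.

```python
# Illustrative sketch only; not taken from DHOEM.
# Haplotypes are rows of 0/1 alleles, one column per SNP (assumed encoding).

def allele_frequencies(haplotypes):
    """Frequency of the allele coded 1 at each SNP."""
    n = len(haplotypes)
    n_snps = len(haplotypes[0])
    return [sum(h[j] for h in haplotypes) / n for j in range(n_snps)]

def minor_allele_frequencies(haplotypes):
    """Minor allele frequency (MAF) at each SNP."""
    return [min(p, 1.0 - p) for p in allele_frequencies(haplotypes)]

def snps_passing_maf(haplotypes, maf_limit):
    """Indices of SNPs whose MAF is at least maf_limit."""
    mafs = minor_allele_frequencies(haplotypes)
    return [j for j, maf in enumerate(mafs) if maf >= maf_limit]

def ld_r2(haplotypes, i, j):
    """Pairwise LD between SNPs i and j as r^2 = D^2 / (p_i q_i p_j q_j)."""
    n = len(haplotypes)
    p_i = sum(h[i] for h in haplotypes) / n
    p_j = sum(h[j] for h in haplotypes) / n
    p_ij = sum(h[i] * h[j] for h in haplotypes) / n
    d = p_ij - p_i * p_j
    denom = p_i * (1.0 - p_i) * p_j * (1.0 - p_j)
    # r^2 is undefined when either SNP is monomorphic
    return d * d / denom if denom > 0 else float("nan")

# Hypothetical toy data: 4 haplotypes, 4 SNPs
toy = [
    [0, 0, 1, 0],
    [0, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 1, 0, 1],
]

mafs = minor_allele_frequencies(toy)
kept = snps_passing_maf(toy, 0.3)
r2_01 = ld_r2(toy, 0, 1)
r2_03 = ld_r2(toy, 0, 3)
```

Running the same functions on a data set before and after densification, and comparing the resulting distributions, would mirror the kind of agreement check the abstract describes.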

Agrovoc keywords: Simulation models, Mathematical models, Population genetics, Plant breeding, Case studies, Oryza sativa, Alleles, Halophytes, Genetic equilibrium, Allele frequency, Genomics, Selection, Upland rice

Agrovoc geographic keywords: Latin America

Free keywords: Data simulation, Data structure, Likelihood, Non-parametric, LD, Haplotypes, SNP, Genomic relationship, Matrix, Minor allele frequency

Agris classification: U10 - Mathematical and statistical methods
F30 - Plant genetics and breeding

Cirad strategic field: Axis 1 (2014-2018) - Ecologically intensive agriculture

Authors and affiliations

  • Jacquin Laval, CIRAD-BIOS-UMR AGAP (FRA)
  • Cao Tuong-Vi, CIRAD-BIOS-UMR AGAP (FRA)
  • Grenier Cécile, CIRAD-BIOS-UMR AGAP (COL)
  • Ahmadi Nourollah, CIRAD-BIOS-UMR AGAP (FRA) ORCID: 0000-0003-0072-6285

Source: Cirad-Agritrop (https://agritrop.cirad.fr/578824/)
