Saab Nassif, Huchard Marianne, Martin Pierre.
2022. Evaluating formal concept analysis software for anomaly detection and correction.
In : Proceedings of the Sixteenth International Conference on Concept Lattices and Their Applications (CLA 2022). Cordero Pablo (ed.), Krídlo Ondrej (ed.)
|
Version publiée
- Anglais
Sous licence . ID603626.pdf Télécharger (196kB) | Prévisualisation |
Résumé : Data cleaning is a process that precedes data mining. Particularly, in our dataset on pesticidal plant use, several types of anomalies were identified, ranging from incorrect values to a lack of data susceptible of causing users to draw wrong conclusions during its exploration. Literature presents three methods based on Formal Concept Analysis (FCA), i.e. implication rules computation, association rules computation, and attribute exploration, that may allow the detection and correction of anomalies. This paper evaluates 30 FCA-based software and their apposite features to the development of an anomaly detection and correction method applicable to our dataset. Results show that only ConExp and its reimplementations provide all three methods. Since the data model on plant use is relational but ConExp only allows formal contexts as input, this paper concludes on the importance of integrating Relational Concept Analysis (RCA) with ConExp in future work.
Mots-clés libres : Formal concept analysis, Software evaluation, Anomaly detection, Anomaly correction, Data cleaning
Agences de financement hors UE : Montpellier Université d'Excellence
Auteurs et affiliations
- Saab Nassif, Université de Montpellier (FRA)
- Huchard Marianne, Université de Montpellier (FRA)
- Martin Pierre, CIRAD-PERSYST-UPR AIDA (FRA) ORCID: 0000-0002-4874-5795
Source : Cirad-Agritrop (https://agritrop.cirad.fr/603626/)
[ Page générée et mise en cache le 2023-09-19 ]