
Deep semi-supervised clustering for multi-variate time-series

Ienco Dino, Interdonato Roberto. 2023. Deep semi-supervised clustering for multi-variate time-series. Neurocomputing, 516 : 36-47.

Journal article; Research article; Journal article with impact factor
Published version - English
Licensed under a Creative Commons license.
Ienco et Interdonato - 2023 - Deep semi-supervised clustering for multi-variate .pdf

HCERES list of journals (humanities and social sciences): yes

HCERES journal theme(s) (humanities and social sciences): Psychology-ethology-ergonomics

Abstract: Huge amounts of data are nowadays produced by a large and disparate family of sensors, which typically measure multiple variables over time. Such rich information can be profitably organized as multivariate time series. Collecting enough labelled samples to set up a supervised analysis of this kind of data is challenging, whereas it is reasonable to assume that a limited amount of background knowledge is available and can be injected into the analysis process. In this context, semi-supervised clustering methods are a well-suited tool for getting the most out of such a reduced amount of knowledge. To deal with multivariate time-series analysis under a limited background knowledge setting, we propose a semi-supervised (constrained) deep embedding time-series clustering framework that exploits knowledge supervision modeled as Must-link and Cannot-link constraints. In more detail, our proposal, named conDetSEC (constrained Deep embedding time SEries Clustering), is based on Gated Recurrent Units (GRUs) in order to explicitly manage the temporal dimension of multivariate time-series data. conDetSEC implements a procedure in which an embedding generation step is combined with a clustering refinement step. Both steps exploit the small amount of available knowledge provided by Must-link and Cannot-link constraints. More specifically, during embedding generation the constraints are used by jointly optimizing the network parameters via both unsupervised and semi-supervised tasks, while at the refinement step they are used together with the objective of stretching the embedding manifold towards the cluster centroids to recover a clearer cluster structure. Experimental evaluation on real-world benchmarks from diverse domains highlights the effectiveness of our proposal in comparison with state-of-the-art unsupervised and semi-supervised time-series clustering methods.
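
To make the role of Must-link and Cannot-link constraints more concrete, the following is a minimal sketch of how such pairwise constraints, together with a pull towards cluster centroids, could be scored on already computed embeddings. It is not the conDetSEC loss: the contrastive-style penalty, the `margin` parameter, and the function names `constraint_penalty` and `centroid_pull` are assumptions chosen for illustration, and the GRU encoder that would produce the embeddings is left out.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def constraint_penalty(embeddings, must_link, cannot_link, margin=1.0):
    """Illustrative contrastive-style penalty (an assumption, not the paper's loss).

    Must-link pairs are penalized by their squared distance, so they are
    pushed together. Cannot-link pairs are penalized only when they are
    closer than `margin`, so they are pushed apart up to that margin.
    """
    penalty = 0.0
    for i, j in must_link:
        penalty += euclidean(embeddings[i], embeddings[j]) ** 2
    for i, j in cannot_link:
        gap = max(0.0, margin - euclidean(embeddings[i], embeddings[j]))
        penalty += gap ** 2
    n_pairs = len(must_link) + len(cannot_link)
    return penalty / n_pairs if n_pairs else 0.0


def centroid_pull(embeddings, centroids):
    """Mean squared distance from each embedding to its nearest centroid.

    Hypothetical stand-in for a refinement objective that stretches the
    embeddings towards the cluster centroids.
    """
    total = 0.0
    for e in embeddings:
        total += min(euclidean(e, c) for c in centroids) ** 2
    return total / len(embeddings)


# Toy 2-D embeddings forming two well-separated groups.
embeddings = [[0.0, 0.0], [0.1, 0.0], [2.0, 2.0], [2.1, 2.0]]

# Constraints that agree with the groups give a small penalty...
good = constraint_penalty(embeddings, [(0, 1), (2, 3)], [(0, 2)])
# ...while constraints that contradict them give a large one.
bad = constraint_penalty(embeddings, [(0, 2)], [(0, 1)])

pull = centroid_pull(embeddings, [[0.05, 0.0], [2.05, 2.0]])
```

In a trainable setting, terms of this kind would typically be added to an unsupervised objective and minimized jointly over the network parameters, which is one way to read the joint optimization described in the abstract.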

Agrovoc keywords: time series analysis, data analysis, statistical methods, machine learning, algorithms

Additional keywords: deep learning

Free keywords: Deep Learning, Clustering, Time series, Time series analysis, Semi-supervised Learning

Agris classification: U10 - Computer science, mathematics and statistics

Cirad strategic field: CTS 7 (2019-) - Outside strategic fields

Non-EU funding agencies: Agence Nationale de la Recherche

Funded projects: (FRA) Hétérogénéité des données - Hétérogénéité des méthodes : un cadre collaboratif unifié pour l'analyse interactive de données temporelles

Authors and affiliations

Source: Cirad-Agritrop (https://agritrop.cirad.fr/608098/)

[ Page generated and cached on 2024-12-11 ]