Comparaison entre les indices de validité  pour obtenir le cluster optimal dans le  clustering.

FEDDANE, Esma Nour El Houda; BRAHIMI, Fatima Zohra

Veuillez utiliser cette adresse pour citer ce document : http://dspace1.univ-tlemcen.dz/handle/112/20930

Titre:	Comparaison entre les indices de validité pour obtenir le cluster optimal dans le clustering.
Auteur(s):	FEDDANE, Esma Nour El Houda BRAHIMI, Fatima Zohra
Mots-clés:	clustering, validity indexes, K-means clustering, indices de validité, K-means.
Date de publication:	4-jui-2023
Editeur:	19-11-2023
Référence bibliographique:	salle des thèses
Collection/Numéro:	son.for.p.;
Résumé:	Clustering is a model that explores the similarities and intrinsic structures of data to group objects into clusters, without using pre-existing labels. There are many clustering methods, and one of the most commonly used is the k-means method. In this thesis, we propose a comparison between several validity indices by combining them with the k-means algorithm to determine the optimal number of clusters. The method involves varying the number of clusters within a predefined range [kmin, kmax] and extracting the best index values representing the final number of clusters. The compared indices are Calinski and Harabas index, Hartiga, Ball and Hall index, Dunn index, Silhouette index, Davies Bouldin index, Xie Beni index, WSJI, WB, and Bayesian Information Criterion. The experimentation and comparison of validity indices were performed on synthetic datasets. The results confirm the effectiveness of certain indices such as Calinski and Harabas indices, Silhouette index, Davies Bouldin index, and Sum of Squares (WB) among different datasets.
URI/URL:	http://dspace1.univ-tlemcen.dz/handle/112/20930
Collection(s) :	Master MID

Fichier(s) constituant ce document :

Fichier	Description	Taille	Format
Comparaison-entre-les-indices-de-validite-pour-obtenir-le-cluster-optimal-dans-le-clustering..pdf	CD	2,77 MB	Adobe PDF	Voir/Ouvrir

Affichage détaillé