3-1 Introduction (������)

[chinese][english]

The objective of data clustering is to identify clusters within the given dataset, such that similar data instances are likely to be within the same cluster. The original dataset is thus decomposed into disjoint (or fuzzy) clusters, with each cluster having a center to represent the cluster. We can use the cluster ceters (also known as centroids or prototypes) to represent the original dataset to acheve the following goals:

¸ê®Æ¤À¸s¡]data clustering¡^©Î¬O¤À¸sºtºâªk¡]clustering algorithms¡^¬O¤@ºØ±N¸ê®Æ¤ÀÃþ¦¨¸sªº¤èªk¡A¨ä¥D­nªº¥Øªº¤D¦b©ó§ä¥X¸ê®Æ¤¤¸û¬Û¦üªº´X­Ó¸s»E¡]clusters¡^¡A¨Ã§ä¥X¦U­Ó¸s»Eªº¥NªíÂI¡AºÙ¬°¤¤¤ßÂI¡]centroids¡^©Î¬O­ì«¬¡]prototypes¡^¡C¨Ï¥Î³o¨Ç¤¤¤ßÂI¨Ó¥Nªí­ì¥ý¤j¶qªº¸ê®ÆÂI¡A´N¥i¥H¹F¨ì¨â­Ó°ò¥»¥Ø¼Ð¡G


Data Clustering and Pattern Recognition (¸ê®Æ¤À¸s»P¼Ë¦¡¿ë»{)