In the k-means clustering method, clusters are created around ________.

Prepare for the Data Mining Test with our comprehensive quizzes. Practice with various question types, each with hints and explanations. Boost your understanding and ensure success on your exam!

Multiple Choice

In the k-means clustering method, clusters are created around ________.

Explanation:
In k-means, each cluster is organized around its centroid, which is the mean of all points assigned to that cluster. The algorithm repeatedly assigns each point to the nearest centroid and then recalculates each centroid as the average of the points in that cluster. This choice of the mean is deliberate because the mean minimizes the sum of squared distances from the points to the center, aligning exactly with what k-means optimizes. So the clusters form around the mean. The other options aren’t used as the central point in standard k-means: the median would correspond to a different objective (minimizing sum of absolute deviations, as in k-medians), the mode isn’t a robust center for continuous data, and the maximum isn’t a meaningful center for clustering.

In k-means, each cluster is organized around its centroid, which is the mean of all points assigned to that cluster. The algorithm repeatedly assigns each point to the nearest centroid and then recalculates each centroid as the average of the points in that cluster. This choice of the mean is deliberate because the mean minimizes the sum of squared distances from the points to the center, aligning exactly with what k-means optimizes. So the clusters form around the mean.

The other options aren’t used as the central point in standard k-means: the median would correspond to a different objective (minimizing sum of absolute deviations, as in k-medians), the mode isn’t a robust center for continuous data, and the maximum isn’t a meaningful center for clustering.

Subscribe

Get the latest from Passetra

You can unsubscribe at any time. Read our privacy policy