Predicted values based on the G-means clustering model.
Source
Adapted from clue
Arguments
- object
of class inheriting from
"kmeans"
.- newdata
matrix()
new data to predict on.- method
character(1)
distance metric to use. Either"euclidean"
,"manhattan"
, or"minkowski"
. Default is"euclidean"
.- p
numeric(1)
power of the Minkowski distance. Default is2
.- ...
additional arguments.
Details
The predict
method for G-means clustering assigns new data points to the nearest
cluster center identified by the G-means algorithm. The method uses the specified
distance metric to calculate the distance between each new data point and all
cluster centers, and then assigns each point to the cluster with the closest center.
The method
argument specifies the distance metric to use. The following options:
"euclidean"
: The euclidean distance is the default metric used in the k-means and is defined as $$ d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2} $$"manhatten"
: The Manhatten distance is defined as $$ d(x, y) = \sum_{i=1}^{n} |x_i - y_i| $$"minkowski"
: The Minkowski distance is defined as $$ d(x, y) = \left( \sum_{i=1}^{n} |x_i - y_i|^p \right)^{1/p}, $$ where \(p\) is a parameter that defines the distance type (e.g., \(p=2\) for Euclidean, \(p=1\) for Manhattan).