| classifdist {fpc} | R Documentation |
Various methods for classification of unclustered points from
clustered points for use within functions nselectboot
and prediction.strength.
classifdist(cdist,clustering,
method="averagedist",
centroids=NULL,nnk=1)
classifnp(data,clustering,
method="centroid",cdist=NULL,
centroids=NULL,nnk=1)
cdist |
dissimilarity matrix or |
data |
something that can be coerced into a an
|
clustering |
integer vector. Gives the cluster number (between 1 and k for k clusters) for clustered points and should be -1 for points to be classified. |
method |
one of |
centroids |
for |
nnk |
number of nearest neighbours if |
classifdist is for data given as dissimilarity matrix,
classifnp is for data given as n times p data matrix.
The following methods are supported:
assigns observations to the cluster with closest
cluster centroid as specified in argument centroids (this
is associated to k-means and pam/clara-clustering).
assigns to the cluster to which an observation has the minimum average dissimilarity to all points in the cluster (this is associated to average linkage clustering).
only in classifnp. Classifies by quadratic
discriminant analysis (this is associated to Gaussian clusters
with flexible covariance matrices), calling
qda with default settings. If
qda gives an error (usually because a class
was too small), lda is used.
classifies by nnk nearest neighbours (for
nnk=1, this is associated with single linkage clustering).
Calls knn in classifnp.
An integer vector giving cluster numbers for all observations; those for the observations already clustered in the input are the same as in the input.
Christian Hennig c.hennig@ucl.ac.uk http://www.homepages.ucl.ac.uk/~ucakche/
prediction.strength, nselectboot
set.seed(20000) x1 <- rnorm(50) y <- rnorm(100) x2 <- rnorm(40,mean=20) x3 <- rnorm(10,mean=25,sd=100) x <- cbind(c(x1,x2,x3),y) truec <- c(rep(1,50),rep(2,40),rep(3,10)) topredict <- c(1,2,51,52,91) clumin <- truec clumin[topredict] <- -1 classifnp(x,clumin, method="averagedist") classifnp(x,clumin, method="qda") classifdist(dist(x),clumin, centroids=c(3,53,93),method="centroid") classifdist(dist(x),clumin,method="knn")