k-means
Interpreting output from mahout clusterdumper
I ran a clustering test on crawled pages (more than 25K docs; personal data set). I've done a clusterdump: [Details]
2023-02-28 21:12 Category: Q&A
Is it possible to optimize this Matlab code for doing vector quantization with centroids from k-means?
I've created a codebook using k-means of size 4000x300 (4000 centroids, each with 300 features). Using the codebook, I then want to label an input vector (for purposes of binning later on). The input [Details]
2023-02-27 04:52 Category: Q&A
R: Clustering results are different every time I run
library(amap); set.seed(5); Kmeans(mydata, 5, iter.max=500, nstart=1, method="euclidean") [Details]
2023-02-26 01:28 Category: Q&A
R - 'princomp' can only be used with more units than variables
I am using R software (R Commander) to cluster my data. I have a smaller subset of my data containing 200 rows and about 800 columns. I am getting the following error when trying kmea [Details]
2023-02-25 19:12 Category: Q&A
In R, what is the difference between kmeans() and kmeansCBI()?
I tried both kmeans() and kmeansCBI() from the fpc package on my dataset, but they give different SSE values, so I don't know which one is correct. [Details]
2023-02-25 16:32 Category: Q&A
Deploying Mahout on hadoop cluster
I want to run Mahout's K-Means example in a Hadoop cluster of 5 machines. Which Mahout jar files do I need to keep on all the nodes in order for the K-Means to be executed in a [Details]
2023-02-25 14:16 Category: Q&A
Is it possible to specify your own distance function using scikit-learn K-Means Clustering?
Is it possible to specify your own distance function using scikit-learn K-Means Clustering? Here's a small kmeans that uses any of the 20-odd distances in [Details]
2023-02-21 13:19 Category: Q&A
OpenCV KMeans (K-Means) python number of output clusters issue
I'm using OpenCV's Python interface to do K-Means clustering of multidimensional data (usually of dimension 7). I'm getting strange [Details]
2023-02-21 04:18 Category: Q&A
Hierarchical k-Means in OpenCV without knowledge of "k"
I'm trying to cluster a set of 4D vectors, without knowing how many clusters there should be in advance. In the past, I've been able to use cvKmeans2 to cluster, given knowledge of the number of clu [Details]
2023-02-20 12:45 Category: Q&A
Most mutually distant k elements (clustering?)
I have a simple machine learning question: I have n (~110) elements and a matrix of all the pairwise distances. I would like to choose the 10 elements that are farthest apart. That is, I want to [Details]
2023-02-19 06:34 Category: Q&A