machine-learning
How can I weight features for better clustering with a very small data set?
I\'m working on a program that takes in several (<50) high dimension points in feature space (1000+ dimensions) and performing hierarchical clustering on them by recursively using standard k-cluste[详细]
2023-03-20 16:19 分类:问答What classifiers to use for deciding if two datasets depict the same individual?
Suppose I have pictures of faces of a set of individuals. The question I\'m trying to answer is: \"do these two pictures represent the same individual\"?[详细]
2023-03-20 09:27 分类:问答In practice, how many machines do you need in order for Hadoop / MapReduce / Mahout to speed up very parallelizable computations?
I need to do some heavy machine learning computations. I have a small number of machines idle on a LAN. How many machines would I need in order for distrubuting my computations using hadoop / mapreduc[详细]
2023-03-20 06:14 分类:问答machine learning predict classification
I have the following problem. I have a training开发者_如何学C dataset comprising of a range of numbers. Each number belongs to a certain class. There are five classes.[详细]
2023-03-19 19:06 分类:问答weka java api stringtovector exception
so I have this code that uses Weka\'s Java API: String html = \"blaaah\"; Attribute input = new Attribute(\"html\",(FastVector) null);[详细]
2023-03-19 10:25 分类:问答Large scale naïve Bayes classifier with top-k output
I need a library for naïve Bayes large scale, with millions of training examples and +100k b开发者_JAVA百科inary features. It must be an online version (updatable after training). I also need top-k o[详细]
2023-03-19 10:20 分类:问答Creating a string attribute in Weka Java API
I\'m trying to create a new string Attribute using Weka\'s Java API... Reading through the API javadocs, it appears that the way to do so is to use this constructor:[详细]
2023-03-18 23:30 分类:问答DerivativeCheck fails with minFunc
I\'m trying to train a single layer of an autoencoder using minFunc, and while the cost function appears to decrease, when enabled, the DerivativeCheck fails. The code I\'m using isas close to textboo[详细]
2023-03-18 22:03 分类:问答Ways to improve Image Pixel Classification
Here is the problem we are trying to solve: Goal is to classify pixels of a colored image into 3 different classes.[详细]
2023-03-18 13:37 分类:问答What are some good ways of estimating 'approximate' semantic similarity between sentences?
I have been looking at the nlp tag on SO for the past couple of hours and am confident I did not miss anything but if I did, please do point me to the question.[详细]
2023-03-18 12:27 分类:问答