Do you know of any python mapreduce ready clustering libraries?
I have found some good libraries in Java (http://lucene.apache.org/mahout/), I'd prefer to use python though.
https://github.com/klbostee/dumbo/wi开发者_如何转开发ki (Python mapreduce API )
Edit --- I'm looking for mapreduce ready : Canopy, K-means, Means-shift,etc..
You can use Python in combination with Hadoop, if you like:
http://www.michael-noll.com/wiki/Writing_An_Hadoop_MapReduce_Program_In_Python
精彩评论