Hadoop
Extreme amount of overhead in simple MapReduce job
I\'m experimenting with Hadoop and created a very simply map and reduce job. The input is a 30 line text file, and the output is only 3 lines (it\'s an excerpt of a log file, where the map extracts a[详细]
2023-03-31 12:44 分类:问答Is it faster to compare String's, or byte arrays?
So, mig开发者_如何学Pythonht sound like an odd question, but is it faster to compare 2 String\'s, or byte[]\'s (using Arrays.equals())? I\'m working with Hadoop/Hbase, and I get byte[] as the value fr[详细]
2023-03-30 15:22 分类:问答Custom MapReduce Input Format - Can't Find Constructor
I\'m writing a custom InputFormat for Hadoop 0.20.2 and am running into a NoSuchMethodException I can\'t get rid of.I started with:[详细]
2023-03-30 12:51 分类:问答Multiple volume & limit disk usage with Hadoop
I am using Hadoop to processing on large set of data. I set up a hadoop node to use multiple volumes : one of these volume is a NAS with 10To disk, and the other one is the local disk from server with[详细]
2023-03-30 04:46 分类:问答is hadoop necessary to run mahout-in-action examples?
is hadoop necessary to run the Mahout In Action examples? i saw that there is a hadoop jar provided with mahout. i have been having problems with build-reuters.sh and was wondering if a seper开发者_St[详细]
2023-03-30 00:04 分类:问答Efficient and scalable storage for JSON data with NoSQL databases
We are working on a project which should collect journal and audit data and store it in a datastore for archive purposes and some views. We are not quite sure which datastore would work for us.[详细]
2023-03-29 15:10 分类:问答Error while running Mapreduce program
am getting the following error while Running a Map-reduce program. The开发者_如何学JAVA program is to sort the o/p using TotalOrderpartition.[详细]
2023-03-29 11:55 分类:问答Hadoop on OSX "Unable to load realm info from SCDynamicStore"
I am getting this error on startup of H开发者_如何学Pythonadoop on OSX 10.7: Unable to load realm info from SCDynamicStore[详细]
2023-03-29 07:14 分类:问答Recommendations using R with SimpleDB or BigQuery or using PHP with SimpleDB
I am currently working on syste开发者_高级运维m that generated product recommendations like those on Amazon : \"People who bought this also bought this..\"[详细]
2023-03-29 01:20 分类:问答running hadoop on Google app engine?
Is it possible to run map reduce jobs on Google app engine? Any reference or tu开发者_如何转开发torial would help[详细]
2023-03-28 20:35 分类:问答