Hadoop
Hadoop outputCollector
I have a mapreduce progr开发者_如何学JAVAam and is working fine, following are the signatures of map and reduce functions. The outputcollector presently is[详细]
2023-04-11 21:27 分类:问答Hadoop Sort map and reduce key value
If I had a file with random integers on ea开发者_运维百科ch line and wanted to sort the file using Hadoop, what would my mapper and reducer\'s input/output key and value be?Yahoo has sorted Peta and T[详细]
2023-04-11 16:08 分类:问答copying to and from hdfs within Hadoop Streaming
I asked a similar question to this earlier, but after doing some exploring, I have a better understanding of what\'s going on, but i\'d like to see if other people have alternative solutions to my app[详细]
2023-04-11 13:46 分类:问答how to write subquery and use "In" Clause in Hive
How can I use In clause in Hive I want to write something like this in Hive select x from y where y.z in (select distinct z from y) order by x;[详细]
2023-04-11 10:41 分类:问答Variants of Hadoop
A project of mine is to compare different varian开发者_C百科ts of Hadoop, it is said that there are many of them out there, but googling didn\'t work well for me :([详细]
2023-04-11 07:19 分类:问答How to uninstall Hadoop?
I am using Mac OSX and want to u开发者_StackOverflow中文版ninstall/re-install(clean) hadoop Please let me know how can I do that[详细]
2023-04-11 05:37 分类:问答google appengine mapper - map over range of dates
I would like to use the appengine mapper to iterate over a range of dates (from-date and to-date passed as properties to the configuration). For each date in the range, I would retrieve the entities t[详细]
2023-04-11 05:22 分类:问答Hadoop eclipse plugin : Unable to see output on console
I am using hadoop-0.20.2 from http://www.apache.org/dyn/closer.cgi/hadoop/common/ and I\'m using the following Eclipse plugin hadoop-0.20.1-eclipse-plugin.jar from http://code.google.com/p/hadoop-ecli[详细]
2023-04-11 03:11 分类:问答Does java api for hadoop writing require SSH?
Hi guys :Im trying to setup writes to a remote, single node hadoop instance (remote in that its running on my box in a VM)....[详细]
2023-04-11 01:41 分类:问答Starting jobs with direct calls to Hadoop from within SSH
I\'ve been able to kick off job flows using the elastic-mapreduce ruby library just fine. Now I have an instance which is still \'alive\' after it\'s jobs have finished. I\'ve logged in to is using SS[详细]
2023-04-10 23:24 分类:问答