Hadoop
How to deal with unbalanced input of reduce task?
Recently I was asked how to deal with unbalanced input of reduce task. I thought for while and try to redistribute the data, but didn\'t come up with a goo开发者_运维知识库d solution. Any advice?Actua[详细]
2023-04-03 10:47 分类:问答Why Hive errors FileNotFoundException when a UDF use some external resource files but runs right at native?
A 开发者_高级运维UDF used some external resource files, then it error: \"java.io.FileNotFoundException: resource/placeMap.txt (No such file or directory)\",[详细]
2023-04-03 09:39 分类:问答How to start learning hadoop [closed]
Closed. This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this po[详细]
2023-04-03 08:39 分类:问答Hadoop Hive DOUBLE type question
Quick Hive/Hadoop question from a new user. I have a DOUBLE column that has \"1.8E8\" for value, does it mean I reached the max value for DOUBLE?开发者_运维技巧[详细]
2023-04-02 17:14 分类:问答Text Mining on huge list of strings
I have list of strings. (pretty big list of ids and strings scattered in 4-5 big files. around a GB each). These strings are formatted like this:[详细]
2023-04-02 15:55 分类:问答How to expose the task tracker/Job tracker webinterface to the public in hadoop?
I\'m trying to monitor different cluster nodes, but everytime I have to ssh -X to the node and start the browser to take a look at the status information.[详细]
2023-04-02 05:48 分类:问答Why all the reduce tasks are ending up in a single machine?
I wrote a relatively simple map-reduce program in Hadoop platform (cloudera distribution). Each Map & Reduce write some diagnostic information to standard ouput besides the regular map-reduce task[详细]
2023-04-02 03:47 分类:问答hbase - how to change tables structure without drop tables
i use grails-1.3.2 and gorm-hbase-0.2.4 plugin. Sometimes i need to change tables structure(add new tables or columns).[详细]
2023-04-02 03:15 分类:问答How to specify tab as a record separator for hadoop input text file?
The input file to my hadoop M/R job is a text file in which the records are separated by tab character \'\\t\' instead of newline \'\\n\'. How can I instruct hadoop to split using the tab character as[详细]
2023-04-01 23:32 分类:问答Running Hadoop MapReduce, is it possible to call external executables outside of HDFS
Within my mapper I\'d like to call external software installed on the worker node outside of the HDFS.Is this possible?What is the best way to do this?[详细]
2023-04-01 23:07 分类:问答