Hadoop
How can I inspect a Hadoop SequenceFile for which I lack full schema information?
I have a compressed Hadoop SequenceFile from a customer which I\'d like to insp开发者_如何学JAVAect. I do not have full schema information at this time (which I\'m working on separately).[详细]
2023-04-08 00:35 分类:问答how to call a pig script within another pig script
I have a file in hdfs with 100 columns, which i want to proces using pig.I want to load this file into a tuple 开发者_如何转开发with columns names in a separate pig script, and reuse this script from[详细]
2023-04-07 23:07 分类:问答Can Hadoop read arbitrary key binary file
It looks like Hadoop MapReduce requires a key value pair structure in the text or binary text. In reality we might have files to be split into chunks to be processed. But the keys may be[详细]
2023-04-07 22:52 分类:问答Map reduce value list order problem
As we know Hadoop groups values with per key and sends them to same reduce task. Suppose I have next lines in file on hdfs.[详细]
2023-04-07 19:39 分类:问答Type mismatch in key from map when replacing Mapper with MultithreadMapper
I\'d like to implement a MultithreadMapper for my MapReduce job. For this I replaced Mapper with MultithreadMapper in a working code.[详细]
2023-04-07 14:28 分类:问答Hadoop java mapper job executing on slave node, directory issue
As part of my Java mapper I have a command executes some standalone code on a local slave node.When I run a code it executes fine, unless it is trying to access some local files in which case I get th[详细]
2023-04-07 11:58 分类:问答The right place for "io.sort.mb" in Hadoop?
I am a bit confused, in the Hadoop cluster setup, in section \"Real-World Cluster Configurations\", an example is given where properties like io.sort.mb & io.sort.factor goes in core-site.xml. But[详细]
2023-04-07 10:59 分类:问答Hadoop streaming example failed Type mismatch in key from map [duplicate]
This question already has answers here: Closed 11 years ago. Possible Duplicate: hadoop-streaming example failed to run - Type mismatch in key from map[详细]
2023-04-07 10:08 分类:问答Difference between String.getBytes() and Bytes.toBytes(String data)
I\'m writing a Hadoop/HBase job. I needed to transfo开发者_高级运维rm a Java String into a byte array. Is there any differences between Java\'s String.getBytes() and Hadoop\'s Bytes.toBytes()?Accordin[详细]
2023-04-07 07:43 分类:问答How to pull data in the Map/Reduce functions?
According to the Hadoop : The Definitive Guide. The new API supports both a “push” and a “pull” style of iteration. In both APIs, key-value record pairs are pushed to the mapper, but i开发者_Go百[详细]
2023-04-06 22:58 分类:问答