hadoop-streaming
hadoop-streaming : writing output to different files
Here is the scenario Reducer1 / Mapper - - Reducer2 \\ ReducerN In reducer I want to write the data on different files, lets say the reducer looks like[详细]
2023-04-12 13:42 分类:问答copying to and from hdfs within Hadoop Streaming
I asked a similar question to this earlier, but after doing some exploring, I have a better understanding of what\'s going on, but i\'d like to see if other people have alternative solutions to my app[详细]
2023-04-11 13:46 分类:问答Python Streaming : how to reduce to multiple outputs?(its possible with Java though)
I read Hadoop in Action and found that in Java using MultipleOutputFormat and MultipleOutputs classes we can reduce the data to multiple files but what I am not sure is how to achieve the same thing u[详细]
2023-04-10 01:08 分类:问答Write to different files using hadoop streaming
I\'m currently processing about 300 GB of log files on a 10 servers hadoop cluster. My data is being saved in folders named YYMMDD so each day can be accessed quickly.[详细]
2023-04-09 04:16 分类:问答hadoop streaming ensuring one key per reducer
I have a mapper that, while processing data, classifies output into 3 different types (type is the output key). My goal is to create 3 different csv files via the reducers, each with all of the data f[详细]
2023-04-04 22:00 分类:问答Hadoop Streaming Problems
I ran into these issues while using Hadoop Streaming. I\'m writing code in python 1) Aggregate library package[详细]
2023-03-25 11:53 分类:问答Hadoop Streaming Multiline Input
I\'m using Dumbo for some Hadoop Streaming jobs.I have a bunch of JSON dictionaries each containing an article (multiline text) and some meta data.I know Hadoop performs best when give large files, so[详细]
2023-01-08 22:35 分类:问答Managing dependencies with Hadoop Streaming?
I have a quick Hadoop Streaming question. If I\'m using Python streaming and I have Python packages that my mappers/reducers require but aren\'t installed by default do I need to install those on all[详细]
2022-12-31 18:41 分类:问答Ruby on Rails for web applications, Django for webpages?
Days ago I read something like \"Ruby on Rails is for web applications, Django is for standard webpages\". Is that true? 开发者_运维知识库[详细]
2022-12-18 17:58 分类:问答