apache-pig
How to Get Pig to Work with lzo Files?
So, I've seen a couple of tutorials for this online, but each seems to say something different. Also, none of them seems to specify whether you're trying to get things to work on a rem...
2023-04-01 05:05 Category: Q&A
Process entire files in Hadoop using Python code (preferably in Dumbo)
It seems like a very common use case, yet it is hard to do in Hadoop (it is possible with the WholeFileRecordReader class).
2023-04-01 00:39 Category: Q&A
Using IN clause with Pig FILTER
Does Pig support an IN clause? filtered = FILTER bba BY reason not in ('a','b','c','d');
2023-03-30 04:47 Category: Q&A
What are some approaches to run multiple Pig scripts sequentially?
I need to run some Pig scripts sequentially in Hadoop. They must be run separately. Any suggestions? update...
2023-03-22 18:28 Category: Q&A
How to call a Perl script from Pig?
Does anyone know a way to call a Perl script from a Pig script? I also want to know how to call Pig from Perl.
2023-03-21 02:34 Category: Q&A
Apache Pig command
Just wondering, what does the following command mean? It is so hard to google topics related to Pig:
2023-03-20 22:12 Category: Q&A
Adding values using Pig
I have a list of names and counts for males and females. It looks like this: ABEL3289882
2023-03-20 20:14 Category: Q&A
Loading from mysqldump with Pig
I have a mysqldump of the format: INSERT INTO `MY_TABLE` VALUES (893024968,'342903068923468','o03gj8ip234qgj9u23q59u','testing123','HTTP','1','4213883b49b74d3eb9bd57b7','blahblash','20...
2023-03-20 18:48 Category: Q&A
How to sort (order by) big data with Hive efficiently?
I want to sort a big dataset efficiently (i.e. with a custom partitioner, as described here: How does the MapReduce sort algorithm work?), but I want to do it with Hive.
2023-03-19 10:24 Category: Q&A
Ran Pig on Hadoop, could not find the result
I ran a Pig script on a Hadoop cluster. It completed successfully, but I cannot find the result files. Here is what it said:
2023-03-19 05:55 Category: Q&A