hdfs
Moving files in Hadoop using the Java API?
I want to move files around in HDFS using the Java APIs. I cannot figure out a way to do this. The FileSystem class only seems to want to allow moving to and from the local file system.. [Details]
2023-02-21 12:13 Category: Q&A
Hadoop HDFS maximum file size
A colleague of mine thinks that HDFS has no maximum file size, i.e., by partitioning into 128/256 MB chunks any file size can be stored (obviously the HDFS disk has a size and that w [Details]
2023-02-21 02:42 Category: Q&A
How can I troubleshoot this Hadoop filesystem installation error?
I'm trying to install Hadoop on a non-Cloudera Ubuntu test image. Everything seems to have been going well until I ran ./bin/start-all.sh. The name node never comes up so I can't even run a hadoop f [Details]
2023-02-20 03:07 Category: Q&A
HDFS says file is still open, but process writing to it was killed
I'm new to Hadoop and I've spent the past couple of hours trying to google this issue, but I couldn't find anything that helped. My problem is HDFS says the file is still open, even though the process [Details]
2023-02-18 16:54 Category: Q&A
Importing multi-level directories of logs in Hadoop/Pig
We store our logs in S3, and one of our (Pig) queries would grab three different log types. Each log type is in sets of subdirectories based upon type/date. For instance: [Details]
2023-02-16 19:59 Category: Q&A
How can I tell if a Hadoop namenode has already been formatted?
When configuring my Hadoop namenode for the first time, I know I need to run bin/hadoop namenode -format [Details]
2023-02-16 19:58 Category: Q&A
Hadoop for processing very large binary files
I have a system I wish to distribute where I have a number of very large non-splittable binary files I wish to process in a distributed fashion. These are of the order of a couple of hundred GB. F [Details]
2023-02-15 06:18 Category: Q&A
Pass file location as value to Hadoop mapper?
Is it possible to pass the locations of files in HDFS as the value to my mapper so that I can run an executable on them to process them? Yes, you can create a file with file names in the [Details]
2023-02-13 06:15 Category: Q&A
Hadoop, hardware and bioinformatics
We're about to buy new hardware to run our analyses and are wondering if we're making the right decisions. [Details]
2023-02-12 03:37 Category: Q&A
How can I be sure that data is distributed evenly across the Hadoop nodes?
If I copy data from the local system to HDFS, can I be sure that it is distributed evenly across the nodes? [Details]
2023-02-12 02:59 Category: Q&A