开发者

How to upload a huge(GBs) file to a webserver [closed]

开发者 https://www.devze.com 2022-12-24 05:16 出处:网络
Closed. This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this
Closed. This question needs to be more focused. It is not currently accepting answers.

Want to improve this question? Update the question so it focuses on one problem only by editing this post.

Closed 4 years ago.

Improve this question

I am creating an applicatio开发者_运维百科n where I want to upload huge files. Following is a little description of what this application tries to achieve:

  1. Create a vmdk file from user's physical machine(using Vmware Converter tool, vmdk files can be GBs in size).
  2. Upload this vmdk file to the remote server
  3. now the purpose of having vmdk file on remote server is accessibility.
  4. i.e. a user away from his physical machine, can later on login via a webconsole, and instantiate a virtual machine from this vmdk on the remote server

I think this makes the situation different from normal file uploads(10-20MB file uploads).

rsync/scp/sftp might help, but.. Would this be possible using a web interface?

If not, then do I need to create a separate client for the end user, to convert and upload his files efficiently?

Any help is appreciated..


Use a file transfer protocol for this, not HTTP. You need a protocol that can restart the transfer in the middle in case the connection breaks.


BTW, I don't mean to use FTP.


I'm not an expert on all the current file transfer protocols (I've been an FTP expert, which is why I recommend against it).

However, in this situation, I think you're off-base in assuming you need transparency. All the users of this system will already have the VMWare Converter software on their machine. I see no reason they couldn't also have a small program of yours that will do the actual upload. If there's an API to the Converter software, then your program could automate the entire process - they'd run your program before they go home for the night, your program would convert to the vmdk, then upload it.

Exactly which protocol to use, I don't know. That might take some experimentation. However, if the use of the protocol is embedded within your small application and in the service, then your users will not need to know which protocols you're experimenting with. You'll be able to change them as you learn more, especially if you distribute your small program in a form that allows auto-update.


If you insist on using a web interface for this, the only way to pull it off is with something similar to a signed Java applet (I can't speak to Flash or other similar technologies, but I'm sure they're similarly capable).

Once you've crossed this threshold of going to an applet-like control, then you have far more freedom about what and how you can do things.

There's nothing wrong with HTTP per se for uploading files, it's just that the generic browser is a crummy client for it (no restartability, as mentioned, is but one limitation).

But with an applet you can select any protocol you want, you can throttle uploads so as to not saturate the clients connection, you can restart, send pieces, do checksums, whatever.

You don't need an entire webpage devoted to this, it can be a small component. It can even be an invisible component (and fired via JS). But the key factor is that it has to be a SIGNED component. An unsigned component can't interact with the users file system, so you'll need to get the component signed. It can be your own cert, etc. It follows much of the similar mechanics as normal web certificates.

Obviously the client browser will need to support your applet tech as well.


Rsync would be ideal if you can find a host that supports it.
It can restart easily, retransfer only changed parts of a file if that's useful to you and has built in options to use ssh, compression etc.

It can also confirm that the remote copy matches the local file without transferring very much data


I would run parallel FTP streams to speed up the process....

0

精彩评论

暂无评论...
验证码 换一张
取 消