I need to put a little project together for myself, and I need some functionality to download a page for offline viewing. Is there a library that will download a given page and its embedded images, and edit the img tags to reflect the local locations of the images.
I know there are a lot of website downloaders out t开发者_运维知识库here, but I cant find something that i can use directly in my code.
I have some basic scripts done in python, so Python is very welcome. but pretty much any language will do.
Yes, BeautifulSoup + python urllib module
You're looking for BeautifulSoup.
How about python web crawler? http://code.google.com/p/pywebcrawler/
OR, Anemone (ruby)? http://anemone.rubyforge.org/
simplest solution I can think of.
wget -p example.com
精彩评论