What tools would you recommend for doing such task?
I'd use RSS or JSON on the first site and make a feed of all the changes. If I can't do that due to ownership, I'd just parse the site with BeautifulSoup and cron every few minutes to record changes via that.
I'd run the second site using the Flask web framework and Python, as it's easy for me to churn out things quickly with Python.
You could also implement Pubsubhubbub for near-realtime notifications. It's based on Atom.
精彩评论