I am working with data from the Mathematics Genealogy Project. I collect information about students and advisors and run queries on it. To be precise, I crawl all the HTML pages starting from the project's root URL, http://www.genealogy.ams.org/, extract the information I need, and query it. For experimental purposes, I need more data that is available on the web in a similar format. Can anybody suggest good websites that I can crawl for interesting information? Data other than genealogy is also welcome, but it should have at least some hierarchy. Thanks for all your suggestions.
There is a list of such sites at http://en.wikipedia.org/wiki/Academic_genealogy. For instance, http://academictree.org/.
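Whichever site you crawl, the hierarchy can usually be reduced to (advisor, student) pairs once the pages are parsed. Here is a minimal sketch of storing such pairs and querying descendants. The names `build_tree` and `descendants` and the sample pairs are made up for illustration, and it assumes the crawling and HTML parsing have already been done.

```python
from collections import defaultdict, deque


def build_tree(pairs):
    """Map each advisor to the list of their direct students."""
    students_of = defaultdict(list)
    for advisor, student in pairs:
        students_of[advisor].append(student)
    return students_of


def descendants(students_of, root):
    """Return all academic descendants of root in breadth-first order."""
    seen = set()
    order = []
    queue = deque([root])
    while queue:
        person = queue.popleft()
        # .get avoids creating empty entries for people with no students
        for student in students_of.get(person, []):
            if student not in seen:  # a student may have several advisors
                seen.add(student)
                order.append(student)
                queue.append(student)
    return order


# Hypothetical sample data, not taken from any real site
pairs = [("A", "B"), ("A", "C"), ("B", "D"), ("C", "D"), ("D", "E")]
tree = build_tree(pairs)
print(descendants(tree, "A"))
```

The `seen` set matters because genealogy data is a directed graph rather than a strict tree: one student can have two advisors, so the same person may be reachable by more than one path.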