We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this questionSafari 5 has gotten a new feature: The Reader. It shows a simplified version of an article page with just the article itself (and not all the clutter around). It also merges multiple pages (if the article is split across multiple pages) to a single one.
This is an extremely useful feature and I wou开发者_C百科ld like to port it over to Chrome.
I was searching for Readers code in the WebKit trunk (e.g. http://svn.webkit.org/repository/webkit/trunk/) but I couldn't find it.
Any hint where I can find it?
Safari Reader borrows from the Readability project, according to an article in the register. Readability implements a similar user experience, but does so in a cross browser fashion (using bookmarklets)
The project site is probably a good place to start:
http://code.google.com/p/arc90labs-readability/
I hope this helps!
BTW - I had links to several sites, including the demo site, the original artcile in the register, but stackoverflow won't let n00bs post more than one link. I will edit to add those once I have some rep!
UI-level features are generally part of the Safari codebase, which as Ivo said is not open-source. The WebKit nightly builds aren't open-source either, they are essentially versions of Safari that use an embedded, trunk copy of the engine instead of the one that shipped with the OS.
I'm not sure where safari's webreader code is. but there is a tool called boilerpipe that does something very similar
A good review on similar tools available is given on Tomaz Kovacic's blog: http://tomazkovacic.com/blog/122/evaluating-text-extraction-algorithms/
It contains comparison of text extraction tools (including boilerpipe, reaability and several others) on two sets of articles. Also there is a feature wise comparison in other article on the same blog.
精彩评论