RFCs (http://www.ietf.org/rfc.html) are usually published as text files.
- Are there any other formats, which would make parsing the RFC content easier?
- Are there any parsers for the widely used RFC text documents?
A limited number of RFCs are offered as XML at http://xml.resource.org/public/rfc/xml/
Also, you could merge the text data using BibXML from http://xml.resource.org/public/rfc/bibxml/
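For the RFCs that do exist as XML, a rough sketch of pulling out the section outline with Python's standard library might look like the following. The sample document is a made-up, heavily trimmed stand-in, and it assumes the older xml2rfc style in which each <section> carries a title attribute; real files may differ, so treat the element and attribute names as assumptions to check against the actual XML.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed sample in the xml2rfc style (not a real RFC).
SAMPLE_RFC_XML = """<rfc number="9999">
  <front>
    <title>An Example Protocol</title>
  </front>
  <middle>
    <section title="Introduction">
      <t>This document describes an example.</t>
    </section>
    <section title="Message Format">
      <t>Messages are lines of text.</t>
      <section title="Headers">
        <t>Each header is a name and a value.</t>
      </section>
    </section>
  </middle>
</rfc>"""


def outline(xml_text):
    """Return (depth, title) pairs for every <section>, in document order."""
    root = ET.fromstring(xml_text)
    result = []

    def walk(element, depth):
        for child in element:
            if child.tag == "section":
                # Assumption: the title lives in a "title" attribute.
                result.append((depth, child.get("title", "")))
                walk(child, depth + 1)
            else:
                walk(child, depth)

    walk(root, 0)
    return result


if __name__ == "__main__":
    for depth, title in outline(SAMPLE_RFC_XML):
        print("  " * depth + title)
```

The same walk could collect the <t> paragraphs per section instead of just the titles, which is usually what makes the XML form easier to work with than the plain text.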
The IETF maintains minimally marked-up RFCs in HTML, for example:
https://www.rfc-editor.org/rfc/rfc2616.html
but the markup consists mostly of anchors that implement a table of contents, and the main body is mostly wrapped in <pre> ... </pre>. Nevertheless, it might be possible to do some meaningful parsing on those RFCs.
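As an illustration of what that parsing could look like, here is a minimal sketch using Python's built-in html.parser. It collects the anchor names and the text inside <pre> blocks. The sample HTML is invented to resemble the structure described above (the anchor names such as section-1 are an assumption, not taken from a real page), and the class and function names are hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical sample resembling a minimally marked-up RFC page.
SAMPLE_HTML = """<html><body>
<pre>
<a name="section-1">1</a>.  Introduction

   This is body text.
</pre>
<pre>
<a name="section-2">2</a>.  Terminology
</pre>
</body></html>"""


class RfcHtmlExtractor(HTMLParser):
    """Collect anchor names and the text found inside <pre> blocks."""

    def __init__(self):
        super().__init__()
        self.anchors = []
        self.pre_chunks = []
        self._pre_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "pre":
            self._pre_depth += 1
        elif tag == "a":
            attributes = dict(attrs)
            # Assumption: anchors are identified by "name" or "id".
            name = attributes.get("name") or attributes.get("id")
            if name:
                self.anchors.append(name)

    def handle_endtag(self, tag):
        if tag == "pre" and self._pre_depth > 0:
            self._pre_depth -= 1

    def handle_data(self, data):
        # Keep only text that sits inside a <pre> block.
        if self._pre_depth > 0:
            self.pre_chunks.append(data)


def extract(html_text):
    """Return (anchor_names, preformatted_text) for an HTML string."""
    parser = RfcHtmlExtractor()
    parser.feed(html_text)
    parser.close()  # flush any buffered text
    return parser.anchors, "".join(parser.pre_chunks)


if __name__ == "__main__":
    anchors, text = extract(SAMPLE_HTML)
    print(anchors)
    print(text)
```

This only recovers the anchors and the raw preformatted text; splitting that text into sections would still need the same heuristics as parsing the plain-text RFC.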
W3C has some HTMLized RFCs, for example:
http://www.w3.org/Protocols/rfc2616/rfc2616.html
in which the markup is somewhat richer in its semantics and so perhaps more amenable to parsing.