开发者

How to get data for BeautifulSoup Xml Parser

开发者 https://www.devze.com 2023-01-04 16:47 出处：网络

I\'m trying to parse an XML file with BeautifulSoup. In all tutorials on the net, the content of the xml is given like

相关专题：xml-parsing

I'm trying to parse an XML file with BeautifulSoup. In all tutorials on the net, the content of the xml is given like

xml = "<doc><tag1>Contents 1<tag2>Contents 2<tag1>Contents 3"
soup = BeautifulStoneSoup(xml)

but I want to give only xml file'开发者_Go百科s path. In mechanize one can use get_data() method but it only works for html files. Any sugestions ?

The BeautifulSoup documentation says that:

"A Beautiful Soup constructor takes an XML or HTML document in the form of a string (or an open file-like object). It parses the document and creates a corresponding data structure in memory."

In the formulation of your question, you use BeautifulStoneSoup, and allthough the online documentation uses strings, the docstring for the constructor reveals that:

"The Soup object is initialized as the 'root tag', and the provided markup (which can be a string or a file-like object) is fed into the underlying parser."