Our state government has opened its transport timetable data. The data is in xml based TransXchange standard format.
The problem is the data files are huge. The sample data file itself is 300 MB.
The good thing is most of the data is redundant and I don't need it for my application. I am wondering what options do开发者_开发百科 I have of inserting/transforming only the data I need into SQL Server?
Thanks.
You need an XML streaming (event based) parser to avoid loading the whole tree into memory. Most languages have several based on the SAX (Simple API for XML) standard.
精彩评论