开发者

Performance-effective way to transform XML data represented as Writeable

开发者 https://www.devze.com 2023-02-21 19:05 出处:网络
I\'m working on utility method that allows conversion of XML data into formatted String and before you\'re going to think it\'s a trivial task for javax.xml.transform.Transformer let me explain the sp

I'm working on utility method that allows conversion of XML data into formatted String and before you're going to think it's a trivial task for javax.xml.transform.Transformer let me explain the specific constraints I've faced with.

The input data does not exist at the moment conversion starts. Actually it's represented as groovy.lang.Writeable (javadoc) instance that I could output into any开发者_StackOverflow java.io.Writer instance. Signature of method looks like this:

static String serializeToString(Writable source)

My current solution involves few steps and actually provides expected result:

  1. Create StringWriter, output source there and convert to String
  2. Create javax.xml.transform.stream.StreamSource instance based on this string (using StringReader)
  3. Create new StringWriter instance and wrap it into javax.xml.transform.stream.StreamResult
  4. Perform transformation using instance of javax.xml.transform.Transformer
  5. Convert StringWriter to String

While solution does work I'm not pleased enough with its efficiency. This method will be used really often and I do want to optimize it. What I'd like to avoid is necessity to perform multiple conversions along the line:

  1. From Writeable to String (unformatted)
  2. From String to StreamSource (which means that data will be parsed again)
  3. From StreamSource to String again (formatted)

So the question is whether it's possible to build pipe-like flow which eliminates unnecessary conversions?

UPDATE #1:

To give a little bit more context, I'm converting GPathResult instance to formatted string using StreamingMarkupBuilder.bindNode() method which produces Writable instance. Unfortunately there is no way to specify StreamingMarkupBuilder to produce formatted output.

UPDATE #2:

I did experiment with implementation based on PipedWriter + PipedReader but experiments didn't show much speed gain from this approach. Looks like it's not that critical issue in this case.


Not knowing what you mean exactly by "XML data", but you could think of representing the "Yet-to-be" stuff as a SAXSource directly, thereby by-passing the "to-string" and "parse-string" steps.

0

精彩评论

暂无评论...
验证码 换一张
取 消