I have an issue where I'm trying to open a pdf that was coded with Quark 8.51 and use itext to extract the text from the document, but when it开发者_开发知识库 opens there is just a long bunch of gibberish symbols and nonsensical words. Does anyone have any suggestions?
Have you asked on the IText mailing list or tried any other extraction libraries like jpedal or PdfBox?
if are trying to read anything other then just plain text it will not work. Something else that could be causing the problem is encoding
精彩评论