How can I get the highlighted words from a PDF file?

Developer https://www.devze.com 2023-01-06 12:24 Source: web
I am developing a new program in which the user can highlight words in a PDF file. I then want to process the file to get a list of the highlighted words together with their positions.

How can I do that in Java?

Thanks in advance.


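PDF is derived from PostScript's imaging model, and the text in a page is stored as positioned drawing operations rather than as a logical sequence of words, which makes it difficult to process. I doubt there's an easy way.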

Take a look at the list of open-source Java PDF libraries at http://java-source.net/open-source/pdf-libraries, but be aware that you might run into some difficulty.

Also, read http://partners.adobe.com/public/developer/en/pdf/HighlightFileFormat.pdf for the specs of the highlight format. Depending on what "place" information you need, that might be enough.


How are you displaying the PDF? If you are displaying each page as a rendered image, you just need the word coordinates. A library such as PDFBox or JPedal, or possibly iText, can give you those.
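To make the coordinate idea concrete, here is a minimal sketch in plain Java of the matching step only. It assumes you have already obtained, from whichever PDF library you choose, a bounding box for every word and a rectangle for every highlighted region (in a PDF, highlights are normally stored as annotations of subtype Highlight whose QuadPoints describe the marked area). The names `HighlightMatcher`, `Word`, `Highlight` and the `minOverlap` threshold are hypothetical and exist only for this illustration; they are not part of any library.

```java
import java.awt.geom.Rectangle2D;
import java.util.ArrayList;
import java.util.List;

public class HighlightMatcher {

    /** Hypothetical holder: a word, its page number and its bounding box. */
    record Word(String text, int page, Rectangle2D box) {}

    /** Hypothetical holder: one highlighted rectangle on a page. */
    record Highlight(int page, Rectangle2D area) {}

    /**
     * Returns the words whose bounding box is covered by a highlight on the
     * same page by at least minOverlap (a fraction between 0 and 1).
     */
    static List<Word> highlightedWords(List<Word> words,
                                       List<Highlight> highlights,
                                       double minOverlap) {
        List<Word> result = new ArrayList<>();
        for (Word w : words) {
            for (Highlight h : highlights) {
                if (w.page() == h.page()
                        && overlapFraction(w.box(), h.area()) >= minOverlap) {
                    result.add(w);
                    break; // report each word once, even if two highlights cover it
                }
            }
        }
        return result;
    }

    /** Fraction of the word's box area that lies inside the highlight area. */
    static double overlapFraction(Rectangle2D word, Rectangle2D area) {
        Rectangle2D inter = word.createIntersection(area);
        if (inter.isEmpty()) {
            return 0.0; // no overlap at all
        }
        double wordArea = word.getWidth() * word.getHeight();
        if (wordArea == 0.0) {
            return 0.0; // degenerate word box
        }
        return (inter.getWidth() * inter.getHeight()) / wordArea;
    }

    public static void main(String[] args) {
        // Made-up coordinates standing in for what a PDF library would report.
        List<Word> words = List.of(
                new Word("PDF", 1, new Rectangle2D.Double(10, 10, 30, 12)),
                new Word("files", 1, new Rectangle2D.Double(45, 10, 30, 12)),
                new Word("are", 1, new Rectangle2D.Double(80, 10, 20, 12)));
        List<Highlight> highlights = List.of(
                new Highlight(1, new Rectangle2D.Double(8, 8, 70, 16)));

        for (Word w : highlightedWords(words, highlights, 0.5)) {
            System.out.println(w.text() + " on page " + w.page()
                    + " at (" + w.box().getX() + ", " + w.box().getY() + ")");
        }
    }
}
```

The overlap threshold is there because highlight rectangles rarely line up exactly with word boxes; a value around one half is only a starting guess, and you would tune it for your documents. Also keep in mind that PDF page coordinates usually have their origin at the bottom left, so both sets of rectangles need to be in the same coordinate system before they are compared.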
