I a using apache pdfbox 1.5 for extracting text from pdf's. Here is the code which is being used. This seems to work fine for some pdf's. But it failed for one pdf with the below error. Let me know if anyone has encountered such issue & resolved
java.lang.IndexOutOfBoundsException: Index: 2,Size: 2
at java.util.SubList.rangeCheck(AbstractList.java:864)
at java.util.SubList.get(AbstractList.java:737)
at org.apache.fontbox.cff.CharStringConverter.drawCurve(CharStringConverter.java:415)
at org.apache.fontbox.cff.CharStringConverter.handleType2Command(CharStringConverter.java:277)
at org.apache.fontbox.cff.CharStringConverter.handleCommand(CharStringConverter.java:81)
at org.apache.fontbox.cff.CharStringHandler.handleSequence(CharStringHandler.java:53)
at org.apache.fontbox.cff.CharStringConverter.handleType2Command(CharStringConverter.java:307)
at org.apache.fontbox.cff.CharStringConverter.handleCommand(CharStringConverter.java:81)
at org.apache.fontbox.cff.CharStringHandler.handleSequence(CharStringHandler.java:53)
at org.apache.fontbox.cff.CharStringConverter.convert(CharStringConverter.java:64)
at org.apache.fontbox.cff.CFFFon开发者_如何学编程t$Mapping.toType1Sequence(CFFFont.java:374)
at org.apache.fontbox.cff.AFMFormatter.renderFont(AFMFormatter.java:126)
at org.apache.fontbox.cff.AFMFormatter.printFontMetrics(AFMFormatter.java:64)
at org.apache.fontbox.cff.AFMFormatter.printFont(AFMFormatter.java:57)
at org.apache.fontbox.cff.AFMFormatter.format(AFMFormatter.java:50)
Is it possible to share the PDF file which is creating this issue?
Also, check if the command line version works with the -force
option enabled.
精彩评论