pdfbox
How to create a PDF document in languages of the Unicode character set using third-party fonts
I'm using PDFBox and iText to create a simple (just paragraphs) PDF document from various languages. Something like: [more]
2023-03-09 14:59 Category: Q&A
Issue with Apache PDFBox: java.lang.IndexOutOfBoundsException: Index: 2, Size: 2
I am using Apache PDFBox 1.5 for extracting text from PDFs. Here is the code which is being used. This seems to work fine for some PDFs, but it failed for one PDF with the below error. Let me know[more]
2023-02-20 02:35 Category: Q&A
Using PDFBox to write UTF-8 encoded strings to a PDF [duplicate]
This question already has an answer here: Apache PDFBox: Can I set font other than those present in PDType1Font[more]
2023-02-19 08:16 Category: Q&A
PDF extraction issue with Apache PDFBox 1.3.1
I am facing an issue while extracting data from a PDF using Apache PDFBox. With PDFBox version 1.1, I was able to extract the data properly. But the same code is giving different output with version 1[more]
2023-02-16 19:33 Category: Q&A
Extracting paragraphs from a PDF
I'm doing topic modelling on a PDF e-book and need to extract text paragraph by paragraph. For this I use Apache PDFBox, which efficiently extracts text from PDFs.[more]
2023-02-16 17:50 Category: Q&A
Java PDFBox text encoding
I am trying to export some data from my Java application to a PDF file. I decided to use the PDFBox library, but I realized that I could not get the Greek characters displayed properly in the PDF file. Is th[more]
2023-02-16 16:40 Category: Q&A
Any other way to read/write a PDF file from a Java application, other than iText or PDFBox?
I tried iText and PDFBox. It is not simple; we need to understand a lot of code for this.[more]
2023-02-14 15:11 Category: Q&A
PDFBox - Coordinate System
I would like to accomplish the following. I have a set of PDF files; first I would like to check the origin of the coordinate system. If the origin of the coordinate system for the PDF is not up[more]
2023-02-08 15:14 Category: Q&A
Preserve "long" spaces in PDFBox text extraction
I am using PDFBox to extract text from a PDF. The PDF has a tabular structure, which is quite simple, and the columns are very widely spaced from each other[more]
2023-02-03 09:29 Category: Q&A
How to read an empty cell in a PDF file in ASP.NET
I am able to read a PDF file using PDFBox in my ASP.NET application, but it does not add a space for an empty cell in a table. How can I read empty fields from a PDF file using PDFBox in C#? Is there an[more]
2023-01-30 12:54 Category: Q&A