robinhowlett | 8 years ago
With PDFBox I was able to work with the content at a very low level (character by character). For instance, when building a String I would insert a pipe character whenever the distance between adjacent characters was greater than the width of the space character, and then detect those pipes later when mapping the text to a particular field.
See the convertToText() method for an example: https://github.com/robinhowlett/chart-parser/blob/master/src...
and https://github.com/robinhowlett/chart-parser/blob/f8d651e9a1... for where I used this technique
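A rough sketch of the idea, for anyone who doesn't want to click through. This is not the actual convertToText() code: GapDelimiter, Glyph and convert are hypothetical names made up for illustration, and the glyph positions are plain numbers rather than PDFBox objects. It only assumes each character has a left edge and a width.

```java
import java.util.List;

final class GapDelimiter {

    /** Hypothetical stand-in for a positioned character (not a PDFBox type). */
    static final class Glyph {
        final char ch;
        final float x;     // left edge of the character on the line
        final float width; // horizontal width of the character

        Glyph(char ch, float x, float width) {
            this.ch = ch;
            this.x = x;
            this.width = width;
        }
    }

    /**
     * Builds a String from one line of glyphs (assumed sorted left to right),
     * inserting '|' wherever the gap between two adjacent characters is
     * strictly greater than spaceWidth.
     */
    static String convert(List<Glyph> line, float spaceWidth) {
        StringBuilder sb = new StringBuilder();
        Glyph prev = null;
        for (Glyph g : line) {
            if (prev != null) {
                // Distance from the right edge of the previous character
                // to the left edge of the current one.
                float gap = g.x - (prev.x + prev.width);
                if (gap > spaceWidth) {
                    sb.append('|');
                }
            }
            sb.append(g.ch);
            prev = g;
        }
        return sb.toString();
    }
}
```

With PDFBox the positions would presumably come from TextPosition (getX(), getWidth(), getWidthOfSpace()), and the resulting String can then be split on the pipe to pull out individual fields.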
tcho | 8 years ago