Apache PDFBox 1.8.11, a Java Library for Working with PDF, Released

Posted by jopen 8 years ago | 11K views

Apache PDFBox 1.8.11 has been released. This is an incremental bug-fix release that includes a large number of bug fixes and improvements.

It is now available for download:

http://pdfbox.apache.org/download.cgi

Main changes:

Bug fixes

[PDFBOX-962] - All sort of Problems when importing Xfdf files into PDFs -> damaged pdfs and NPEs
[PDFBOX-2508] - Text extraction getting zero font height, bad widths, and ? for text in this PDF with Type 3 Fonts
[PDFBOX-2693] - OutOfMemoryError at org.apache.fontbox.cff.IndexData.initData(IndexData.java:95)
[PDFBOX-2816] - PDFBox makes disallowed changes when signing a signed document
[PDFBOX-2845] - Error parsing PDF
[PDFBOX-2901] - High CPU load and OutOfMemoryError when rendering shading
[PDFBOX-2903] - ClassCastException at PDFParser:667
[PDFBOX-2909] - NullPointerException when rendering shading with no function
[PDFBOX-2911] - Merge does not close input streams
[PDFBOX-2914] - java.lang.NegativeArraySizeException in org.apache.pdfbox.pdmodel.graphics.color.PDDeviceGray.createColorModel
[PDFBOX-2916] - ArrayIndexOutOfBoundsException in CmapSubtable.processSubtype6
[PDFBOX-2923] - CFFParser parser treats CIDFont's charset data as SID
[PDFBOX-2924] - ClassCastException when doing PDFSplit
[PDFBOX-2925] - EmptyStackException in PDFStreamEngine.getColorSpaces
[PDFBOX-2935] - Problem while extracting font from PDFontSetting (used in PDExtendedGraphicsState)
[PDFBOX-2940] - ClassCastException in FDF export
[PDFBOX-2958] - TIFF-Predictor with 1 bit per component not supported
[PDFBOX-2964] - Checkbox getOnValue() throws NPE
[PDFBOX-2965] - NPE in PDAcroForm.getField() if the /Fields entry is missing
[PDFBOX-2976] - java.util.zip.DataFormatException: incorrect data check
[PDFBOX-2982] - fix ClassCastExceptions in operator methods
[PDFBOX-2985] - Potential NPE in PDMarkedContent#getMCID()
[PDFBOX-2986] - Potential resource leak in TTFParser's use of RAFDataStream
[PDFBOX-2987] - NPE in PDTrueTypeFont.extractCMaps
[PDFBOX-2988] - Infinite recursion in ExtractImages 1.8.11-SNAPSHOT
[PDFBOX-2989] - LZW decode filter shouldn't throw IndexOutOfBoundsException
[PDFBOX-2990] - PDDocument.load fails to load a PDF document.
[PDFBOX-2996] - StackOverflow in Quicksort
[PDFBOX-3002] - PDF files not closed after load fails
[PDFBOX-3022] - Maven repos should be https
[PDFBOX-3034] - Newly created XRef stream has direct root objects
[PDFBOX-3035] - Files with missing xref table must fail
[PDFBOX-3041] - Wrong default type in Xref stream W0 element
[PDFBOX-3087] - Metadata stream should not be compressed
[PDFBOX-3097] - ClassCastException in Axial / Radial shading when object reference in extends
[PDFBOX-3110] - Extract by beads doesn't work
[PDFBOX-3114] - Visible signatures in different pages changes previous revision
[PDFBOX-3153] - Direct JPEG extraction results in invalid images in 2.0.0 releases.
[PDFBOX-3155] - org.apache.pdfbox.util.PDFTextStripper class initialization throws NumberFormatException with recent Verona-enabled Java 9 JVMs
[PDFBOX-3157] - PDOutputIntent has N=3 (RGB) hardcoded
[PDFBOX-3173] - Signature dictionary is not decrypted in encrypted files
[PDFBOX-3190] - Links don't work in firefox
[PDFBOX-3193] - New NPE in PDFBox 1.8.11-rc1 in Acroform PDCheckbox's isChecked()
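
PDFBOX-3155 in the list above concerns a NumberFormatException on JVMs that use the newer ("Verona") version-string scheme, where `java.version` can look like `9` or `9-ea` rather than `1.8.0_66`. This announcement does not show how PDFBox resolved it. Purely as an illustration, the sketch below shows one way to read the feature release number from either format without throwing. The class and method names (`JavaVersionCheck`, `featureVersion`) are hypothetical and are not part of PDFBox.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not PDFBox code: tolerant parsing of java.version strings.
class JavaVersionCheck {
    // A leading number, optionally followed by ".<number>"; the rest is ignored.
    private static final Pattern LEADING =
            Pattern.compile("^(\\d+)(?:\\.(\\d+))?");

    /**
     * Returns the feature release number for a version string,
     * or -1 if the string does not start with a number.
     */
    static int featureVersion(String version) {
        if (version == null) {
            return -1;
        }
        Matcher m = LEADING.matcher(version.trim());
        if (!m.find()) {
            return -1;
        }
        try {
            int first = Integer.parseInt(m.group(1));
            // Legacy scheme: "1.x..." denotes Java x (for example "1.6.0_45").
            if (first == 1 && m.group(2) != null) {
                return Integer.parseInt(m.group(2));
            }
            // Newer scheme: the first number is the feature release ("9-ea").
            return first;
        } catch (NumberFormatException e) {
            // The digits did not fit into an int; treat as unknown.
            return -1;
        }
    }

    public static void main(String[] args) {
        String v = System.getProperty("java.version");
        System.out.println(v + " -> feature version " + featureVersion(v));
    }
}
```

Returning a sentinel value instead of letting the exception escape matters most when the check runs in a static initializer, as in the PDFTextStripper case described by the issue title, because a failure there prevents the class from loading at all.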

Improvements

[PDFBOX-1621] - Add setModifiedDate(Calendar c) to PDAnnotation
[PDFBOX-2891] - Use animal sniffer maven plugin to detect non java 5 api usage within the 1.8 branch
[PDFBOX-2952] - Log statement on level 'severe' while nothing else indicates error
[PDFBOX-2962] - Handle TIFF predictor for bpc 2 and 4 / optimize existing predictor code
[PDFBOX-3007] - Preflight cookbook example is inefficient
[PDFBOX-3176] - Add a removeRegion method in PDFTextStripperByArea class

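PDFBox is a Java library for working with PDF documents. It supports creating and manipulating PDF documents and extracting their content, and it also includes several command-line utilities.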

Key features include:

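  • Extract text from PDF documents
  • Merge PDF documents
  • Encrypt and decrypt PDF documents
  • Integration with the Lucene search engine
  • Fill in PDF/XFDF form data
  • Create PDF documents from text files
  • Create images from PDF pages
  • Print PDF documents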

    Source: http://www.oschina.net//news/69983/pdfbox-1-8-11
