Apache PDFBox 1.8.9 released: a Java library for working with PDF documents
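PDFBox is a Java library for working with PDF documents. It supports creating and manipulating PDF documents and extracting their content, and it also ships with several command-line utilities.
Main features include: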
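- Extracting text from PDFs
- Merging PDF documents
- Encrypting and decrypting PDF documents
- Integration with the Lucene search engine
- Filling in PDF/XFDF form data
- Creating PDF documents from text files
- Creating images from PDF pages
- Printing PDF documents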
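Apache PDFBox 1.8.9 has been released and is now available for download: http://pdfbox.apache.org/downloads.cgi
Release Notes -- Apache PDFBox -- Version 1.8.9
This is primarily a bug-fix release based on the previous version, 1.8.8, and it includes a large number of bug fixes and improvements.
Issue list: https://issues.apache.org/jira/browse/PDFBOX
Bug fixes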
[PDFBOX-159] - Field renaming character set problem
[PDFBOX-785] - Spliting a PDF creates unnecessarily large files
[PDFBOX-1002] - Form field not rendered after being processed by pdfbox-1.1.0, wrong position of same field in pdfbox-1.5.0
[PDFBOX-1618] - Split PDF file to single page files, some files are inflated in size
[PDFBOX-1651] - PDFBox doesn't read the permission bits correct. PDDocument.getCurrentAccessPermission().canPrint() is allways returning true irrespective of the document print permissions
[PDFBOX-1750] - PDTextbox and PDAnnotationWidget are not correct initialized from it's own constructor.
[PDFBOX-1859] - ClassCastException for unknown destination type
[PDFBOX-1874] - PDFTextStripper.isParagraphSeparation(...)
[PDFBOX-1965] - NPE in NonSequentialPDFParser when parseMinimal property is set to true
[PDFBOX-2525] - Overlay: data black & white after import
[PDFBOX-2549] - TIFF-Predictor with 16 bits per component not supported
[PDFBOX-2550] - ClassCastException in PDAnnotation.getColour
[PDFBOX-2552] - Blank rendering when negative page rotation
[PDFBOX-2559] - TTF font cannot be loaded
[PDFBOX-2579] - Exception in thread "main" java.io.IOException: Error: Expected a long type at offset 1029, instead got '12688(Deleted'
[PDFBOX-2582] - Form fields missing entirely or incorrect in PDField list
[PDFBOX-2583] - Error when rendering a PDF with annotations
[PDFBOX-2595] - Pdfbox always sets the second part of documentID to the same value
[PDFBOX-2615] - IllegalArgumentException in PDPageTree constructor: root cannot be null
[PDFBOX-2617] - Group of Button fields treated as a Radio Button group
[PDFBOX-2620] - Support named actions
[PDFBOX-2622] - PDAnnotationLink::getBorderStyle() don't understand external border style
[PDFBOX-2644] - Load FDF document creates Temp file when called with file parameter
[PDFBOX-2653] - Image extraction fails with attached PDF
[PDFBOX-2654] - NullPointerException when reading a GIF file with a transparent color
[PDFBOX-2676] - PDPageLabelRange.setLabelItem() should not allow negative startPage
[PDFBOX-2678] - possible NPE in ExtractText tool of trunk
[PDFBOX-2687] - ClassCastException when trying to get OutputIntents or add to it
[PDFBOX-2698] - PDFToImage IndexOutOfBoundsException
[PDFBOX-2702] - Merging PDFs created using "Nuance PDF Create" not possible
[PDFBOX-2710] - Form fields missing in 1.8.9 version
[PDFBOX-2726] - org.apache.pdfbox.cos.COSArray cannot be cast to org.apache.pdfbox.cos.COSDictionary
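As background for PDFBOX-1651 in the list above: in the PDF standard security handler, user permissions are stored as a 32-bit integer in which bit 3 (counting from 1 at the lowest-order bit) controls printing. The following is only a minimal sketch of that bit test in plain Java. The class `PermissionBits` and its methods are hypothetical names chosen for illustration and are not PDFBox's own implementation.

```java
// Hypothetical illustration, not PDFBox code: tests permission bits as
// numbered in the PDF specification (bit 1 is the lowest-order bit).
class PermissionBits {
    // Bit position of the "print" permission in the PDF specification.
    static final int PRINT_BIT = 3;

    // Returns true if the given 1-based bit is set in the permission value.
    static boolean isPermissionBitOn(int permissions, int bit) {
        return (permissions & (1 << (bit - 1))) != 0;
    }

    // Returns true if the permission value allows printing.
    static boolean canPrint(int permissions) {
        return isPermissionBitOn(permissions, PRINT_BIT);
    }

    public static void main(String[] args) {
        // -4 has every permission bit set; -3904 clears bits 3-6 and 9-12.
        System.out.println(canPrint(-4));
        System.out.println(canPrint(-3904));
    }
}
```

With these inputs, `canPrint(-4)` is true and `canPrint(-3904)` is false, so a check that returns true for both values would show the symptom the issue title describes.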
Improvements
[PDFBOX-1523] - Manifest should support Specification entries
[PDFBOX-2594] - Set default params in JBIG2Filter
[PDFBOX-2619] - XMP dates contain time zone, while document info dates do not, and this isn't detected by preflight
[PDFBOX-2673] - Add output path prefix param in PDFSplit/Splitter