[MAGNOLIA-1158] indexing pdf: use latest PDFBox Created: 24/Oct/06  Updated: 23/Jan/13  Resolved: 24/Oct/06

Status: Closed
Project: Magnolia
Component/s: None
Affects Version/s: 3.0 RC3
Fix Version/s: 3.0 RC4

Type: Bug Priority: Major
Reporter: Philipp Bärfuss Assignee: Sameer Charles
Resolution: Fixed Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Template:
Acceptance criteria:
Empty
Task DoD:
[ ]* Doc/release notes changes? Comment present?
[ ]* Downstream builds green?
[ ]* Solution information and context easily available?
[ ]* Tests
[ ]* FixVersion filled and not yet released
[ ]  Architecture Decision Record (ADR)
Bug DoR:
[ ]* Steps to reproduce, expected, and actual results filled
[ ]* Affected version filled
Date of First Response:

 Description   

The following exception is thrown by PDFBox when using the version used by jackrabbit indexers. I hope we can just change the version in our pom to use the latest PDFBox together with the indexers

20.10.2006 18:29:40 exception initializing reader
org.apache.jackrabbit.core.query.PdfTextFilter$1: java.io.IOException:
Error: Expected hex number, actual=' 2'
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at java.lang.ref.Finalizer.invokeFinalizeMethod(Native Method)
at java.lang.ref.Finalizer.runFinalizer(Finalizer.java:83)
at java.lang.ref.Finalizer.access$100(Finalizer.java:14)
at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:160)



 Comments   
Comment by Sameer Charles [ 24/Oct/06 ]

on svn

Generated at Mon Feb 12 03:24:13 CET 2024 using Jira 9.4.2#940002-sha1:46d1a51de284217efdcb32434eab47a99af2938b.