[MGNLEESOLR-172] Exclude dependency on tika-parsers 1.x Created: 22/Sep/22  Updated: 24/Feb/23  Resolved: 03/Nov/22

Status: Closed
Project: Solr Search Provider
Component/s: None
Affects Version/s: 6.1
Fix Version/s: 6.1.1, 6.1.2

Type: Bug Priority: Neutral
Reporter: Jonathan Ayala Assignee: Federico Grilli
Resolution: Fixed Votes: 5
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Attachments: PNG File Screenshot 2022-09-22 at 13.16.55.png     PNG File Screenshot 2022-10-19 at 15.01.27.png    
Issue Links:
Problem/Incident
causality
documentation
to be documented by DOCU-2594 Resolving runtime conflicts related t... Closed
relation
is related to BUILD-684 Update JackRabbit to version 2.20.6 Closed
Template:
Acceptance criteria:
Empty
Task DoD:
[X]* Doc/release notes changes? Comment present?
[X]* Downstream builds green?
[X]* Solution information and context easily available?
[X]* Tests
[X]* FixVersion filled and not yet released
[ ]  Architecture Decision Record (ADR)
Bug DoR:
[X]* Steps to reproduce, expected, and actual results filled
[X]* Affected version filled
Release notes required:
Yes
Date of First Response:
Team: Foundation

 Description   

In Magnolia 6.2.23, the Tika libraries were updated (BUILD-684). However, magnolia-content-indexer still uses the old Tika version, which results in a conflict.

The Tika dependency in magnolia-content-indexer should be updated accordingly.


Dev Notes:

Due to an issue with one release of Magnolia's own crawler4j fork (a third-party library no longer maintained by its authors), it is recommended to use version 6.1.2 instead of 6.1.1.

Solr 6.1.2 is already available on Nexus.



 Comments   
Comment by Federico Grilli [ 19/Oct/22 ]

Tika artifacts are inherited via JackRabbit. Magnolia 6.2.23 updated JR to a new minor version, 2.20.6, in which Tika was bumped from version 1.28.x to version 2.4.1.
In practice, the tika-parsers-1.28.x.jar artifact has been replaced by several individual parser artifacts, which in Maven are grouped under the artifactId tika-parsers-standard-package.
Here is the relevant diff between Magnolia 6.2.22 and 6.2.23

The older tika-parsers-1.28.4.jar artifact should simply be removed when updating to Magnolia 6.2.23+. This should ensure that no runtime conflict arises.
This should probably be better documented in our release notes - CC mdrapela (see also my comment at BUILD-684).

As far as the Maven dependency hierarchy goes, Magnolia BOM 6.2.23+ manages the correct version and artifactId, so that should be no problem.

Hope this helps. 

Comment by Federico Grilli [ 28/Oct/22 ]

Reopening, as it turned out that the incompatible tika-parsers version pulled in by crawler4j still ends up in a Magnolia webapp, despite the correct Tika version being managed in Magnolia's BOM.
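
For projects still on the affected 6.1 release, a possible interim workaround is to exclude the transitive tika-parsers 1.x artifact in the webapp POM. The fragment below is only a sketch: the groupId info.magnolia.solr is an assumption and should be checked against the coordinates already used in the project, while org.apache.tika:tika-parsers is the standard coordinate of the Tika 1.x parsers artifact.

```xml
<!-- Sketch only: the groupId of magnolia-content-indexer is an assumption;
     use the coordinates already declared in your own POM. -->
<dependency>
  <groupId>info.magnolia.solr</groupId>
  <artifactId>magnolia-content-indexer</artifactId>
  <version>6.1</version>
  <exclusions>
    <!-- Drop the Tika 1.x parsers inherited transitively via crawler4j,
         so that only the Tika 2.x artifacts managed by the Magnolia BOM remain. -->
    <exclusion>
      <groupId>org.apache.tika</groupId>
      <artifactId>tika-parsers</artifactId>
    </exclusion>
  </exclusions>
</dependency>
```

Running mvn dependency:tree -Dincludes=org.apache.tika afterwards should show whether any tika-parsers 1.x artifact is still resolved for the webapp.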

Generated at Mon Feb 12 11:00:49 CET 2024 using Jira 9.4.2#940002-sha1:46d1a51de284217efdcb32434eab47a99af2938b.