[MAGNOLIA-2953] How to active big size repository Created: 24/Nov/09  Updated: 24/Nov/09  Resolved: 24/Nov/09

Status: Closed
Project: Magnolia
Component/s: None
Affects Version/s: 3.6.5
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: fan abii Assignee: Boris Kraft
Resolution: Won't Fix Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

unix


Template:
Acceptance criteria:
Empty
Task DoD:
[ ]* Doc/release notes changes? Comment present?
[ ]* Downstream builds green?
[ ]* Solution information and context easily available?
[ ]* Tests
[ ]* FixVersion filled and not yet released
[ ]  Architecture Decision Record (ADR)
Bug DoR:
[ ]* Steps to reproduce, expected, and actual results filled
[ ]* Affected version filled
Date of First Response:

 Description   

I am developing a project by Magnolia.When I want to active a page , I meet a problem
1.There is many record to active from magnoliaAuthor to magnoliaPublic .And Every record has its attach info . I use binary to store the attach.
The problem is If the size of document is very big (eg:300MB), It will occur OutOfMemory when you want to active it from author to public . How to fix it ?
2.Contintue the above issue , you can say I can solve it by adding more memory size for this application ,for example 2G. But when you want to active some file just like pdf . it will still occur pdf exact exception as below :
WARN org.apache.jackrabbit.extractor.PdfTextExtractor PdfTextExtractor.java(extractText:91) 20.11.2009 15:48:26 Failed to extract PDF text content
java.io.IOException: Unknown encoding for 'UniGB-UCS2-H'
at org.pdfbox.encoding.EncodingManager.getEncoding(EncodingManager.java:83)
at org.pdfbox.pdmodel.font.PDFont.getEncoding(PDFont.java:614)
at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:463)
at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:327)
at org.pdfbox.util.operator.ShowText.process(ShowText.java:63)
at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:487)
at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:467)
at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202)
at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:156)
at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:351)
at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:267)
at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:219)
at org.apache.jackrabbit.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:75)
at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob$1.call(TextExtractorJob.java:91)
at EDU.oswego.cs.dl.util.concurrent.FutureResult$1.run(Unknown Source)
at org.apache.jackrabbit.core.query.lucene.TextExtractorJob.run(TextExtractorJob.java:170)
at EDU.oswego.cs.dl.util.concurrent.PooledExecutor$Worker.run(Unknown Source)
at java.lang.Thread.run(Thread.java:619)

How to fix it ??



 Comments   
Comment by Zdenek Skodik [ 24/Nov/09 ]

Hi Fan,

this project is dedicated for reporting bugs, improvements etc.
But the issue you've described is a support issue rather then anything else.
Unfortunately until you'll sign the enterprise license you haven't access to support,
but you can subscribe yourself and ask at user list where a lot of experience users contribute.

Best regards,
Zdenek

Generated at Mon Feb 12 03:41:44 CET 2024 using Jira 9.4.2#940002-sha1:46d1a51de284217efdcb32434eab47a99af2938b.