  1. Magnolia
  2. MAGNOLIA-2953

How to activate a large repository

Details

    • Bug
    • Resolution: Won't Fix
    • Critical
    • None
    • 3.6.5
    • None
    • None
    • unix

    Description

      I am developing a project with Magnolia. When I try to activate a page, I run into two problems.
      1. There are many records to activate from magnoliaAuthor to magnoliaPublic, and every record has an attachment, which I store as binary data.
      If a document is very large (e.g. 300 MB), activating it from author to public fails with an OutOfMemoryError. How can I fix this?
      2. Continuing from the issue above: you could say that I can solve it by giving the application more memory, for example 2 GB. But even then, activating some files, such as PDFs, still fails with a PDF text extraction exception, shown below:
      WARN org.apache.jackrabbit.extractor.PdfTextExtractor PdfTextExtractor.java(extractText:91) 20.11.2009 15:48:26 Failed to extract PDF text content
      java.io.IOException: Unknown encoding for 'UniGB-UCS2-H'
      at org.pdfbox.encoding.EncodingManager.getEncoding(EncodingManager.java:83)
      at org.pdfbox.pdmodel.font.PDFont.getEncoding(PDFont.java:614)
      at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:463)
      at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:327)
      at org.pdfbox.util.operator.ShowText.process(ShowText.java:63)
      at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:487)
      at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:467)
      at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:202)
      at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:156)
      at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:351)
      at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:267)
      at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:219)
      at org.apache.jackrabbit.extractor.PdfTextExtractor.extractText(PdfTextExtractor.java:75)
      at org.apache.jackrabbit.extractor.CompositeTextExtractor.extractText(CompositeTextExtractor.java:90)
      at org.apache.jackrabbit.core.query.lucene.JackrabbitTextExtractor.extractText(JackrabbitTextExtractor.java:195)
      at org.apache.jackrabbit.core.query.lucene.TextExtractorJob$1.call(TextExtractorJob.java:91)
      at EDU.oswego.cs.dl.util.concurrent.FutureResult$1.run(Unknown Source)
      at org.apache.jackrabbit.core.query.lucene.TextExtractorJob.run(TextExtractorJob.java:170)
      at EDU.oswego.cs.dl.util.concurrent.PooledExecutor$Worker.run(Unknown Source)
      at java.lang.Thread.run(Thread.java:619)

      How can I fix this?
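
One possible direction for the first problem, offered as a sketch rather than a confirmed fix: if the OutOfMemoryError comes from holding a whole attachment in memory at once (an assumption; the report does not show where the heap is exhausted), copying the binary through a fixed-size buffer keeps heap use independent of the document size. `StreamingCopy` and `copy` are hypothetical names used only for illustration; they are not Magnolia or Jackrabbit APIs.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

/** Hypothetical helper for illustration; not part of Magnolia or Jackrabbit. */
final class StreamingCopy {

    /**
     * Copies all bytes from in to out through one reusable buffer.
     * The copy itself allocates only bufferSize bytes, however large the source is.
     * Returns the number of bytes copied.
     */
    static long copy(InputStream in, OutputStream out, int bufferSize) throws IOException {
        if (bufferSize <= 0) {
            throw new IllegalArgumentException("bufferSize must be positive");
        }
        byte[] buffer = new byte[bufferSize];
        long total = 0;
        int read;
        // read() returns -1 at end of stream; otherwise the number of bytes placed in buffer.
        while ((read = in.read(buffer)) != -1) {
            out.write(buffer, 0, read);
            total += read;
        }
        out.flush();
        return total;
    }

    public static void main(String[] args) throws IOException {
        // Small in-memory stand-in for a large attachment. In a real transfer the
        // source and target would be file or network streams, not byte arrays.
        byte[] attachment = new byte[1024 * 1024];
        ByteArrayOutputStream target = new ByteArrayOutputStream();
        long copied = copy(new ByteArrayInputStream(attachment), target, 8192);
        System.out.println("copied " + copied + " bytes");
    }
}
```

Whether the activation code path can be made to stream like this depends on the Magnolia version in use, which the report does not settle.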

            People

              kraft Boris Kraft
              abii fan abii