Uploaded image for project: 'Magnolia'
  1. Magnolia
  2. MAGNOLIA-2070

Import corrupts UTF-8 characters for String nodes >64K

    XMLWordPrintable

Details

    • Bug
    • Resolution: Won't Fix
    • Critical
    • None
    • 3.5.4
    • admininterface
    • None
    • Windows XP or Windows 2003 Server, community with derby (magnolia-tomcat-bundle-3.5.4-bundle-jdk15.zip) or enterprise with MySQL

    Description

      From AdminCentral Website, import the XML files and export them again. I am using right-click "Import from XML" and "Export tree to XML".
      After export, the longer file (website.corruptedPolish.xml, >64K) will have the Polish "z" replaced with a question mark.
      The shorter file (by 20 lines, <64K) will match the original file.

      It doesn't matter where the Polish character is positioned; the example has it at the bottom of the data, but it happens at the top too.

      I use ExamDiff (prestosoft.com) to compare the original and exported file, as it can display the weird Polish character (gvim can't).

      We're exceeding 64K with a more complex XML string that has the world's countries in 8 different languages, and some other data.
      I believe the corruption occurs during the import (vs. the export), as we have Ajax code that displays the data from the magnolia node, and the question marks appear there.

      John Kalstrom, J2 Global Communications

      Issue is specific to the OS of the server. Happens while using wide range of application server (Tomcat, GlassFish, Weblogic, possibly others as well). Problem occurs on import of the files no matter what browser or OS is on the client side.

      Linux and Windows Vista seems not to be affected by the problem.

      Checklists

        Acceptance criteria

        Attachments

          Activity

            People

              had Jan Haderka
              john.kalstrom John Kalstrom
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Checklists

                  Bug DoR
                  Task DoD