[MAGNOLIA-2070] Import corrupts UTF-8 characters for String nodes >64K Created: 22/Feb/08  Updated: 03/Sep/08  Resolved: 03/Sep/08

Status: Closed
Project: Magnolia
Component/s: admininterface
Affects Version/s: 3.5.4
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: John Kalstrom Assignee: Jan Haderka
Resolution: Won't Fix Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified
Environment:

Windows XP or Windows 2003 Server, community with derby (magnolia-tomcat-bundle-3.5.4-bundle-jdk15.zip) or enterprise with MySQL


Attachments: XML File website.corruptedPolish.xml     XML File website.goodPolish.xml    

 Description   

From AdminCentral Website, import the XML files and export them again. I am using right-click "Import from XML" and "Export tree to XML".
After export, the longer file (website.corruptedPolish.xml, >64K) will have the Polish "z" replaced with a question mark.
The shorter file (by 20 lines, <64K) will match the original file.

It doesn't matter where the Polish character is positioned; the example has it at the bottom of the data, but it happens at the top too.

I use ExamDiff (prestosoft.com) to compare the original and exported file, as it can display the weird Polish character (gvim can't).
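
The manual comparison above could also be scripted. The following is a minimal sketch, not part of the original report: it assumes both files are readable as UTF-8 and that the export keeps the same character positions as the original (if the export reformats the XML, a real diff tool is still needed). The function names are hypothetical.

```python
def find_replaced_chars(original: str, exported: str):
    """Return (index, original_char) for every position where a non-ASCII
    character in `original` shows up as '?' in `exported`.

    Assumption: both strings are aligned character by character.
    """
    hits = []
    for i, (a, b) in enumerate(zip(original, exported)):
        # Only flag non-ASCII characters that turned into a literal '?'.
        if a != b and b == "?" and ord(a) > 127:
            hits.append((i, a))
    return hits


def compare_files(original_path: str, exported_path: str):
    """Read both files as UTF-8 and report the replaced characters."""
    with open(original_path, encoding="utf-8") as f:
        original = f.read()
    with open(exported_path, encoding="utf-8") as f:
        exported = f.read()
    return find_replaced_chars(original, exported)
```

For example, `compare_files("website.corruptedPolish.xml", "exported.xml")` would list each affected position, where `exported.xml` is a placeholder name for the file produced by "Export tree to XML".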

We're exceeding 64K with a more complex XML string that has the world's countries in 8 different languages, and some other data.
I believe the corruption occurs during the import (vs. the export), as we have Ajax code that displays the data from the magnolia node, and the question marks appear there.
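
For illustration only, here is one common way a character can end up as a question mark: the text passes through a charset that cannot represent it. This report does not establish that this is what happens in Magnolia or on Windows XP/2003; the choice of `cp1252` and the helper name are assumptions made for the sketch.

```python
def lossy_round_trip(text: str, charset: str = "cp1252") -> str:
    """Encode `text` to `charset` and decode it again.

    Characters the charset cannot represent are replaced with '?',
    which mirrors the symptom described in this report. cp1252 is an
    assumed example charset, not a confirmed cause.
    """
    return text.encode(charset, errors="replace").decode(charset)


if __name__ == "__main__":
    sample = "każdy"  # contains the Polish letter 'ż' (U+017C)
    print(lossy_round_trip(sample))           # 'ż' is not in cp1252
    print(lossy_round_trip(sample, "utf-8"))  # UTF-8 preserves it
```

Once a character has been replaced this way, the original cannot be recovered from the stored value, which would be consistent with the question marks also appearing in the Ajax output.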

John Kalstrom, J2 Global Communications

The issue is specific to the OS of the server. It occurs across a wide range of application servers (Tomcat, GlassFish, WebLogic, possibly others as well). The problem occurs on import of the files, regardless of the browser or OS on the client side.

Linux and Windows Vista do not seem to be affected by the problem.



 Comments   
Comment by Jan Haderka [ 25/Feb/08 ]

Works fine on Linux with FF as a browser.
This might be an OS- or browser-specific problem.

Comment by Jan Haderka [ 25/Feb/08 ]

Tried on Vista with IE and FF and could not reproduce it there either. Perhaps the app server is what makes the difference?

Comment by Jan Haderka [ 10/Mar/08 ]

Platform specific. Occurs only on Win XP and Win 2003.

Comment by Jan Haderka [ 03/Sep/08 ]

Platform specific. Affects only older versions of MS Windows: Win XP and Win 2003 Server. Since the issue is with the OS itself, there is nothing we can do to fix it. The workaround is to upgrade to a newer Windows version (Win Vista) or to switch to a different OS; Mac OS and Linux do not exhibit the problem.

Generated at Mon Feb 12 03:33:07 CET 2024 using Jira 9.4.2#940002-sha1:46d1a51de284217efdcb32434eab47a99af2938b.