Character encodings

My EJB XML descriptors have <?xml version="1.0" encoding="UTF-8"?> in the header. I set the default encoding to UTF-8 in the IDE settings and enabled the option "files detected as encoded in UTF-family of encodings automatically load in UTF". I then saved the file and expected it to be written in the specified character set. But when I checked with the "ShowEncodingPlugin" plugin written by Mr. Guillaume Laforge, it says the character encoding is still "US-ASCII". How does this work? Wasn't it supposed to be "UTF-8" now?
I'm doing this because JBoss throws the following exception every time I try to deploy my EJBs:

17:07:54,289 ERROR Document root element is missing.:-1:1
org.xml.sax.SAXParseException: Document root element is missing.
at org.apache.crimson.parser.Parser2.fatal(

Is anyone familiar with this issue? I checked the byte order mark and it's 0xFF 0xFE (is that the correct BOM?)

I'm using JBoss 3.0.x.

I also came across this blog by Mr. Laforge, which might confirm that IDEA 4 has something to do with my problem.

I'm using IDEA 4.0 build 1165 on Windows 2000.

Bump. Is my problem an isolated one? Am I the only one having it?

By the way, I got this from

UTF-8 - EF BB BF - 
UTF-16LE - FF FE - ÿþ
UTF-16BE - FE FF - þÿ

So what I have is UTF-16LE...
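
For anyone who wants to check the BOM without a plugin or a hex editor, here is a minimal sketch in Java that compares the first bytes of a file against the three signatures listed above. The class name `BomSniffer` and the method `detectBom` are made up for illustration and are not part of IDEA, JBoss, or ShowEncodingPlugin. It assumes a JDK with `java.nio.file` (Java 7 or later), which is newer than what JBoss 3.0.x ran on.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper for illustration only.
class BomSniffer {

    // Returns the encoding implied by a leading BOM, or "none" if no BOM is present.
    static String detectBom(byte[] head) {
        if (startsWith(head, 0xEF, 0xBB, 0xBF)) return "UTF-8";
        // Caveat: a UTF-32LE BOM (FF FE 00 00) also starts with FF FE.
        if (startsWith(head, 0xFF, 0xFE)) return "UTF-16LE";
        if (startsWith(head, 0xFE, 0xFF)) return "UTF-16BE";
        return "none";
    }

    // Compares the leading bytes of data (as unsigned values) with the given prefix.
    private static boolean startsWith(byte[] data, int... prefix) {
        if (data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) return false;
        }
        return true;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java BomSniffer <file>");
            return;
        }
        // args[0] is the path of the descriptor to check.
        byte[] bytes = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(args[0] + ": BOM = " + detectBom(bytes));
    }
}
```

A result of "none" only means there is no BOM. It does not tell you which encoding the rest of the file is actually in.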

I solved the problem by using a reliable Unicode text editor to convert all the XML descriptor files to UTF-8 without a BOM (byte order mark). IntelliJ IDEA 4 seems to have a bug when saving XML files: the files get converted to UTF-16LE even if you have already set the default character encoding to UTF-8.
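
If you would rather script the conversion than open each descriptor in an editor, something like the following Java sketch might do it. This is not the editor-based fix described above, just one possible equivalent. The class `StripBomToUtf8` and the method `toUtf8NoBom` are hypothetical names, and the code assumes Java 7 or later for `StandardCharsets` and `java.nio.file`. It decodes the file according to its BOM and writes the text back as UTF-8 with no BOM. Files without a BOM are left untouched, on the assumption that they are already fine.

```java
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper for illustration only.
class StripBomToUtf8 {

    // Re-encodes BOM-prefixed bytes as UTF-8 without a BOM.
    // Input without a recognised BOM is returned unchanged (assumption: it is already usable).
    static byte[] toUtf8NoBom(byte[] raw) {
        Charset source;
        int bomLength;
        if (raw.length >= 3 && (raw[0] & 0xFF) == 0xEF
                && (raw[1] & 0xFF) == 0xBB && (raw[2] & 0xFF) == 0xBF) {
            source = StandardCharsets.UTF_8;
            bomLength = 3;
        } else if (raw.length >= 2 && (raw[0] & 0xFF) == 0xFF && (raw[1] & 0xFF) == 0xFE) {
            source = StandardCharsets.UTF_16LE;
            bomLength = 2;
        } else if (raw.length >= 2 && (raw[0] & 0xFF) == 0xFE && (raw[1] & 0xFF) == 0xFF) {
            source = StandardCharsets.UTF_16BE;
            bomLength = 2;
        } else {
            return raw;
        }
        // Decode everything after the BOM, then encode again as plain UTF-8.
        String text = new String(raw, bomLength, raw.length - bomLength, source);
        return text.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Each argument is a file to convert in place; keep a backup before running this.
        for (String name : args) {
            Path path = Paths.get(name);
            Files.write(path, toUtf8NoBom(Files.readAllBytes(path)));
        }
    }
}
```

The sketch rewrites files in place, so try it on copies of the descriptors first.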
