[xml-dev] utf8 -> utf16 conversion problem ?
by Alex Ben-Ari
Feb 12 2003 9:34AM
Hello all.
I am wondering if anyone can help me with the following little XML/Unicode curiosity:
I have a Xerces-C SAX2 parser parsing an in-memory buffer of XML encoded in UTF-8.
When it reads the byte sequence C3 AA (which is 'e' with a circumflex, U+00EA), the characters() method
receives 00 C3 00 AA, which is wrong! Each input byte seems to have been widened to 16 bits on its own instead of the pair being decoded as one character.
The correct UTF-16 value for 'e' with a circumflex is 00EA.
What is going on?
Thanks to anyone who can enlighten me on this one.
Alex.
P.S. The code I am using looks like this:
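#include <xercesc/sax2/SAX2XMLReader.hpp>
#include <xercesc/sax2/XMLReaderFactory.hpp>
#include <xercesc/framework/MemBufInputSource.hpp>

XERCES_CPP_NAMESPACE_USE

void someFunction(char *xml, int32_t xml_len) {
    SAX2XMLReader* parser = XMLReaderFactory::createXMLReader();
    // ExtractHandler is derived from ContentHandler and ErrorHandler
    ExtractHandler extHandler(xml, xml_len);
    parser->setContentHandler(&extHandler);
    parser->setErrorHandler(&extHandler);
    // Wrap the caller's buffer; "false" means the source does not adopt (free) it
    MemBufInputSource mbis((XMLByte*)xml, xml_len, "", false);
    mbis.setCopyBufToStream(false); // not necessary to duplicate the buffer
    parser->parse(mbis);
    delete parser; // the reader returned by the factory is owned by the caller
}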
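
To make the expected arithmetic concrete, here is a minimal standalone sketch. It is not Xerces-C code: decodeUtf8 and widenBytes are hypothetical helper names made up for illustration, and the decoder assumes well-formed input limited to 1- to 3-byte sequences (no surrogate pairs, no real validation). The first helper does the UTF-8 decoding I expect; the second just widens each byte, which is what a one-byte-per-character decoding such as ISO-8859-1 would do.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: decode UTF-8 bytes into UTF-16 code units.
// Handles 1-, 2- and 3-byte sequences only (Basic Multilingual Plane);
// anything else is replaced by U+FFFD.
std::vector<uint16_t> decodeUtf8(const unsigned char* p, std::size_t n) {
    std::vector<uint16_t> out;
    std::size_t i = 0;
    while (i < n) {
        const unsigned char b = p[i];
        if (b < 0x80) {
            // 0xxxxxxx: plain ASCII
            out.push_back(b);
            i += 1;
        } else if ((b & 0xE0) == 0xC0 && i + 1 < n) {
            // 110xxxxx 10yyyyyy: 5 bits from the lead byte, 6 from the trail byte
            out.push_back(static_cast<uint16_t>(((b & 0x1F) << 6) | (p[i + 1] & 0x3F)));
            i += 2;
        } else if ((b & 0xF0) == 0xE0 && i + 2 < n) {
            // 1110xxxx 10yyyyyy 10zzzzzz: 4 + 6 + 6 bits
            out.push_back(static_cast<uint16_t>(((b & 0x0F) << 12) |
                                                ((p[i + 1] & 0x3F) << 6) |
                                                (p[i + 2] & 0x3F)));
            i += 3;
        } else {
            // Unsupported or truncated sequence in this sketch
            out.push_back(0xFFFD);
            i += 1;
        }
    }
    return out;
}

// Hypothetical helper: widen each byte to 16 bits unchanged, as a
// one-byte-per-character decoding (e.g. ISO-8859-1) would.
std::vector<uint16_t> widenBytes(const unsigned char* p, std::size_t n) {
    return std::vector<uint16_t>(p, p + n);
}
```

Feeding the two bytes C3 AA to decodeUtf8 gives the single code unit 00EA, while widenBytes gives the two code units 00C3 00AA, which matches what characters() is handing me.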
-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org> , an
initiative of OASIS <http://www.oasis-open.org>
The list archives are at http://lists.xml.org/archives/xml-dev/
To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>