ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> xsl-list
xsl-list
Re: [xsl] Encoding issues with document() function
by Pankaj Bishnoi other posts by this author
Nov 4 2006 6:34AM messages near this date
Re: [xsl] Encoding issues with document() function | RE: [xsl] Encoding issues with document() function
& XSLT Hi
     I am facing problems in removing 0xb,0xc,0xe,0xf. What will be the
representation for these characters in UTF-8. For 0x1 ia m using::
'\u0001' and it works fine. But the problem is with 0xb,0xc,0xe,0xf.

thanks
pankaj
----- Original Message ----- 
From: "Joe Fawcett" <joefawcett@[...].com> 
To: <xsl-list@[...].com> 
Sent: Saturday, November 04, 2006 6:00 PM
Subject: Re: [xsl] Encoding issues with document() function


>  It doesn't matter about the encoding. XML cannot have 0xb, 0xc, 0xe and
0xf
>  in it.
>  You can base64encode the data if it's part of an element's content before
>  passing it to the XML parser, or replace the characters with allowed ones
>  and then post process the data later to re-insert.
> 
>  Joe
> 
> 
>  >From: "Pankaj Bishnoi" <pankaj.bishnoi@[...].com>
>  >Reply-To: xsl-list@[...].com
>  >To: <xsl-list@[...].com>
>  >Subject: Re: [xsl] Encoding issues with document() function
>  >Date: Sat, 4 Nov 2006 17:53:11 +0530
>  >
>  >Thanks for your help michael. Now i am replacing unicode characters.
>  >
>  >I have the encoding UTF-8 now::
>  >
>  >for 0x2 i can use replace('\u0002','')
>  >
>  >but for following characers what will be the replace character::
>  >
>  >0xa,0xb,0xc,0xd,0xe,0xf
>  >
>  >
>  >Thanks
>  >Pankaj
>  >
>  >----- Original Message -----
>  >From: "Michael Kay" <mike@[...].com>
>  >To: <xsl-list@[...].com>
>  >Sent: Saturday, November 04, 2006 3:08 PM
>  >Subject: RE: [xsl] Encoding issues with document() function
>  >
>  >
>  > > If the document really does contain the Unicode character with
codepoint
>  > > 0x02, then it's not a well-formed XML document, and you won't be able
to
>  > > read it from XSLT or from anything else that's designed to process
XML.
>  >You
>  > > need  to correct the program that created the document so that it
>  >outputs
>  > > well-formed XML.
>  > >
>  > > The other possibility is that the document contains some other
character
>  > > which is being misread as codepoint 0x02 because the parser is using
the
>  > > wrong encoding, for example because the XML declaration is incorrect.
>  > >
>  > > Michael Kay
>  > > http://www.saxonica.com/
>  > >
>  > > > -----Original Message-----
>  > > > From: Pankaj Bishnoi [mailto:pankaj.bishnoi@[...].com]
>  > > > Sent: 04 November 2006 09:24
>  > > > To: xsl-list@[...].com
>  > > > Subject: [xsl] Encoding issues with document() function
>  > > >
>  > > > Hi All
>  > > >         I am having a xsl in which i use XSLT document()
>  > > > function. The problem i am facing is that the xml file i am
>  > > > trying to read by using
>  > > > document() function is having some Unicode characters and the
>  > > > exception thrown at transformation time is ::
>  > > >
>  > > > SystemId Unknown; Line #133;Column #104; Can not load
>  > > > requested doc: An invalid XML character(Unicode: 0x2) was
>  > > > found in the element content of the document
>  > > >
>  > > > The source xml file is having encoding UTF-8. I tried to
>  > > > search the web for this issue and one alternate specified is
>  > > > to replace thos '0x2' character.
>  > > > Now there can be other characters as well that might come in
>  > > > other scenarios such as 0x1,0x13 etc. Now my quesstion is is
>  > > > there any encoding that supports all these characters?
>  > > >
>  > > > Is there any way out for this issue . Any help will be highly
>  > > > appreciated.
>  > > >
>  > > > Thanks
>  > > > Pankaj
>  > > >
>  > > >
>  > >
>  --~------------------------------------------------------------------
>  > > > XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
>  > > > To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
>  > > > or e-mail: <mailto:xsl-list-unsubscribe@[...].com>
>  > > > --~--
>  > > >
>  > >
>  > >
>  > > --~------------------------------------------------------------------
>  > > XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
>  > > To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
>  > > or e-mail: <mailto:xsl-list-unsubscribe@[...].com>
>  > > --~--
>  > >
>  >
>  >
>  >--~------------------------------------------------------------------
>  >XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
>  >To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
>  >or e-mail: <mailto:xsl-list-unsubscribe@[...].com>
>  >--~--
>  >
> 
> 
> 
>  --~------------------------------------------------------------------
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
>  To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
>  or e-mail: <mailto:xsl-list-unsubscribe@[...].com>
>  --~--
> 


--~------------------------------------------------------------------
XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
To unsubscribe, go to: http://lists.mulberrytech.com/xsl-list/
or e-mail: <mailto:xsl-list-unsubscribe@[...].com> 
--~--
Thread:
Nic James Ferrier
Pankaj Bishnoi
Michael Kay
Pankaj Bishnoi
Joe Fawcett

Privacy Policy | Email Opt-out | Feedback | Syndication
© ActiveState Software Inc. All rights reserved