ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> perl-xml
perl-xml
utf-8 (or not) encoding question
by Joshua Santelli other posts by this author
Dec 9 2004 10:21AM messages near this date
view in the new Beta List Site
Re: utf-8 (or not) encoding question | Re: utf-8 (or not) encoding question
& XSLT Hello,

I'm using XML::LibXML to parse a file that I have. 
The character in questions looks like one byte (F3)
when I `less` the file on UNIX:

analysis and algebraic topology, such as
Calder<F3> n-Zygmund theory

This is the error I get when I parse_file() the file:

1.xml:21: parser error : Input is not proper UTF-8,
indicate encoding !
analysis and algebraic topology, such as
Calderón-Zygmund theory
                                               ^
1.xml:21: error: Bytes: 0xF3 0x6E 0x2D 0x5A
analysis and algebraic topology, such as
Calderón-Zygmund theory
                                               ^
 at ./test.pl line 28
 
Is LibXML correct in thinking that this this is not
UTF-8?  Is there an easy way for me to tell if this
(or any file) is properly encoded as UFT-8?

What's wrong with F3 (&amp;#243;)?

Thanks,
josh



		
__________________________________ 
Do you Yahoo!? 
Jazz up your holiday email with celebrity designs. Learn more. 
http://celebrity.mail.yahoo.com
Thread:
Joshua Santelli
Grant McLean
Joshua Santelli

Privacy Policy | Email Opt-out | Feedback | Syndication
© ActiveState Software Inc. All rights reserved