ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> perl-xml
perl-xml
Re: Seg Faults Using HTML::Tidy -- Any Alternatives?
by Shannon Booher other posts by this author
Jan 27 2005 11:43AM messages near this date
view in the new Beta List Site
Re: Seg Faults Using HTML::Tidy -- Any Alternatives? | Re: XML::LibXML - appending children and finding them again
& XSLT As an update to the group, using the latest dev version of HTML-Tidy, 
1.05_01, fixes the seg fault problem I was having.

Using the perl package, has anyone come up with a data case that will 
cause the tidy-> clean to fail (aka unable to clean the HTML)?

I've run this over my current data set and it was able to clean everything 
it encountered.

thanks,

sjb

On Fri, 21 Jan 2005, Dominique Quatravaux wrote:

>  -----BEGIN PGP SIGNED MESSAGE-----
>  Hash: SHA1
> 
>  Shannon Booher wrote:
> 
>  | [...] I'm on Linux 7.3, using HTML::Tidy and the latest libtidy
>  | (the 01/18 build).
>  |
>  | Using the default configuration, I get seg faults on any non-HTML
>  | tags.  [...]
> 
>  By any chance, does the HTML::Tidy test suite fail on the
>  t/segfault-form case? If so, apply the attached patch, or try the
>  latest developper-only version of HTML::Tidy
>  (http://search.cpan.org/CPAN/authors/id/P/PE/PETDANCE/HTML-Tidy-1.05_01.tar.gz).
> 
>  (I told Andy Lester about that a few months ago already, but he seems
>  to be one busy person - Let's prod him again then :-)
> 
>  diff -r -U3 HTML-Tidy-1.04/Tidy.xs HTML-Tidy-1.04.NEW/Tidy.xs
>  - --- HTML-Tidy-1.04/Tidy.xs    Fri Apr  2 07:36:19 2004
>  +++ HTML-Tidy-1.04.NEW/Tidy.xs    Tue Oct 19 15:33:09 2004
>  @@ -61,8 +61,11 @@
>  ~         if ( rc >= 0 ) {
>  ~             char *str;
>  ~             str = (char *)output.bp;
>  - -            if ( str )
>  +            if ( str ) {
>  ~                 RETVAL = newSVpvn( str, strlen(str) );
>  +        } else {
>  +                RETVAL = &PL_sv_undef;
>  +            }
>  ~             tidyBufFree( &output );
>  ~         } else {
>  ~             XSRETURN_UNDEF;
> 
>  - --
>  Dominique QUATRAVAUX                           Ingénieur senior
>  01 44 42 00 08                                 IDEALX
> 
>  -----BEGIN PGP SIGNATURE-----
>  Version: GnuPG v1.2.4 (GNU/Linux)
>  Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
> 
>  iD8DBQFB8XJWMJAKAU3mjcsRAsCSAJwMyTL9JCF9tHv2r1nMpHjUUGkdcgCfYw+i
>  uFL63n1fKWESMYapnfKNnjM=
>  =bz0w
>  -----END PGP SIGNATURE-----
> 
> 
> 
Attachments:
unknown1
unknown2

Thread:
Shannon Booher
Dominique Quatravaux
Shannon Booher

Privacy Policy | Email Opt-out | Feedback | Syndication
© ActiveState Software Inc. All rights reserved