Re: Seg Faults Using HTML::Tidy -- Any Alternatives?
by Shannon Booher other posts by this author
Jan 27 2005 11:43AM messages near this date
view in the new Beta List Site
Re: Seg Faults Using HTML::Tidy -- Any Alternatives?
|
Re: XML::LibXML - appending children and finding them again
& XSLT As an update to the group, using the latest dev version of HTML-Tidy,
1.05_01, fixes the seg fault problem I was having.
Using the perl package, has anyone come up with a data case that will
cause the tidy-> clean to fail (aka unable to clean the HTML)?
I've run this over my current data set and it was able to clean everything
it encountered.
thanks,
sjb
On Fri, 21 Jan 2005, Dominique Quatravaux wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Shannon Booher wrote:
>
> | [...] I'm on Linux 7.3, using HTML::Tidy and the latest libtidy
> | (the 01/18 build).
> |
> | Using the default configuration, I get seg faults on any non-HTML
> | tags. [...]
>
> By any chance, does the HTML::Tidy test suite fail on the
> t/segfault-form case? If so, apply the attached patch, or try the
> latest developper-only version of HTML::Tidy
> (http://search.cpan.org/CPAN/authors/id/P/PE/PETDANCE/HTML-Tidy-1.05_01.tar.gz).
>
> (I told Andy Lester about that a few months ago already, but he seems
> to be one busy person - Let's prod him again then :-)
>
> diff -r -U3 HTML-Tidy-1.04/Tidy.xs HTML-Tidy-1.04.NEW/Tidy.xs
> - --- HTML-Tidy-1.04/Tidy.xs Fri Apr 2 07:36:19 2004
> +++ HTML-Tidy-1.04.NEW/Tidy.xs Tue Oct 19 15:33:09 2004
> @@ -61,8 +61,11 @@
> ~ if ( rc >= 0 ) {
> ~ char *str;
> ~ str = (char *)output.bp;
> - - if ( str )
> + if ( str ) {
> ~ RETVAL = newSVpvn( str, strlen(str) );
> + } else {
> + RETVAL = &PL_sv_undef;
> + }
> ~ tidyBufFree( &output );
> ~ } else {
> ~ XSRETURN_UNDEF;
>
> - --
> Dominique QUATRAVAUX Ingénieur senior
> 01 44 42 00 08 IDEALX
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.2.4 (GNU/Linux)
> Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org
>
> iD8DBQFB8XJWMJAKAU3mjcsRAsCSAJwMyTL9JCF9tHv2r1nMpHjUUGkdcgCfYw+i
> uFL63n1fKWESMYapnfKNnjM=
> =bz0w
> -----END PGP SIGNATURE-----
>
>
>
Attachments:
unknown1
unknown2
Thread:
Shannon Booher
Dominique Quatravaux
Shannon Booher
|