Re: [PHP-DEV] Removal of unicode_semantics
by Andrei Zmievski other posts by this author
May 7 2008 9:54AM messages near this date
Re: [PHP-DEV] Removal of unicode_semantics
|
Re: [PHP-DEV] Removal of unicode_semantics
Precisely.
Stefan Walk wrote:
> Lester Caine schrieb:
> > That sounds like just the sort of edge case that Derick is suggesting
> > needs logging for fixing up. unicode_semantics=on is just another
> > bodge to to make it happen rather than a solution. I think I
> > understand your description, and to my eyes it looks like a unicode
> > bug that needs addressing?
>
> No, it's a misunderstanding of how things work that has been explained
> to Tomas countless times. A unicode string consists of codepoints, not
> of bytes. Having \xXX and \XXX insert bytes instead of codepoints does
> not make sense, because a) That would require a defined unicode
> encoding to be used, and even if that is the case b) would allow you to
> insert broken data into the unicode string, so it's not a unicode string
> anymore, which is a no-no. If you want to do that sort of fiddling with
> binary details, use binary strings, not unicode strings.
>
> Regards,
> Stefan
>
--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php
Thread:
Scott MacVicar
Kalle Sommer Nielsen
Nikolay Ananiev
Antony Dovgal
Tomas Kuliavas
Antony Dovgal
Tomas Kuliavas
Antony Dovgal
dz
Arvids Godjuks
Christian Schneider
Arvids Godjuks
Christian Schneider
Jeremy Privett
Alexey Zakhlestin
Scott MacVicar
Marco
Derick Rethans
Andrei Zmievski
Andi Gutmans
Derick Rethans
Andrei Zmievski
Andi Gutmans
Hannes Magnusson
Lukas Kahwe Smith
Tomas Kuliavas
Andrei Zmievski
Lester Caine
Derick Rethans
Lester Caine
Stefan Walk
Andrei Zmievski
Tomas Kuliavas
|