xml-dev
Re: Its the semantics dummy :-)
by Tom Passin
Aug 1 2000 1:38AM
Jonathan Borden wrote, in the continuing saga -

>  Simon St. Laurent wrote:
>
>  [Jonathan Borden]:
>  > >It is always
>  > >possible to send a perfectly well formed XML document that is totally
>  > >useless e.g.
>  > ><doc> <byte>67</byte> <byte>121</byte>... </doc>
>  >
>  > Uselessness is in the eye of the beholder, and MIME only takes you to
>  > the document container, not to its contents.  67 121 might be a very
>  > important code to me.
>
>  The point being that this document is no less difficult to understand
>  than a pure binary document consisting of byte after byte. The fact that
>  one document is XML or the other document can be sent using base64
>  encoding over text based SMTP makes no substantial difference (and one
>  can quite easily convert between the two formats).

Actually, there is quite a difference between this XML document and a pure
binary one.  In the binary one you have no idea about the structure - you
don't even know if it's really supposed to be a series of bytes or a series
of Unicode code points or what.  In the XML document, you know that the data
items are 67, 121, ....  True, you don't know if they are strings or
integers, but that might not matter anyway.
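
To make the contrast concrete, here is a minimal Python sketch.  It is only
an illustration: it uses just the two values shown in the quoted example, and
the function names are made up for this note.  The XML parse yields the data
items directly, while the same two values stored as raw bytes admit several
equally plausible readings.

```python
import struct
import xml.etree.ElementTree as ET

# The two <byte> items from the quoted example (the elided rest is left out).
XML_DOC = "<doc> <byte>67</byte> <byte>121</byte> </doc>"


def items_from_xml(text):
    """Return the text of each child element, in document order."""
    root = ET.fromstring(text)
    return [child.text for child in root]


def readings_of_binary(raw):
    """Return several equally plausible readings of the same two raw bytes."""
    return {
        "unsigned bytes": list(raw),
        "16-bit little-endian": struct.unpack("<H", raw)[0],
        "16-bit big-endian": struct.unpack(">H", raw)[0],
        "ASCII text": raw.decode("ascii"),
    }


# The markup delimits the items; whether they are strings or integers is
# still up to the reader of the document.
items = items_from_xml(XML_DOC)

# Nothing in the raw bytes says which of these readings was intended.
readings = readings_of_binary(bytes([67, 121]))
```

The XML side still leaves the string-versus-integer question open, but the
number and boundaries of the items are not in doubt.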

And this isn't necessarily trivial.  I once wrote a program that ingested
time series data, did all kinds of processing on it, and plotted the
results.  I wrote it to ingest one or two column ASCII text, to discover the
number of columns, to discover which lines were comment lines (if a line
didn't start with a valid number, it was a comment), and even to learn
whether the input was really a stream of binary bytes.  Worked great.  But I
couldn't discover if I was trying to read a file with binary 16-bit numbers.
I could imagine how to do it, but it was too hard to justify the effort and
wouldn't have been foolproof anyway.
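
The kind of discovery described above might be sketched in Python as
follows.  This is not the original program, only a guess at the heuristics;
the names `is_number`, `parse_series`, and `looks_binary` are hypothetical,
and the binary test is a crude assumption (NUL bytes or non-ASCII content).

```python
def is_number(token):
    """Return True if the token parses as a floating-point number."""
    try:
        float(token)
        return True
    except ValueError:
        return False


def parse_series(lines):
    """Split text lines into numeric rows and comments; infer column count."""
    comments, rows = [], []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # blank line: neither data nor comment
        if not is_number(fields[0]):
            # A line that does not start with a valid number is a comment.
            comments.append(line.rstrip("\n"))
            continue
        # Keep the leading run of numeric fields (at most two columns).
        row = []
        for field in fields[:2]:
            if not is_number(field):
                break
            row.append(float(field))
        rows.append(row)
    ncols = max((len(r) for r in rows), default=0)
    return ncols, rows, comments


def looks_binary(raw):
    """Crude guess: NUL bytes or non-ASCII content means 'not text'."""
    if b"\x00" in raw:
        return True
    try:
        raw.decode("ascii")
    except UnicodeDecodeError:
        return True
    return False


sample = ["# time value", "0.0 1.5", "1.0 2.5", "end of run"]
ncols, rows, comments = parse_series(sample)
```

Note that nothing here can tell 8-bit samples from 16-bit ones: a check like
`looks_binary` only separates text from non-text, which is exactly the gap
described above.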

If I could have inspected the files to find out the storage units, it would
have been great.

These marked-up documents are called "self-describing" not really because of
semantics, but because the structural units are self-delineated.

Cheers,

Tom Passin