ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> xml-dev
xml-dev
RE: [xml-dev] The general XML processing problem
by =?iso-8859-1?Q?Bill_de_h=D3ra?= other posts by this author
Jul 16 2002 9:59AM messages near this date
RE: [xml-dev] The general XML processing problem | Re: [xml-dev] The general XML processing problem
>  -----Original Message-----
>  From: Patrick Durusau [mailto:pdurusau@[...].edu] 
> 
>  >(SSL)
>  >The hierarchical issues arise from the particular style of embedded 
>  >markup that XML uses, and there's a serious trade-off there.  XML is 
>  >not as flexible for created labeled structures as it might 
>  be precisely 
>  >because it is typically embedded directly in documents, and because 
>  >XML's creators found ambiguity a problem.
>  >
> 
>  Yes, and the ambiguity solution inherited from SGML was to solve the 
>  problem in syntax, not in the processing layer. Since the ambiguity 
>  problem was solved by Earley in 1970 (Earley, J. (1970) An efficient 
>  context-free parsing algorithm. Communications of the Association for 
>  Computing Machinery, 13(2):94-102) as well as dealt with in NLP and 
>  other disciplines by techniques such as active chart parsing 
>  and parse 
>  forests, I fail to see any reason to continue to with a 
>  solution in syntax.

Are we really done then? There's a lot of work done after Earley, ie
Marcus, (and of which Schematron is possibly a special case, I'm not
sure yet) that suggest to me we're not done without resorting to syntax.
The ideal at the XML level would be not to resort to context-free
parsers that require probabilities (PCFGs). And there's still markup to
deal with, even when it's called punctuation ;)

Bill de hÓra
..
Propylon
www.propylon.com 

 


-----------------------------------------------------------------
The xml-dev list is sponsored by XML.org <http://www.xml.org> , an
initiative of OASIS <http://www.oasis-open.org> 

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl> 
Thread:
=?iso-8859-1?Q?Bill_de_h=D3ra?=
Patrick Durusau
Eric van der Vlist

Privacy Policy | Email Opt-out | Feedback | Syndication
© ActiveState Software Inc. All rights reserved