RE: NLP portal nlp.petamem.com
by Tolkin, Steve other posts by this author
Apr 2 2003 2:59PM messages near this date
RE: NLP portal nlp.petamem.com
|
Re: NLP portal nlp.petamem.com
Can you say more about this.
Is the source code available?
How to you decide which diacritics to add?
For example both Mueller and Muller get the umlaut added
on the "u".
Do you know of code that removes diacritics in a reasonable
way, e.g. for systems that can only handle ASCII.
Ideally your approach to ading diacritics would be fully reversible,
when processing the unaccented words,
but that is perhaps too idealistic.
Hopefully helpfully yours,
Steve
--
Steven Tolkin steve.tolkin@[...].com 617-563-0516
Fidelity Investments 82 Devonshire St. V4D Boston MA 02109
There is nothing so practical as a good theory. Comments are by me,
not Fidelity Investments, its subsidiaries or affiliates.
> -----Original Message-----
> From: Richard Jelinek [mailto:rj@[...].com]
> Sent: Wednesday, April 02, 2003 9:29 AM
> To: perl-ai@[...].org
> Subject: NLP portal nlp.petamem.com
>
>
> Hi,
>
> thought you might be interested in http://nlp.petamem.com
>
> The aim of this site is to provide a nice portal with various NLP
> services. The backend is mostly pure Perl, the frontend is
> mod_perl2. It's still in development, doesn't contain many features
> yet and has some flaws - but we're working on it.
>
> --
> best regards,
>
> Dipl.-Inf. Richard Jelinek
>
> - PetaMem s.r.o. - Ocelarska 1 - Prague - www.petamem.com -
> -= 2026049 Mind Units =-
>
Thread:
Tolkin, Steve
Richard Jelinek
|