FAQ
most of the html parsers I can find on the web handle only the <tag>
syntax and forget about the { code } syntax that usually occurs in a
lot of web pages.

Is there a good library to return the plain text of a html doc string
which will eliminate more than simply the <tag> occurrance?

--
___________________________________________________
Chris Fraschetti
e fraschetti@gmail.com

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org

Search Discussions

  • Otis Gospodnetic at Sep 26, 2004 at 10:25 pm
    I use NekoHTML and am happy with it. I don't know about that
    particular case.

    Otis
    http://www.simpy.com/ - full-text search your bookmarks

    --- Chris Fraschetti wrote:
    most of the html parsers I can find on the web handle only the <tag>
    syntax and forget about the { code } syntax that usually occurs in a
    lot of web pages.

    Is there a good library to return the plain text of a html doc string
    which will eliminate more than simply the <tag> occurrance?

    --
    ___________________________________________________
    Chris Fraschetti
    e fraschetti@gmail.com

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
    For additional commands, e-mail: lucene-user-help@jakarta.apache.org

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
    For additional commands, e-mail: lucene-user-help@jakarta.apache.org

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
categorieslucene
postedSep 25, '04 at 3:21p
activeSep 26, '04 at 10:25p
posts2
users2
websitelucene.apache.org

People

Translate

site design / logo © 2022 Grokbase