FAQ
HI,

Currently my text source files (800 000) are stored in folder which make
retrieving it by many users some how slow. I heard it might be possible that
these files content can be stored in the index it self although I found this
unrealistic.

Is it possible storing the source text files content in the index instead of
just “doc id” which is assigned with each term in the index?




--
View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3176703.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Search Discussions

  • Ian Lea at Jul 18, 2011 at 8:12 am
    Of course. See the javadocs for Field, Field.Store and Field.Index.


    --
    Ian.

    On Sun, Jul 17, 2011 at 1:40 PM, starz10de wrote:
    HI,

    Currently my text source files (800 000) are stored in folder which make
    retrieving it by many users some how slow. I heard it might be possible that
    these files content can be stored in the index it self although I found this
    unrealistic.

    Is it possible storing the source text files content in the index instead of
    just “doc id” which is assigned with each term in the index?




    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3176703.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Starz10de at Jul 18, 2011 at 8:14 pm
    thanks for your reply

    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3180435.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Erick Erickson at Jul 18, 2011 at 11:13 pm
    It's certainly possible as others have said, but don't be surprised
    if it's not performant. At root, you still have a disk out there that's
    being used for fetching the data. Simply moving it from fetching
    individual files to fetching that data from the index doesn't change
    that fundamental fact.

    Before jumping to this conclusion, you might want to instrument
    your code or use one of the profiler tools to see where the actual
    bottleneck is rather than assume you know (if you haven't already).

    Seriously, you *must* determine whether the file I/O is the problem
    or whether you're, say, CPU bound or else risk wasting
    lots of time and energy and still not fixing the problem.


    Best
    Erick
    On Mon, Jul 18, 2011 at 4:13 PM, starz10de wrote:
    thanks for your reply

    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3180435.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Starz10de at Jul 19, 2011 at 12:28 pm
    thanks for your kind answer

    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3182340.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Jagdish Vasani at Jul 18, 2011 at 5:16 pm
    Yes..you can Store Text file content by saying that field store-Field.Store.YES.same time you can also index it by saying Field.Index.ANALYZED as another parameter of Field class contructor.

    Thanks,
    Jagdish

    -----Original Message-----
    From: starz10de
    Sent: Sunday, July 17, 2011 6:10 PM
    To: java-user@lucene.apache.org
    Subject: Store the documents content in the index

    HI,

    Currently my text source files (800 000) are stored in folder which make
    retrieving it by many users some how slow. I heard it might be possible that
    these files content can be stored in the index it self although I found this
    unrealistic.

    Is it possible storing the source text files content in the index instead of
    just “doc id” which is assigned with each term in the index?




    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3176703.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Starz10de at Jul 18, 2011 at 8:13 pm
    thanks for your reply

    --
    View this message in context: http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3180432.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org
  • Andrew Kane at Jul 19, 2011 at 2:15 am
    Some file systems might be slow if too many files are in one folder, try
    splitting them into subfolders...

    Andrew.

    On Sun, Jul 17, 2011 at 8:40 AM, starz10de wrote:

    HI,

    Currently my text source files (800 000) are stored in folder which make
    retrieving it by many users some how slow. I heard it might be possible
    that
    these files content can be stored in the index it self although I found
    this
    unrealistic.

    Is it possible storing the source text files content in the index instead
    of
    just “doc id” which is assigned with each term in the index?




    --
    View this message in context:
    http://lucene.472066.n3.nabble.com/Store-the-documents-content-in-the-index-tp3176703p3176703.html
    Sent from the Lucene - Java Users mailing list archive at Nabble.com.

    ---------------------------------------------------------------------
    To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
    For additional commands, e-mail: java-user-help@lucene.apache.org

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
categorieslucene
postedJul 17, '11 at 12:40p
activeJul 19, '11 at 12:28p
posts8
users5
websitelucene.apache.org

People

Translate

site design / logo © 2022 Grokbase