102 discussions - 444 posts

  • Hi All, I am using Lucene for creating indexes. There is one field, "email", which stores the email id. I have a few queries regarding searching: 1. I want to search for all the records having domain ...
    Aditi Goyal
    Jun 23, 2008 at 6:22 am
    Dec 8, 2008 at 7:06 pm
  • Hi, Is there a lucene index reader that will load a disk-based index into memory and perform searches on it from RAM? Sorry if I missed this in the docs somewhere. Darren ...
    Darren Govoni
    Jun 26, 2008 at 7:41 pm
    Aug 13, 2008 at 5:13 pm
  • If I search for a keyword like 'computer' on a shopping website, the result may contain: total: (1000) products . categories: pc (500) products . notebook (300) products . server (200) products . so ...
    Lutan
    Jun 28, 2008 at 7:58 am
    Jul 18, 2008 at 4:49 pm
  • Hi, I am using Lucene for indexing and searching the documents. It's working fine for supported documents. Now I want to index documents with unsupported MIME types. Right now I am using LIUS which is ...
    Gaurav Sharma
    Jun 18, 2008 at 2:08 pm
    Jul 4, 2008 at 5:53 pm
  • Hi, I see some strange behaviour of Lucene. The following scenario: While adding documents to my index (every doc is pretty small, doc count is about 12000) I have implemented a custom behaviour of ...
    Sascha Fahl
    Jun 30, 2008 at 1:00 pm
    Jul 1, 2008 at 12:37 pm
  • Is it possible to do nested proximity searches with lucene? i.e. can I say I want a to be within 1 word of b and then that group to be within 4 words of c? The syntax ""a b"~1" c"~4 doesn't seem to ...
    David Lee
    Jun 30, 2008 at 9:16 pm
    Jul 1, 2008 at 9:15 am
  • Hi, I am fairly new to Lucene and am currently going over its source code. I have read through the code a few times, mapping it and all, but I seem to be facing a problem. I could go all the way ...
    Blazingwolf7
    Jun 26, 2008 at 7:24 am
    Jul 1, 2008 at 6:34 am
  • Hi: I had some code to do indexReader pooling to avoid open and close on a large index when doing lotsa searches. So I had a FilteredIndexReader proxy that overrides the doClose method to do nothing, ...
    John Wang
    Jun 29, 2008 at 3:52 pm
    Jun 30, 2008 at 10:16 pm
  • Hello Lucene Gurus, I'm new to Lucene so sorry if this question is basic or naïve. I have a Document to which I want to add a Field named, say, "foo" that is tokenized, indexed and unstored. I am using ...
    Bill Chesky
    Jun 27, 2008 at 4:03 am
    Jun 30, 2008 at 3:09 pm
  • Hello All, Sort of new to Lucene but have a general question in regards to performance. I've got a single index of rather large size (about 7 million docs). I've run a couple of different queries ...
    Jordon Saardchit
    Jun 27, 2008 at 9:25 pm
    Jun 30, 2008 at 3:03 pm
  • What is the difference between these three modes of operating with lucene... And are there any other modes/ways of operation also, using which we can more effectively run applications with lucene. I ...
    Devashish
    Jun 30, 2008 at 8:58 am
    Jun 30, 2008 at 10:19 am
  • Hello, I have the following code to highlight a text: public String highlight(String text, String query ) throws IOException { TermQuery query = new TermQuery(new Term("f", query)); QueryScorer ...
    Jim
    Jun 26, 2008 at 7:50 am
    Jun 30, 2008 at 10:01 am
  • Hi all. IndexWriter.close() API states that :: "Flushes all changes to an index and closes all associated files.". What does "closes all associated files" mean, since we are apparently able to still ...
    Java_is_everything
    Jun 27, 2008 at 12:41 pm
    Jun 30, 2008 at 8:59 am
  • Hi List, I've been redirected from general@lucene.apache.org to here to discuss my issue. ---------- My original email ---------- I try to provide relevant results for the users of a lyrics site, ...
    László Monda
    Jun 18, 2008 at 2:06 pm
    Jun 29, 2008 at 1:44 pm
  • Hi, I'm looking for the correct way to create an index given the following restrictions: 1. The documents are received in batches of variable sizes (not more than 100 docs in a batch). 2. The batch ...
    Eran Sevi
    Jun 26, 2008 at 2:28 pm
    Jun 29, 2008 at 7:07 am
  • Dear all, Currently I am using the Lucene Java 2.3.2 demo to parse Microsoft 2003 and 2007 docs and PDF files. It is able to parse files with *.pdf, *.doc, *.xls etc. But it does not search in files of ...
    Kumar Gaurav
    Jun 27, 2008 at 11:08 am
    Jun 28, 2008 at 3:08 pm
  • Hello, I am currently keeping an index of all our client's usernames. The search functionality is implemented using a PrefixFilter. However, we would like to expand the functionality to be able to ...
    Mark Ferguson
    Jun 25, 2008 at 5:43 pm
    Jun 28, 2008 at 12:33 am
  • If I'm using a computer that has multiple cores, or if I want to use several computers to speed up the indexing process, how should I do that? Is there some kind of support for that in the API? David ...
    David Lee
    Jun 27, 2008 at 9:58 pm
    Jun 27, 2008 at 10:12 pm
  • I just implemented a sorting feature on our application where the user can change the sort on a query and reexecute the search. It works fine on text fields where most of the documents have different ...
    Robert Hastings
    Jun 27, 2008 at 3:45 pm
    Jun 27, 2008 at 7:17 pm
  • Hi, Is it possible to read a disk-based index into RAM (entirely) and have all searches operate on it there? I saw some RAMDirectory examples, but it didn't look like it will transfer a disk index ...
    Darren Govoni
    Jun 27, 2008 at 5:53 pm
    Jun 27, 2008 at 6:35 pm
  • Hi all. Is there a way to know "number-of-documents-that-will-be-flushed", just before giving a call to flush() method? I am currently using Lucene 2.2.0 API. Looking forward to replies. Ajay Garg -- ...
    Java_is_everything
    Jun 27, 2008 at 4:27 am
    Jun 27, 2008 at 2:58 pm
  • Hello, I have a stemmed index, but I want to search the exact form of a word. I use French Analyzer, so for instance "progression", "progresser" are indexed with the linguistic root "progress". But ...
    Renou oki
    Jun 25, 2008 at 8:16 am
    Jun 27, 2008 at 1:38 pm
  • Folks, Could anyone tell me the significance of the naming of the cfs files in the Lucene index, e.g. _1pp.cfs, _2kk.cfs etc. I have observed many differently named files being created temporarily ...
    Mick l
    Jun 27, 2008 at 9:49 am
    Jun 27, 2008 at 1:34 pm
  • Hi, what is the correct way to instruct the IndexWriter (or other classes?) to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to ...
    Alex Cheng
    Jun 26, 2008 at 12:40 am
    Jun 26, 2008 at 2:19 pm
  • Is there a class to do this?
    Jason Rutherglen
    Jun 26, 2008 at 1:09 pm
    Jun 26, 2008 at 1:09 pm
  • Hi, I'm using lucene to compute the score of some documents. For several reasons I need also to know the documents that don't match the input query. For example with score 0. I don't know the engine ...
    Paolo Valleri
    Jun 25, 2008 at 7:31 am
    Jun 26, 2008 at 9:37 am
  • Hello, I have the following code to highlight a text: public String highlight(String text, String query ) throws IOException { TermQuery query = new TermQuery(new Term("f", query)); QueryScorer ...
    Jim
    Jun 26, 2008 at 8:32 am
    Jun 26, 2008 at 8:32 am
  • Hi, I know that case-insensitive searching is normally done by creating an all-lower-case version of the documents, and turning the search terms into lower case whenever this field is searched, but ...
    John Byrne
    Jun 25, 2008 at 9:41 am
    Jun 26, 2008 at 8:20 am
  • Hi all. Is there a way to obtain the number of documents in the Lucene index (2.0.0), having a particular term indexed, much like what we do in a database ? Looking forward to a reply. Ajay Garg -- ...
    Java_is_everything
    Jun 26, 2008 at 5:10 am
    Jun 26, 2008 at 5:24 am
  • Hi, what is the correct way to instruct the IndexWriter to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to schedule future ...
    Alex Cheng
    Jun 26, 2008 at 12:22 am
    Jun 26, 2008 at 12:22 am
  • Hi, what is the correct way to instruct the IndexWriter to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to schedule future ...
    Alex Cheng
    Jun 26, 2008 at 12:16 am
    Jun 26, 2008 at 12:16 am
  • Hello people, yes, there were several threads about this topic, but I sadly have to respawn it, I'm sorry. The first I found was a discussion from May 2005: ...
    Christian Reuschling
    Jun 25, 2008 at 9:48 am
    Jun 25, 2008 at 10:01 pm
  • Folks, My users require wildcard searches. Sometimes their search phrases contain spaces. I am having trouble trying to implement a wildcard search on strings containing spaces, so if the term ...
    Mick l
    Jun 24, 2008 at 12:29 pm
    Jun 25, 2008 at 9:21 pm
  • Hi, I have 2 kinds of searches. One kind is like the Wikipedia suggestions and the other one is pretty classic. So does it make sense to have different indices for these 2 search styles? best, sascha ...
    Sascha Fahl
    Jun 25, 2008 at 1:51 pm
    Jun 25, 2008 at 2:01 pm
  • I have extended my evaluation (previous evaluation: http://zzzoot.blogspot.com/2008/06/simultaneous-threaded-query-lucene.html) to include as well as an increasing # of threads performing concurrent ...
    Glen Newton
    Jun 11, 2008 at 6:08 pm
    Jun 25, 2008 at 10:56 am
  • Hi All, I am new to Lucene Search. Can you let me know if it is possible to index the "Verity Spider" content? If possible, please let me know how to create an index from it and search on it. Also ...
    Yugana
    Jun 24, 2008 at 7:25 am
    Jun 25, 2008 at 10:55 am
  • Hi, I have around 10 different index files to request. Is it better to do this via one request to one MultiReader or is it better to request the 10 indices one after another? Especially for doing some ...
    Sascha Fahl
    Jun 23, 2008 at 12:28 pm
    Jun 25, 2008 at 9:12 am
  • Hi, I have a tags field. And each tag can have multiple words, like "San Francisco". Each tag is analyzed into Keyword field like this new Field("tags", "San Francisco",Field.Store.YES, ...
    Chris Lu
    Jun 25, 2008 at 12:15 am
    Jun 25, 2008 at 3:44 am
  • Hello there! I am trying to query for a specific document in an efficient way. My index is structured in a way where I have an id field which is a unique key for the whole index. When I'm ...
    Vinicius Carvalho
    Jun 20, 2008 at 4:13 pm
    Jun 24, 2008 at 11:57 pm
  • Hi, I want to customize a new Similarity class which needs to adopt payload information. The current definition of scorePayload is below: "public float scorePayload(String fieldName, byte [] payload, ...
    Wuqi
    Jun 24, 2008 at 4:22 pm
    Jun 24, 2008 at 7:30 pm
  • Hi: I am trying to add a couple more values to the TermInfo file and want to keep the index backward compatible. But I see values such as docFreq etc. are stored as a VInt, so I couldn't do things like ...
    John Wang
    Jun 24, 2008 at 7:00 pm
    Jun 24, 2008 at 7:00 pm
  • Jay dragon
    Jun 24, 2008 at 4:44 pm
    Jun 24, 2008 at 4:44 pm
  • Hello: I have a problem where I need to search for the term "C++". If I use StandardAnalyzer, the "+" characters are removed and the search is done on just the "c" character which is not what is ...
    Alex Soto
    Jun 24, 2008 at 3:49 pm
    Jun 24, 2008 at 4:40 pm
  • Hello: I have a problem where I need to search for the word "C++". If I use StandardAnalyzer, the "+" characters are removed and the search is done on just the "c" character which is not what is ...
    Alex Soto
    Jun 24, 2008 at 3:59 pm
    Jun 24, 2008 at 3:59 pm
  • Hi All, I am facing this error while indexing text files. Can anyone guide me on how to resolve this issue? ...
    Sebastin
    Jun 23, 2008 at 10:07 pm
    Jun 24, 2008 at 3:46 pm
  • Hello, I need to be able to select a random word out of all the words in my index. How can I do this through termDocs()? Also, I need to get a list of unique words as well. Is there a way to ask this to ...
    Cam Bazz
    Jun 23, 2008 at 10:03 pm
    Jun 24, 2008 at 2:43 pm
  • Hi All, I have created an index file and am indexing the content retrieved from a database. How can I search on this content? When indexed, 3 files, namely _0.cfs, segments.gen and segments_k, are created. ...
    Yugana
    Jun 24, 2008 at 9:18 am
    Jun 24, 2008 at 11:36 am
  • Hi, BoostingQuery is designed to demote the scores of documents when they match the undesired query by boosting/demoting the final score. The problem I see is this demoting factor is ...
    Jay dragon
    Jun 24, 2008 at 6:39 am
    Jun 24, 2008 at 7:05 am
  • How do you handle token payloads that represent multiple values? I simply don't do it even though there are cases where I would like to see it. I also find that my token filters that update payload ...
    Karl Wettin
    Jun 21, 2008 at 4:57 pm
    Jun 23, 2008 at 10:39 pm
  • Hello. I am trying to implement a search based on a search text in an index that contains Track Title, Album Name or Artist Name information that delivers a list of results that are suited for "auto ...
    Lukas Öesterreicher
    Jun 23, 2008 at 3:25 pm
    Jun 23, 2008 at 4:19 pm
Group Overview
group: java-user @
categories: lucene
period: Jun 2008
discussions: 102
posts: 444
users: 119
website: lucene.apache.org

119 users for June 2008

  • Erick Erickson: 33 posts
  • Grant Ingersoll: 25 posts
  • Otis Gospodnetic: 21 posts
  • Chris Hostetter: 18 posts
  • Michael McCandless: 15 posts
  • Anshum: 9 posts
  • Lutan: 9 posts
  • Glen Newton: 8 posts
  • Matthew Hall: 8 posts
  • Bill Chesky: 7 posts
  • Jason Rutherglen: 7 posts
  • John Byrne: 7 posts
  • Karl Wettin: 7 posts
  • Aditi Goyal: 6 posts
  • Allahbaksh Mohammedali Asadullah: 6 posts
  • Daniel Naber: 6 posts
  • Daniel Noll: 6 posts
  • László Monda: 6 posts
  • Sascha Fahl: 6 posts
  • Sebastin: 6 posts