102 discussions - 444 posts

  • Hi, I am quite new to the Lucene scene and I need your help :-) There are several document classes. Let's say documents from class A, B, C, D and E. What I need is the following: 1) I want to search ...
    Sascha Fahl
    Jun 8, 2008 at 11:20 am
    Jun 9, 2008 at 7:58 am
  • Hi, Is it possible to read a disk-based index into RAM (entirely) and have all searches operate on it there? I saw some RAMDirectory examples, but it didn't look like it will transfer a disk index ...
    Darren Govoni
    Jun 27, 2008 at 5:53 pm
    Jun 27, 2008 at 6:35 pm
  • Hi, what is the correct way to instruct the IndexWriter (or other classes?) to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to ...
    Alex Cheng
    Jun 26, 2008 at 12:40 am
    Jun 26, 2008 at 2:19 pm
  • Hello, I need to be able to select a random word out of all the words in my index. How can I do this through termDocs()? Also, I need to get a list of unique words as well. Is there a way to ask this to ...
    Cam Bazz
    Jun 23, 2008 at 10:03 pm
    Jun 24, 2008 at 2:43 pm
  • Hi All, I have created an index file and indexing the content retrieved from a database. How can I search on this content? When indexed 3 files namely _0.cfs, segments.gen and segments_k are created. ...
    Jun 24, 2008 at 9:18 am
    Jun 24, 2008 at 11:36 am
  • Hi, I am new to Lucene. I have several text files I would like to index and search. How do I do this? Thanks, jnance -- View this message in context: ...
    Jun 20, 2008 at 2:58 pm
    Jun 23, 2008 at 1:24 pm
  • Hello list, I need to generate a report with all the terms, the document ids where they appear and the score in each document. My current approach is to get a Term enumeration from the index and ...
    Gerardo Segura
    Jun 19, 2008 at 4:07 pm
    Jun 20, 2008 at 2:59 pm
  • Hi, I need to download this file, NGramSpeller.java. More information about this file is here: http://www.marine-geo.org/services/oai/docs/javadoc/org/apache/lucene/spell/NGramSpeller.html but ...
    Jun 19, 2008 at 10:35 pm
    Jun 20, 2008 at 2:33 am
  • Hi all Currently I'm using the search method returning the Hits object. According to http://wiki.apache.org/lucene-java/ImproveSearchingSpeed one should use a HitCollector-oriented search method ...
    Konstantyn Smirnov
    Jun 2, 2008 at 3:44 pm
    Jun 17, 2008 at 10:00 am
  • Dear all, I would like to seek your suggestion on re-ranking methodology. My problem is that I have a set of resulting documents to a query and each one of them with a matching score and also a list ...
    Sengly Heng
    Jun 13, 2008 at 3:48 pm
    Jun 15, 2008 at 6:16 am
  • Hi all, I've worked just a little with Lucene and now I need some things done, but I can't find answers in the FAQ or how-to. My index contains product titles like Nikon d300, nikon d200, nikon d3 ...
    Jun 13, 2008 at 3:00 pm
    Jun 13, 2008 at 3:53 pm
  • Hello, When you look at the fields of a document with Luke, there is a norm column. I have not been able to figure out what that is. The reason I am asking is that I am trying to build a uniqueness ...
    Cam Bazz
    Jun 11, 2008 at 2:05 pm
    Jun 11, 2008 at 2:51 pm
  • Each of my filters represents a single boosting term query. But when using the filter instead of the boosting term query I lose the score (not sure this is true) and payload boost (if any), both ...
    Karl Wettin
    Jun 10, 2008 at 11:42 pm
    Jun 11, 2008 at 1:50 pm
  • I am interested in changing the search configuration for grails domain objects on a running grails instance. Near as I can tell, I need to reload the mappings (Domain.cpm.xml) and then reindex ...
    Jun 9, 2008 at 9:28 pm
    Jun 10, 2008 at 7:55 am
  • I have a design where I will be using multiple index shards to hold approx 7.5 million documents per index per month over many years. These will be large static R/O indexes but the corresponding ...
    Antony Bowesman
    Jun 9, 2008 at 2:47 pm
    Jun 10, 2008 at 5:48 am
  • I would like to be able to get multi-language support within a single index. I would appreciate input on what I am suggesting: Assuming that you want something like the following in your document: ...
    Glen Newton
    Jun 5, 2008 at 4:15 pm
    Jun 5, 2008 at 5:07 pm
  • Hello there! I'm indexing documents using the BrazilianAnalyzer, and I've noticed that many words are not being indexed. I store and index the entire doc (I'm doing this in order to present the ...
    Vinicius Carvalho
    Jun 3, 2008 at 7:51 pm
    Jun 4, 2008 at 12:59 pm
  • Hi, I am new to Lucene, so asking some basic question. Is there any example/reference implementation available of Lucene Usage using BooleanQuery using API instead of QueryParser? Cheers Aamir Yaseen ...
    Aamir Yaseen
    Jun 2, 2008 at 3:34 pm
    Jun 3, 2008 at 10:04 am
  • Hi All, I am using Lucene-core-2.3.2. One of the fields that I have indexed with Lucene contains a single character value which stands for a code. When I make queries using a StandardAnalyzer lucene ...
    Ryan catambing
    Jun 2, 2008 at 3:47 pm
    Jun 3, 2008 at 7:29 am
  • Is it possible to do nested proximity searches with lucene? i.e. can I say I want a to be within 1 word of b and then that group to be within 4 words of c? The syntax ""a b"~1" c"~4 doesn't seem to ...
    David Lee
    Jun 30, 2008 at 9:16 pm
    Jul 1, 2008 at 9:15 am
  • Hello All, Sort of new to Lucene but have a general question in regards to performance. I've got a single index of rather large size (about 7 million docs). I've run a couple of different queries ...
    Jordon Saardchit
    Jun 27, 2008 at 9:25 pm
    Jun 30, 2008 at 3:03 pm
  • What is the difference between these three modes of operating with lucene... And are there any other modes/ways of operation also, using which we can more effectively run applications with lucene. I ...
    Jun 30, 2008 at 8:58 am
    Jun 30, 2008 at 10:19 am
  • If I'm using a computer that has multiple cores, or if I want to use several computers to speed up the indexing process, how should I do that? Is there some kind of support for that in the API? David ...
    David Lee
    Jun 27, 2008 at 9:58 pm
    Jun 27, 2008 at 10:12 pm
  • Hi all. Is there a way to obtain the number of documents in the Lucene index (2.0.0), having a particular term indexed, much like what we do in a database ? Looking forward to a reply. Ajay Garg -- ...
    Jun 26, 2008 at 5:10 am
    Jun 26, 2008 at 5:24 am
  • Hi, I have 2 kinds of searches. One kind is like the Wikipedia suggestions and the other one is pretty classic. So does it make sense to have different indices for these 2 search styles? best, sascha ...
    Sascha Fahl
    Jun 25, 2008 at 1:51 pm
    Jun 25, 2008 at 2:01 pm
  • Hi, I have around 10 different index files to request. Is it better to do this via one request to one MultiReader or is it better to request the 10 indices one after another? Especially for doing some ...
    Sascha Fahl
    Jun 23, 2008 at 12:28 pm
    Jun 25, 2008 at 9:12 am
  • Hi, I want to customize a new Similarity class which needs to adopt payload information. The current definition of scorePayload is below: "public float scorePayload(String fieldName, byte [] payload, ...
    Jun 24, 2008 at 4:22 pm
    Jun 24, 2008 at 7:30 pm
  • Hi, BoostingQuery is designed to demote the scores of documents when they match the undesired query by boosting/demoting the final score. The problem I see is this demoting factor is ...
    Jay dragon
    Jun 24, 2008 at 6:39 am
    Jun 24, 2008 at 7:05 am
  • How do you handle token payload that represent multiple values? I simply don't do it even though there are cases where I would like to see it. I also find that my token filters that update payload ...
    Karl Wettin
    Jun 21, 2008 at 4:57 pm
    Jun 23, 2008 at 10:39 pm
  • Hello. I am trying to implement a search based on a search text in an index that contains Track Title, Album Name or Artist Name information that delivers a list or results that are suited for "auto ...
    Lukas Öesterreicher
    Jun 23, 2008 at 3:25 pm
    Jun 23, 2008 at 4:19 pm
  • Dear Fellow Java/Lucene developers: I want to know if there is a way to improve the efficiency of doing a search using lucene such that when a user does a search, and should there be hundreds of ...
    Jun 19, 2008 at 3:00 am
    Jun 19, 2008 at 10:43 am
  • Hi All, I need to fetch approximately 225 GB of Index Store records in a web page. The total time to fetch the records and display them to the user is 10 minutes. Is it possible to reduce the time to ...
    Jun 18, 2008 at 2:17 pm
    Jun 18, 2008 at 2:51 pm
  • So I'm using Snowball Analyzer on a field for business titles. The value "Charlie's Sandwich Shoppe" becomes "charli sandwich shopp". This happens partly because the StandardAnalyzer strips off the ...
    Max Metral
    Jun 17, 2008 at 9:17 pm
    Jun 18, 2008 at 1:46 pm
  • Hi all, I was wondering why only the Field constructor which accepts a String offers Store and Index options? I understand there might be no logic in offering them for the TokenStream constructor, ...
    Itamar Syn-Hershko
    Jun 9, 2008 at 8:53 pm
    Jun 17, 2008 at 12:27 am
  • Hi, I have a small problem regarding QueryParser and WildcardQueries. Basically, I'm indexing documents like this: doc.add(new Field("title", "_FooBar", Field.Store.YES, Field.Index.TOKENIZED)); ...
    Felix Schwarz
    Jun 16, 2008 at 11:52 am
    Jun 16, 2008 at 1:23 pm
  • Hi, I'm trying to build my own query. I want to combine several TermQuery + 1 PrefixQuery in a BooleanQuery. The Code looks like this: BooleanQuery bq = new BooleanQuery(); Term t = new Term("field", ...
    Sascha Fahl
    Jun 16, 2008 at 8:26 am
    Jun 16, 2008 at 1:12 pm
  • Hi, We had a memory leak issue when using DistanceSortSource of LocalLucene for repeated query/search. After about 450 queries, we are experiencing an out of memory error. After digging into the code, we found ...
    Ethan Tao
    Jun 10, 2008 at 10:35 am
    Jun 11, 2008 at 5:53 am
  • Hello, In performing a multi-field query, is there a way to determine which field(s) the query matched in short of running a query against each of the fields independently? Could payloads be utilized ...
    Michael Garski
    Jun 3, 2008 at 11:28 pm
    Jun 4, 2008 at 4:55 pm
  • Is there a class to do this?
    Jason Rutherglen
    Jun 26, 2008 at 1:09 pm
    Jun 26, 2008 at 1:09 pm
  • Hello, I have the following code to highlight a text: public String highlight(String text, String query ) throws IOException { TermQuery query = new TermQuery(new Term("f", query)); QueryScorer ...
    Jun 26, 2008 at 8:32 am
    Jun 26, 2008 at 8:32 am
  • Hi, what is the correct way to instruct the IndexWriter to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to schedule future ...
    Alex Cheng
    Jun 26, 2008 at 12:22 am
    Jun 26, 2008 at 12:22 am
  • Hi, what is the correct way to instruct the IndexWriter to delete old commit points after N minutes? I tried to write a customized IndexDeletionPolicy that uses the parameters to schedule future ...
    Alex Cheng
    Jun 26, 2008 at 12:16 am
    Jun 26, 2008 at 12:16 am
  • Hi: I am trying to add couple more values to the TermInfo file and want to keep the index backward compatible. But I see values such as docFreq etc. are stored as a VInt, so I couldn't do things like ...
    John Wang
    Jun 24, 2008 at 7:00 pm
    Jun 24, 2008 at 7:00 pm
  • Jay dragon
    Jun 24, 2008 at 4:44 pm
    Jun 24, 2008 at 4:44 pm
  • Hello: I have a problem where I need to search for the word "C++". If I use StandardAnalyzer, the "+" characters are removed and the search is done on just the "c" character which is not what is ...
    Alex Soto
    Jun 24, 2008 at 3:59 pm
    Jun 24, 2008 at 3:59 pm
  • I've tried everything I can think of and I still can't unsubscribe from java-user@lucene.apache.org . None of my unsubscribe or emails to java-user-unsubscribe or java-user-help seem to do anything. ...
    William Thimbleby
    Jun 23, 2008 at 11:43 am
    Jun 23, 2008 at 11:43 am
  • Is there any way I can find an example of a program using NGramSpeller.java? -- View this message in context: http://www.nabble.com/Example-using-NGramSpeller.java-tp18034945p18034945.html Sent from the ...
    Jun 20, 2008 at 6:21 pm
    Jun 20, 2008 at 6:21 pm
  • I am not here to waste anyone's time and don't believe my post violates the terms of use. If the admins wish to remove my posts I understand. I am recruiting for a Search Engineer (Lucene), this ...
    Jun 17, 2008 at 11:07 pm
    Jun 17, 2008 at 11:07 pm
  • Manu Konchady's book on building search applications is out: Konchady, Manu. 2008. Building Search Applications: Lucene, LingPipe, and Gate. Mustru Publishing. It's available from Amazon: ...
    Bob Carpenter
    Jun 12, 2008 at 9:24 pm
    Jun 12, 2008 at 9:24 pm
  • Dear all, To improve the search, I will have to do keyword expansion. I am looking for a library that would help me to get the list of synonym of a term with some similarity score. Is there any lib ...
    Sengly Heng
    Jun 11, 2008 at 5:18 pm
    Jun 11, 2008 at 5:18 pm
Group Overview
group: java-user @

119 users for June 2008

  • Erick Erickson: 33 posts
  • Grant Ingersoll: 25 posts
  • Otis Gospodnetic: 21 posts
  • Chris Hostetter: 18 posts
  • Michael McCandless: 15 posts
  • Anshum: 9 posts
  • Lutan: 9 posts
  • Glen Newton: 8 posts
  • Matthew Hall: 8 posts
  • Bill Chesky: 7 posts
  • Jason Rutherglen: 7 posts
  • John Byrne: 7 posts
  • Karl Wettin: 7 posts
  • Aditi Goyal: 6 posts
  • Allahbaksh Mohammedali Asadullah: 6 posts
  • Daniel Naber: 6 posts
  • Daniel Noll: 6 posts
  • László Monda: 6 posts
  • Sascha Fahl: 6 posts
  • Sebastin: 6 posts