
152 discussions - 692 posts

  • Hi, Lucene sorts hits by relevance by default. Because I would like to sort them by a special string field and not by relevance, I was thinking about dropping the default sorting by relevance and ...
    Jul 29, 2006 at 8:43 am
    Sep 26, 2006 at 4:36 pm
  • If I want to search for an email address (e.g. michael@foo.com), do I need to tokenize that field? doc.add(new Field("from", (String) itemContent.get("from"), Field.Store.YES, Field.Index.TOKENIZED)); ...
    Michael J. Prichard
    Jul 26, 2006 at 8:35 pm
    Aug 4, 2006 at 5:09 pm
  • Is it possible to modify a field that is stored but not indexed? For example, if I have a field like this: new Field("address", address, Field.Store.YES, Field.Index.NO) and I want to modify it like this: ...
    Jul 7, 2006 at 10:54 am
    Jul 14, 2006 at 11:10 am
  • I want to filter words with a dash in them. ["x-men"] ["xmen"] ["x", "men"] All of the above should be synonyms. The problem is ["x", "men"] requiring a distance between the terms and thus also ...
    Karl wettin
    Jul 24, 2006 at 1:53 am
    Aug 2, 2006 at 8:05 pm
  • Hi, I have a fairly large index of around 10 GB. The most used field is the "Name" field. Is there a way to put part of the index (maybe a specific field) into memory and keep the rest on disk? Thanks, ...
    Jul 17, 2006 at 4:23 pm
    Jul 19, 2006 at 8:17 pm
  • My question might be very easy for you Lucene experts. But after going through the Lucene documentation / example, I haven't been able to figure out how to solve this problem. I'll be really grateful ...
    Namit Yadav
    Jul 25, 2006 at 2:06 am
    Jul 26, 2006 at 10:50 pm
  • Hi, I have 10 fields in my index and some of the fields can be empty. I'd like to be able to do something like "IS NOT NULL" in SQL. For example: Name:Jane AND Addr IS NOT NULL AND Zip IS NOT NULL ...
    Dragon Fly
    Jul 18, 2006 at 2:20 pm
    Jul 20, 2006 at 12:51 pm
  • How do I do distance based searches with Lucene? I can do this in SQL with longitude/latitude values, but can this be done in Lucene? I can't do my full search in the database as I want the fast ...
    Giesen Giesen
    Jul 9, 2006 at 2:32 am
    Jul 12, 2006 at 7:25 am
  • Hello, sorry, didn't find the information elsewhere: 1.) Did the format of the lucene-index change between version 1.4.3 and 2.0? 2.) Is it possible to use the old Luke-Tool with a new lucene 2 ...
    Jul 18, 2006 at 6:53 pm
    Aug 25, 2006 at 8:22 am
  • Hello, the Lucene 2.0.0 StandardAnalyzer treats "_" (underscore) as a token. Is there a way I can make StandardAnalyzer not tokenize on "_" or any given characters? I'd like to keep all ...
    Ngo, Anh (ISS Southfield)
    Jul 21, 2006 at 2:16 pm
    Jul 21, 2006 at 9:42 pm
  • For the sake of date ranges, I'm storing dates as YYYYMMDD in my e-mail indexing application. My users typically want to limit their queries to ranges of dates, which include today. The application ...
    Rob Staveley (Tom)
    Jul 14, 2006 at 10:56 am
    Jul 20, 2006 at 7:33 pm
  • Chris Hostetter and Yonik's MissingStringLastComparator looks like a neat way to specify where to put null values when you want them to appear at the end of reverse sorts rather than at the ...
    Rob Staveley (Tom)
    Jul 14, 2006 at 5:13 pm
    Jul 15, 2006 at 7:03 pm
  • Hello, I am looking for a way to limit the number of search results I retrieve when searching. I am only interested in (let's say) the first ten hits of a query.. maybe I want to look at hits ...
    Jul 25, 2006 at 2:09 pm
    Jul 26, 2006 at 1:15 pm
  • I have a BooleanQuery that looks like this: BooleanQuery query = new BooleanQuery(); TermQuery term1 = new TermQuery(new Term(ID, "1234")); TermQuery term2 = new TermQuery(new Term(ID, "2344")); ...
    Van Nguyen
    Jul 6, 2006 at 7:53 pm
    Jul 22, 2006 at 8:16 am
  • Hi, I used MoreLikeThis (in the search.similar package) of Lucene to find similar documents, but the result is 0 documents even when I index the same document several times. I don't understand ...
    Jul 19, 2006 at 8:40 am
    Jul 21, 2006 at 1:03 pm
  • Hello all, I want to implement a drill-down function aka "narrow search" aka "refine search". I want to have something like: Refine by Date: * 1990-2000 (30 Docs) * 2001-2003 (200 Docs) * 2004-2006 (10 ...
    Martin Braun
    Jul 21, 2006 at 1:48 pm
    Sep 26, 2006 at 6:20 pm
  • I am curious about the potential use of document scoring as a means to extract additional data from an index. Specifically, I would like the score to be a count of how many times a particular field ...
    Russell M. Allen
    Jul 27, 2006 at 4:03 pm
    Aug 10, 2006 at 6:53 pm
  • So I have the following code... // let's get our SynonymAnalyzer SynonymAnalyzer synAnalyzer = getSynonymAnalyzer(); // let's get our EmailAnalyzer EmailAnalyzer emailAnalyzer = getEmailAnalyzer(); ...
    Michael J. Prichard
    Jul 29, 2006 at 7:51 pm
    Jul 31, 2006 at 1:27 pm
  • Would anyone give me a hint regarding the natural language expression of the following span query? ------------if creating queries programmatically (it is in Lucene scr) SpanTermQuery t1 = new ...
    Jul 24, 2006 at 2:31 am
    Jul 30, 2006 at 3:11 pm
  • I've been asked to do a project which provides full-text search for a large database of articles. The expectation is that most of the articles are fairly small (<2k bytes). There will be an initial ...
    Scott Smith
    Jul 6, 2006 at 5:48 pm
    Jul 8, 2006 at 2:01 am
  • My index contains approximately 5 million documents. During a search, I need to grab the value of a field for every document in the result set. I am currently using a HitCollector to search. Below ...
    Ryan O'Hara
    Jul 21, 2006 at 6:43 pm
    Aug 2, 2006 at 8:35 pm
  • I am working on indexing emails and have stored the date as milliseconds. I was thinking of using a filter w/ my search that would only return the email in that date range. I am currently indexing as ...
    Michael J. Prichard
    Jul 26, 2006 at 1:48 pm
    Jul 27, 2006 at 8:27 am
  • I'm guessing they're neither the guy from Cheers nor the sociology term ;-) The examples have you creating them before you do searches. What are they? The javadoc doesn't really explain their ...
    Furash Gary
    Jul 10, 2006 at 3:58 pm
    Jul 15, 2006 at 12:07 am
  • Hi experts, There seems to be a strange memory leak with the IndexSearcher. I get an OutOfMemoryException after a few iterations of the following loop: LOOP: ramdir = new RAMDirectory( ...
    Heng Mei
    Jul 5, 2006 at 11:58 pm
    Jul 6, 2006 at 8:49 am
  • Hi folks, I'm looking for a solution/best practices concerning Lucene and SQL database integration. The database (MySQL) is already developed and contains data. I've tried MySQL full-text search, but ...
    Alexander Mashtakov
    Jul 4, 2006 at 2:49 pm
    Jul 5, 2006 at 1:05 pm
  • Anyone know of good free email libraries I can use for lucene indexing for Windows Outlook Express and Unix emails?? suba suresh. ...
    Suba Suresh
    Jul 26, 2006 at 4:56 pm
    Jul 31, 2006 at 1:43 pm
  • I ran into this problem: when searching, I add documents to the index. Although I instantiate a new IndexSearcher, I can't retrieve the newly added documents. I have to close the program and enter the ...
    Hu andy
    Jul 27, 2006 at 9:49 am
    Jul 30, 2006 at 2:36 pm
  • I built an indexer that runs through email and its attachments, rips out content and what not and then creates a Document and adds it to an index. It works w/ no problem. The issue is that it takes ...
    Michael J. Prichard
    Jul 27, 2006 at 4:32 pm
    Jul 28, 2006 at 6:35 pm
  • Hi, I'm going to attempt to output several thousand documents from a 3+ million document collection into a csv file. What is the most efficient method of retrieving all the text from the fields of ...
    Jul 27, 2006 at 9:01 pm
    Jul 28, 2006 at 6:03 am
  • Hello What can I use as a drop in replacement? I mean, about the (String, String[], Analyzer) one. The 1.9.1 javadoc says to use QueryParser.parse, but I need to construct the query first. Any util ...
    Paulo Silveira
    Jul 25, 2006 at 5:22 am
    Jul 27, 2006 at 3:53 pm
  • I am indexing different document formats with Lucene 1.9. One of the PDF files I am indexing is 300 MB. Whenever the index writer hits that file it stops the indexing with an "Out of Memory" exception. I ...
    Suba Suresh
    Jul 13, 2006 at 1:55 pm
    Jul 26, 2006 at 5:15 pm
  • Hi all, my document's title field contains standalone (not contained inside a word) special characters such as &, :, %, ; etc. With the Luke 0.6 tool, I found that these chars are not indexed in the title field or ...
    Herbert Wu
    Jul 22, 2006 at 8:59 pm
    Jul 24, 2006 at 3:37 pm
  • I am using Lucene 2.0 and trying to use the MultiFieldQueryParser in my search. I want to limit my search to documents which have "silly" in "field1" ...within that subset of documents, I want ...
    Rod Madden
    Jul 16, 2006 at 6:26 pm
    Jul 17, 2006 at 1:23 pm
  • Hello, I am working on an application similar to google books which allows searching on documents which represent a scanned page. Of course, one might search for a phrase starting at the end of one ...
    Mile Rosu
    Jul 11, 2006 at 2:55 pm
    Jul 13, 2006 at 11:17 am
  • Hi, A new project that I am investigating lucene for needs the Parts of speech information for the tokens. I can get that information using NLP techniques (GATE etc.), by pre processing the documents ...
    Amit Kumar
    Jul 12, 2006 at 5:36 am
    Jul 12, 2006 at 4:51 pm
  • Hello, I'm pretty new to lucene so I hope my question is not stupid :) I'd like to index articles but I want them to be in a group. such as: article1, article2 and article3 are in the group1 article4 ...
    John john
    Jul 24, 2006 at 6:50 pm
    Jul 27, 2006 at 7:16 am
  • Hi, I am checking a txt file with entries against an index generated with Lucene. Of the enclosed Searcher.java class, I use the isInLex(String noun) method, i.e. I read every line of the txt file ...
    Pasquale Imbemba
    Jul 19, 2006 at 12:22 pm
    Jul 20, 2006 at 10:27 pm
  • Hi, can Lucene index a database? PostgreSQL, MySQL, Access? Thanks Cheers Teresa ...
    Jul 12, 2006 at 3:48 pm
    Jul 12, 2006 at 8:06 pm
  • Hi, I have the following situation: a servlet running in Tomcat 5. When the servlet starts up, it automatically creates an IndexReader and stores it in a static variable. For searching this variable is ...
    Dominik Bruhn
    Jul 12, 2006 at 4:49 pm
    Jul 12, 2006 at 7:41 pm
  • I plan to make lucene (and nutch) a key element in an intranet solution, but I only know about lucene what I've read in the last couple of days. Here's what I'd like opinions about. I would like to ...
    Tomi NA
    Jul 11, 2006 at 10:41 am
    Jul 12, 2006 at 3:22 pm
  • When a non-English word is used in a TermQuery, it always returns null. With other query types, I could pass in a language-specific analyzer, but with TermQuery I can't find any way to specify the ...
    Jul 9, 2006 at 2:49 am
    Jul 10, 2006 at 1:57 pm
  • All, for performance reasons we keep our index of over a million documents ordered alphabetically. This way, for an alpha sort we can just use the index order. This works very well, but I'm now looking ...
    Jason Calabrese
    Jul 6, 2006 at 12:08 am
    Jul 7, 2006 at 9:23 pm
  • When I start the program it's fast, about 10 docs per second, but after about 15,000 docs it slows down considerably. Now it does 1 doc per second and is at document 40,000 after a whole night of indexing. These ...
    Peter velthuis
    Jul 3, 2006 at 8:52 am
    Jul 3, 2006 at 10:57 am
  • Hello, I have a question about whether fields with empty content should be added to the document / index or not. I have an index schema, which defines all the fields a document can have. If one of the real ...
    Simon Willnauer
    Jul 31, 2006 at 12:23 pm
    Jul 31, 2006 at 6:41 pm
  • I was wondering if there was a nice way to add documents to a cached filter 'manually' as it were. The reason would be to avoid a complete refresh of the filter, if you already knew the docids of the ...
    Paul Waite
    Jul 26, 2006 at 9:05 pm
    Jul 31, 2006 at 2:54 pm
  • Ok, this might have been answered somewhere, but I can't find it so here goes: When I close my application containing index writers the lock files are left in the temp directory causing an "Lock ...
    Björn Ekengren
    Jul 26, 2006 at 2:24 pm
    Jul 27, 2006 at 7:25 am
  • Hi, I went through the IndexModifier class. It says that - Although an instance of this class can be used from more than one thread, you will not get the best performance. You might want to use ...
    Vasu shah
    Jul 25, 2006 at 1:11 pm
    Jul 25, 2006 at 4:45 pm
  • for example: $sql = "select count(*), user_group from groups where uid 0 group by user_group; can lucene query this result?
    James liu
    Jul 20, 2006 at 3:34 am
    Jul 20, 2006 at 6:03 am
  • Not sure if anyone out is doing this, thought about doing this or is just plain curious. I want to figure out a way to build a search/rule gui's whereas the user can build searches much like building ...
    Michael Prichard
    Jul 18, 2006 at 3:00 am
    Jul 18, 2006 at 2:45 pm
  • How to parse this kind of query? COM(2006) 0001
    Jul 11, 2006 at 10:21 am
    Jul 11, 2006 at 2:53 pm
Group Overview
group: java-user

152 users for July 2006

  • Chris Hostetter: 51 posts
  • Erick Erickson: 49 posts
  • Rob Staveley (Tom): 26 posts
  • Yonik Seeley: 26 posts
  • Otis Gospodnetic: 25 posts
  • Michael Prichard: 24 posts
  • Karl wettin: 23 posts
  • Doron Cohen: 22 posts
  • Mark Miller: 18 posts
  • Erik Hatcher: 14 posts
  • Neils: 14 posts
  • Dan2000: 12 posts
  • Miles Barr: 10 posts
  • Mark harwood: 9 posts
  • Suba Suresh: 9 posts
  • James liu: 8 posts
  • Martin Braun: 8 posts
  • Michael McCandless: 8 posts
  • Furash Gary: 7 posts
  • Mike Streeton: 7 posts