128 discussions - 623 posts

  • Hi all, Is there any guarantee that the maxDoc returned by a reader will be about the total number of indexed documents? The motivation for this question is that I want to associate some info with each ...
    Carlos Pita
    May 24, 2007 at 4:41 pm
    May 29, 2007 at 1:19 pm
  • Hi all, consider the following index: field1 field2 field3 text1 text1 text2 text3 text4 text4 text2 text2 text3 text5 I want to get all terms in field3. If I use Reader.terms() it will return: (however I ...
    Mohammad Norouzi
    May 22, 2007 at 9:29 am
    May 29, 2007 at 5:05 pm
  • I seem to be having problems using a * in a phrase term query. This is my search string; it's not finding any matches: 54:"MusicIP PUID*" If I match on a particular record it works OK: 54:"MusicIP ...
    Paul Taylor
    May 12, 2007 at 5:03 pm
    May 18, 2007 at 10:53 am
  • Hello all, I need to index a table containing company details (name, address, city ... country). Each record contains data written in the language appropriate to the record's country. I was thinking ...
    May 7, 2007 at 8:02 am
    May 8, 2007 at 3:15 pm
  • I am currently exploring how to solve performance problems I encounter with Lucene document reads. We have amongst other fields one field (default) storing all searchable fields. This field can ...
    Andreas Guther
    May 17, 2007 at 6:10 am
    May 21, 2007 at 12:50 am
  • Hi everyone, I have an application that indexes/searches xml documents using Lucene. I'm having a problem with what looks like a memory leak, which occurs when indexing a large number of documents, ...
    Stephen Gray
    May 15, 2007 at 6:31 am
    May 21, 2007 at 12:23 am
  • Our application includes an indexing server that writes to multiple indexes in parallel (each thread writes to a single index). In order to avoid an OutOfMemoryError, each request to index a document ...
    David m
    May 3, 2007 at 7:55 pm
    May 7, 2007 at 5:35 pm
  • Hi, Does Lucene search FSDirectory as well as buffered in-memory docs while we are calling searcher.search(query)? The reason I'm asking is, I've indexed my doc with mergeFactor & Max.Buff.Docs = 50 ...
    SK R
    May 28, 2007 at 12:20 pm
    May 29, 2007 at 11:23 am
  • Hello, I tried org.apache.lucene.analysis.fr.FrenchAnalyzer and I got strange search results on strings in uppercase. (example : VEHICLE) When I search the string (in lower case), I get no result. I ...
    May 21, 2007 at 9:30 am
    May 28, 2007 at 4:50 pm
  • Hello Ard, What you are after is a higher mergeFactor and probably also a higher maxBufferedDocs. Is indexing performance the concern? Don't go crazy with setting a super high (e.g. 100+) ...
    Otis Gospodnetic
    May 25, 2007 at 3:43 pm
    Jun 6, 2007 at 11:24 pm
  • Hi, I have the following problem. I'm indexing documents that belong to some collection (ie. the dataset is divided into collections, which are divided into documents). These documents become my ...
    Peter Bloem
    May 19, 2007 at 3:26 pm
    May 20, 2007 at 7:28 pm
  • I moved today from Lucene 2.0 to 2.1 and I noticed that the IndexReader.isCurrent() call is very expensive. What took 20 milliseconds in 2.0 now takes seconds in 2.1. I have the following scenario: - ...
    Andreas Guther
    May 11, 2007 at 4:45 pm
    May 12, 2007 at 6:07 pm
  • I don't understand why I'm getting the results I'm getting. If I search for "pandock*" I get 6 results Np-pandock Np-pandock-L Np-pandock-1 Np-pandock-2 Np-pandock Np-pandock-L1 If I search for ...
    John Powers
    May 8, 2007 at 5:47 pm
    May 9, 2007 at 9:33 pm
  • Hello everyone, I have created a Lucene index of a students database; this database has 5 fields, i.e. Name, Address, Class, PhoneNo and ScholarNo. Now I have opened a Searcher and query "Name:Menaria" , ...
    Laxmilal Menaria
    May 30, 2007 at 7:01 am
    May 30, 2007 at 1:28 pm
  • Hopefully I'm not opening myself up to public ridicule with what may be a very stupid question, but... At the moment, I'm trying to wrap my head around some of the math that happens when Lucene does ...
    Walt Stoneburner
    May 30, 2007 at 8:45 pm
    Jun 1, 2007 at 12:59 am
  • I have looked around on Lucene web site as well as some documentation but have not found anything to do with Concept Search. My definition of Concept Search is as follows: 1. I would have a file ...
    Charles Patridge
    May 16, 2007 at 12:20 am
    May 16, 2007 at 11:54 pm
  • Hi, Can anybody point me to some references on how to create an ideal set of stop words? I know that this is more like a theoretical question but how do Luceners determine which words should be excluded ...
    Lukas Vlcek
    May 10, 2007 at 6:40 pm
    May 11, 2007 at 11:40 am
  • I have a question about empty fields. I want to run a query that will search against a few particular fields for the query term but then also check to see if two other fields have any value at ...
    Les Fletcher
    May 10, 2007 at 10:35 am
    May 11, 2007 at 12:02 am
  • Hello all, I am new to lucene and want to use the FuzzyLikeThisQuery. I have read the documentation for this class, and read the following for what maxNumTerms means: "maxNumTerms - The total number ...
    May 9, 2007 at 3:46 pm
    May 10, 2007 at 5:54 am
  • Hello everyone! I'm wondering if any of you have any helpful advice on what MergeFactor I should use... The indexing process is handling a large amount of documents and I would like to index as fast ...
    Aleksander M. Stensby
    May 3, 2007 at 9:48 am
    May 3, 2007 at 11:35 pm
  • Is there a way of boosting only fragment of the field? Let's say that I have a title and short description of something which I want to index into "myfield" field - is there a way of boosting title ...
    Wojtek hury
    May 31, 2007 at 6:44 pm
    May 31, 2007 at 10:18 pm
  • In a j2ee webapp we have a search object that stores a user's search preferences (items/page, detail level, etc). it has a search() that calls a static method getSearcher() that returns a static ...
    John Powers
    May 28, 2007 at 1:19 am
    May 29, 2007 at 5:37 pm
  • Hi, If a search returns a document that has multiple fields with the same name, is there a way to filter only those fields that contain hits? Background: I am indexing documents and we store all ...
    Andreas Guther
    May 23, 2007 at 6:02 pm
    May 26, 2007 at 5:24 pm
  • I've built a Lucene system that gets rapidly updated - documents are supposed to be searchable immediately after they've been indexed. As such I have a Writer that puts new index, update and delete ...
    Simon Wistow
    May 24, 2007 at 9:22 am
    May 26, 2007 at 4:15 pm
  • Hi, In nutch we have a use case in which we need to store tokens with their original text plus their stemmed form plus their canonical form(through some asciifization). From my understanding of ...
    Enis Soztutar
    May 25, 2007 at 2:44 pm
    May 25, 2007 at 8:12 pm
  • Hi folks, I need to collect some global information from my first 1000 search results in order to build up some search refining components containing only relevant values (those which correspond to ...
    Carlos Pita
    May 24, 2007 at 3:31 am
    May 24, 2007 at 7:29 pm
  • Hi, We have been using Lucene for years and it serves us well. Sometimes when we issue a query, we only want to know how many hits it produces and do not want any docs back. Is it possible to completely avoid ...
    Zhang, Lisheng
    May 23, 2007 at 4:46 pm
    May 24, 2007 at 4:41 pm
  • Hi, I indexed emails, and now I want to restrict the search functionality for users so that each user can only search for emails to/from themselves. I know the email address of the user so my plan is to do it in the ...
    May 24, 2007 at 12:35 pm
    May 24, 2007 at 1:38 pm
  • I'm constructing a search with some required terms and some optional terms in the query. According to some earlier posts that looks like "+(A B) C D E" in query syntax for required terms A and B ...
    Peter Bloem
    May 21, 2007 at 12:38 am
    May 22, 2007 at 4:00 pm
  • Hi there, I have started using Lucene not long ago, with plans to replace my current sql queries in my application with it. As I wasn't aware of Lucene before, I have implemented some similar tools ...
    May 21, 2007 at 8:05 pm
    May 22, 2007 at 5:41 am
  • Hello! I am new to Lucene, so forgive me if my question is basic. I did try googling for an answer... For an ajax autocomplete widget, I am querying using Lucene. I only want to return, for example, ...
    David Leangen
    May 14, 2007 at 7:29 am
    May 18, 2007 at 7:26 am
  • I really want to use document numbers as a secondary key in my object storage. If I got it all right, the main problem is deleted documents and optimization. Are there any other issues? All my tests ...
    Karl wettin
    May 10, 2007 at 6:22 pm
    May 11, 2007 at 4:34 pm
  • Hi all, I am new to Lucene. I am developing a small search utility using lucene. I have to create the index from my Oracle database. Can anybody tell me how to create the index from Oracle using ...
    Krishna Prasad Mekala
    May 11, 2007 at 10:33 am
    May 24, 2008 at 1:47 am
  • Hello! I have a Document with two fields: one I would like to write with SimpleAnalyzer, and for the other I want to use StandardAnalyzer. Is there a simple way to do it? thanks -- Paulo E. A. Silveira ...
    Paulo Silveira
    May 25, 2007 at 7:33 am
    Jun 19, 2007 at 2:06 pm
  • Hi, I am not sure, so I need your opinion on these 2 questions: Is it safe to search an index while it's being optimized by another Java process? Is it safe to add documents to an index while it's ...
    May 29, 2007 at 3:10 pm
    Jun 5, 2007 at 2:25 pm
  • I've got a field that is indexing people names. The field is multivalued and I'm using Solr with a positionIncrementGap of 100. I've found that trying to specify a near query using something like: ...
    Daniel Einspanjer
    May 29, 2007 at 3:38 pm
    May 31, 2007 at 12:31 am
  • Hi, I'm trying to figure what I need to do with Lucene to score a document higher when it has a larger number of unique search terms that are hit, rather than term frequency counts. A quick example. ...
    Walt Stoneburner
    May 24, 2007 at 3:22 pm
    May 25, 2007 at 7:57 pm
  • Hi All, Thanks for your response. I have one more question. How can I update an index once it has been created from Oracle, instead of recreating the whole thing? Whenever there is a change in the Oracle table ...
    Krishna Prasad Mekala
    May 14, 2007 at 1:00 pm
    May 23, 2007 at 4:17 am
  • Hello, I have been using a large, in memory MultiSearcher that is reaching the limits of my hardware RAM with this code: try { IndexSearcher[] searcher_a= { new IndexSearcher(new ...
    Peter W.
    May 21, 2007 at 9:39 pm
    May 22, 2007 at 11:34 pm
  • We are a startup company based in the city of Sheffield, UK actively seeking experienced java programmers to develop intelligent web mining systems using Apache Lucene/Nutch. Experience of genetic ...
    May 11, 2007 at 10:45 am
    May 12, 2007 at 3:47 am
  • I have a use case, in which I need to select the Analyzer based on a Locale. For example: "nl" = DutchAnalyzer "nl_BE" = DutchAnalyzer "fr" = FrenchAnalyzer "foobar" = StandardAnalyzer (fallback) I ...
    Geoffrey De Smet
    May 8, 2007 at 1:20 pm
    May 9, 2007 at 6:01 pm
  • Hi Mark, Do you know of a good paid product that does this? Thanks, Arsen ----- Original Message ---- From: Mark Miller <markrmiller@gmail.com To: java-user@lucene.apache.org Sent: Wednesday, May 2, ...
    May 7, 2007 at 2:58 am
    May 8, 2007 at 9:24 pm
  • I'm trying to build a custom MoreLikeThis implementation that will run within solr and I've run into a few API hurdles... 1. Can MLT.java be modified to optionally take the Similarity implementation ...
    Ryan McKinley
    May 30, 2007 at 6:46 am
    May 30, 2007 at 6:35 pm
  • Hello, Currently we are attempting to optimize the search time against an index that is 26 GB in size (~35 million docs) and I was wondering what experiences others have had in similar attempts. ...
    Scott Sellman
    May 24, 2007 at 5:32 pm
    May 25, 2007 at 5:28 am
  • Hello, My application works with PDF files, so I use Lucene with PdfBox to create a little search engine. I am new to Lucene. All seemed to work fine but after some tests I saw that some ...
    Stefan Colella
    May 18, 2007 at 12:12 pm
    May 23, 2007 at 6:19 am
  • The query syntax reference page talks about the NOT and the - operators, but it wasn't clear to me what exactly the difference is between them. Could someone tell me briefly what that difference ...
    Daniel Einspanjer
    May 6, 2007 at 3:05 am
    May 8, 2007 at 11:15 am
  • Looking over the implementation of SpanNearQuery I came upon what looked like a bug. Below is a test which fails due to it. SpanNearQuery doesn't return all matching spans; once it's found a span it ...
    Moti Nisenson
    May 6, 2007 at 2:12 pm
    May 7, 2007 at 7:08 pm
  • Does anyone know how to fix the .cfs file name in an index directory? The deletable and segments file names are always the same, but we have observed that the .cfs file name changes each time you ...
    Shaw, James
    May 3, 2007 at 11:06 pm
    May 5, 2007 at 12:59 am
  • Hi, I am a little confused, probably, because I missed some detail when looking through the code of lucene 2.1. Scenario: Deleting documents works for a while, eventually, I get the exception that ...
    Martin Kobele
    May 30, 2007 at 3:33 pm
    May 30, 2007 at 6:02 pm
  • Hi, I use Lucene in my project and it works well. Now I would like the search results presented to the user to include the number of times the keyword matches in a document. Has someone done this before, or is ...
    Anny Bridge
    May 28, 2007 at 2:31 am
    May 28, 2007 at 6:54 am
Group Overview
group: java-user @

143 users for May 2007

  • Erick Erickson: 69 posts
  • Chris Hostetter: 29 posts
  • Mark Miller: 26 posts
  • Bhecht: 18 posts
  • Doron Cohen: 17 posts
  • Grant Ingersoll: 17 posts
  • Otis Gospodnetic: 16 posts
  • Karl wettin: 15 posts
  • Yonik Seeley: 15 posts
  • Carlos Pita: 14 posts
  • Michael McCandless: 13 posts
  • Mohammad Norouzi: 13 posts
  • Paul Taylor: 13 posts
  • Daniel Einspanjer: 12 posts
  • Steven Rowe: 11 posts
  • Andreas Guther: 10 posts
  • John Powers: 9 posts
  • Jolinar13: 8 posts
  • Laxmilal Menaria: 8 posts
  • Paul Elschot: 8 posts