132 discussions - 541 posts

  • Hi, maybe this question is trivial, but I need to ask it. I have a problem indexing a large number of documents, and I am looking for a better solution. The task is to index about 33GB of CSV text data (each ...
    Java Programmer
    Feb 17, 2006 at 10:53 am
    Feb 21, 2006 at 8:28 pm
  • My primary sort is by relevance score and my secondary sort is by date. The Hits.getScore() method returns the score to 7 digits to the right of the decimal point. Therefore, if I round to only 2 ...
    Daniel Clark
    Feb 1, 2006 at 12:24 pm
    Mar 1, 2007 at 1:30 pm
  • Hi, I'm wondering if anyone has tested Lucene indexing/search performance with different file system block sizes? I just realized one of the servers where I run a lot of Lucene indexing and searching ...
    Otis Gospodnetic
    Feb 10, 2006 at 9:55 pm
    Feb 14, 2006 at 5:02 am
  • Hi. I've got an unusual (if not crazy) question about implementing custom queries. Basically we have a UI where a user can enter a query and then select a bunch of filters to be applied to the query. ...
    Daniel Noll
    Feb 7, 2006 at 5:59 am
    Feb 10, 2006 at 2:11 am
  • I've been wrestling with a way to index and search data with a geo-positional aspect. By a geo-positional search, I want to constrain search results within a given location range. Furthermore, I want ...
    Jeff Rodenburg
    Feb 28, 2006 at 8:11 pm
    Mar 1, 2006 at 11:49 pm
  • I'm trying to delete a large number of documents (~15 million) from a large index (30+ million documents). I've started with an optimized index, and a list of docIds (our own unique identifier for a ...
    Greg Gershman
    Feb 13, 2006 at 2:47 pm
    Feb 15, 2006 at 3:38 pm
  • I've been trying to use ant to rebuild lucene after toying with the source. I am getting an error message I don't understand (though I am admittedly new to ant). The error is below. Any help is ...
    Michael Dodson
    Feb 18, 2006 at 11:33 pm
    Feb 21, 2006 at 12:00 am
  • Hi all, I just wanted to know how to increase the speed of indexing files. I tried a multithreading approach but couldn't get much better performance. It was the same as it is in usual ...
    Revati joshi
    Feb 24, 2006 at 5:42 pm
    Jul 25, 2006 at 7:08 am
  • Hi to all! I added some documents to my index repository this way: Document document = new Document(); document.add(Field.Keyword("id", id)); document.add(Field.Keyword("type", type)); ...
    Samuru Jackson
    Feb 26, 2006 at 10:38 am
    Feb 28, 2006 at 2:41 pm
  • Yes. We have the same problem. It is mainly because of TermInfosReader.java, which takes memory space to keep *.tii. Eugene -----Original Message----- From: Leon Chaddock Sent: Tuesday, February 14, 2006 ...
    Eugene Tuan
    Feb 14, 2006 at 6:36 pm
    Feb 15, 2006 at 8:55 pm
  • Hi, I am trying to suggest refine searches for my Lucene search. For example, if a search turned out too many searches, it would list a number of document title subsequences that occurred frequently ...
    Chun Wei Ho
    Feb 13, 2006 at 9:35 am
    Feb 14, 2006 at 4:44 pm
  • Hey Everyone, I'm running into the "More than 32 required/prohibited clauses in query" exception when running a query. I thought I understood the problem but the following two scenarios confuse me. ...
    Kevin Dutcher
    Feb 9, 2006 at 12:25 am
    Feb 9, 2006 at 9:47 pm
  • After upgrading to Lucene 1.9, an index that used to take about 9h to build now requires 13h. Has anyone else noticed a decrease in performance? This is how I configure the IndexWriter: writer = new ...
    Eric Jain
    Feb 25, 2006 at 1:22 pm
    Mar 1, 2006 at 9:30 am
  • Hello there, I am a "newbie" in the usage of Apache Lucene. I have a relatively big database indexed by Lucene (about a 300MB file). Up to now, all users could search over the whole index. How to restrict ...
    Thomas Papke
    Feb 24, 2006 at 1:11 pm
    Feb 27, 2006 at 1:39 pm
  • Hi, I am using MultiFieldQueryParser (Lucene 1.9) to search title and body fields in the documents. The requirement is that documents with title match should be returned before the documents with ...
    Feb 17, 2006 at 6:28 pm
    Feb 20, 2006 at 3:22 pm
  • Hello, is there a way to access a Lucene index which is stored inside a .zip or .jar file? This is important because my indexes are very large (200 MB) and I need to compress them. I tried to ...
    Ahmed El-dawy
    Feb 18, 2006 at 8:38 pm
    Feb 19, 2006 at 11:26 pm
  • Hi, I have the following constellation (planned architecture): [Webserver - APACHE] which serves the content [unspecified other servers] [CMS Server / SearchEngine - TOMCAT] handles the content creation ...
    David Trattnig
    Feb 16, 2006 at 12:21 pm
    Feb 16, 2006 at 8:02 pm
  • Hi all, I am trying to index into 2 different indexes with different mergefactors, so if one of the indexes is merging, then I want to index only into the other index. Is there a way to know if an index ...
    Omar Didi
    Feb 2, 2006 at 8:40 pm
    Feb 15, 2006 at 10:38 pm
  • Given a query, I want to be able to, for each query term, get the number of occurrences of the term. I have tried what I'm including below and it does not seem to provide reliable results. Seems to ...
    Dmitry Goldenberg
    Feb 6, 2006 at 10:34 pm
    Feb 9, 2006 at 10:30 pm
  • I am trying to figure out whether or not Lucene is an appropriate solution for a problem that our site faces. Our site allows users to post their opinions on various topics. Due to various government ...
    Jeff Thorne
    Feb 6, 2006 at 3:56 am
    Feb 6, 2006 at 10:54 pm
  • I have just joined this user group, but I probably will be asking questions / contributing for a while now as I am starting to work on a product which will use Lucene exclusively. Still in the ...
    Pradeep Sharma
    Feb 1, 2006 at 2:04 am
    Feb 2, 2006 at 12:24 pm
  • I search the index with a group of terms. I want to get every term's frequency in each document of the search result. How can I? thx, sog ...
    Feb 22, 2006 at 4:36 am
    Feb 24, 2006 at 1:27 am
  • I'm sure you've taken care of this, but I am curious myself: If the 301 document only has a single term "batteries" (and thus is so far low on the Hits), but has a price of seven cents, then the sort ...
    John Powers
    Feb 21, 2006 at 5:54 pm
    Feb 23, 2006 at 2:20 am
  • I am trying to adopt Lucene for a special IR system. The following scenario is an approximation of what I am trying to do. Please bear with me if some things don't make sense. I need some ...
    Rajesh Munavalli
    Feb 21, 2006 at 11:45 pm
    Feb 22, 2006 at 4:57 pm
  • Hello, I would like to figure out if it is possible to write a java applet able to search with lucene through an HTML documentation WITHOUT having a webserver installed on the system and on multiple ...
    Paolo berto
    Feb 20, 2006 at 10:44 am
    Feb 22, 2006 at 11:21 am
  • Hi, is it possible to open an IndexWriter and an IndexReader on the same index, at the same time, to do deleteTerm and addDocument? Thanks! Pierre-Luc
    Pierre Luc Dupont
    Feb 21, 2006 at 3:06 pm
    Feb 22, 2006 at 8:31 am
  • This is somewhat related to a question sent to this list a while ago: Is there an efficient way to count the number of occurrences of a phrase (not term) in an index? ...
    Eric Jain
    Feb 23, 2006 at 8:32 am
    Feb 26, 2006 at 2:11 am
  • Maybe too general a question, but is there anything about creating an IndexSearcher( directory) object that would make the instantiation really slow? I have one index where the instantiation is very ...
    Gus Kormeier
    Feb 22, 2006 at 5:26 pm
    Feb 23, 2006 at 12:57 am
  • Hi, Can anyone share the experience of how to implement Relevance Feedback in Lucene? Can someone suggest me some algorithms and papers which can help me in building an effective Relevance Feedback ...
    Varun sood
    Feb 15, 2006 at 6:36 am
    Feb 15, 2006 at 10:39 pm
  • Hello all, I'm replying to two threads at once as what I have to say relates to both. My company recently started an open source project called Aperture (http://sourceforge.net/projects/aperture), ...
    Christiaan Fluit
    Feb 9, 2006 at 12:11 pm
    Feb 14, 2006 at 11:03 am
  • Hello Lucene members, I'm a silent member of this group. Last week I sent a query regarding reindexing, but I didn't receive a reply from anyone. I'm still stuck with the same problem of ...
    Revati joshi
    Feb 7, 2006 at 12:59 pm
    Feb 8, 2006 at 12:22 pm
  • Hi, I have a problem understanding queryNorm and fieldNorm. The following is an example. I try to follow what is said in the Javadoc: "Computes the normalization value for a query given the sum of ...
    Feb 6, 2006 at 9:19 am
    Feb 6, 2006 at 5:51 pm
  • I have a problem. There is an index which contains about 6,000,000 records (soon to be 15,000,000); its size is 4GB. The index is optimized and consists of only one segment. This index stores the ...
    Anton Potehin
    Feb 28, 2006 at 8:00 am
    Mar 3, 2006 at 5:58 pm
  • Hi there. I am new to Lucene and I have been developing a semantic application for a while and it appears to me Lucene could help me to get a much needed search with reasonable speed. I have some ...
    David Pratt
    Feb 22, 2006 at 3:20 am
    Feb 24, 2006 at 1:47 am
  • I was wondering: Is there any good reason why x AND y OR z is interpreted as +(+x y z) rather than +(+(+x +y) z) ? If yes, any suggestions how this could be accomplished most easily? Searched the ...
    Eric Jain
    Feb 21, 2006 at 1:41 pm
    Feb 22, 2006 at 6:48 pm
  • While building a large index, we had a power outage. Over 2 million documents had been added, each document with up to about 20 fields. The size of the index on disk is ~500MB. When I started the ...
    Michael van Rooyen
    Feb 19, 2006 at 10:07 pm
    Feb 22, 2006 at 9:46 am
  • Hi all. I've just implemented some magic query syntax which expands simple queries to queries containing a whole lists of words. I've implemented the queries themselves using a slight modification on ...
    Daniel Noll
    Feb 17, 2006 at 4:17 am
    Feb 19, 2006 at 11:31 pm
  • If I am using lucene (daily build from ~ a month ago or so) on windows - and when I merge two indexes together, I get a number of .cfs files noted in my 'deleteable' file - but they never seem to ...
    Dan Armbrust
    Feb 13, 2006 at 5:27 pm
    Feb 14, 2006 at 9:21 am
  • I've been hunting an insidious problem whereby during heavy incremental indexing operations in production on redhat el3 machine I notice that the java process has a lot of open files which appear to ...
    Paul Smith
    Feb 13, 2006 at 6:43 am
    Feb 13, 2006 at 11:04 pm
  • I'm trying to upgrade our search functionality (currently, RTF/text only, and exact phrase match only) at my company, and have run into some concerns. Our 4 main formats are: RTF - javax.swing looks ...
    Feb 9, 2006 at 2:52 am
    Feb 11, 2006 at 12:49 am
  • Is it possible to add records into lucene index using following algorithm: 1) create Document object 2) add 5 fields into Document (id, name, field1, field2, field3). All fields are stored, indexed ...
    Anton Potehin
    Feb 8, 2006 at 10:01 am
    Feb 10, 2006 at 2:15 am
  • Hi Friends How do I send one search query to multiple search Indexes which are on remote machines ? Which Technology will help me (AJAX / simple Servlet) ? Thanks........... in advance I hope I will ...
    Vikas Khengare
    Feb 2, 2006 at 8:21 am
    Feb 3, 2006 at 6:14 am
  • Hi! Is there a way to retrieve a List of the matching words for a Hit? For example I create a query like this one: "Paris London -Stockholm" Now I get a Hit object with a couple of results back where ...
    Samuru Jackson
    Feb 27, 2006 at 11:50 am
    Mar 1, 2006 at 9:33 pm
  • Hi, My documents are in the following format. doc.add(new Field ("id",page, Field.Store.YES, Field.Index.TOKENIZED)); doc.add(new Field ("content",fileContent.toString(), Field.Store.YES, ...
    Seeta Somagani
    Feb 28, 2006 at 3:55 pm
    Feb 28, 2006 at 9:38 pm
  • Hi all. Due to a performance problem, I'm looking for a way of restricting the docs returned based on the number of docs in which a field has the same value. At the moment we just discard the docs if more than ...
    Emerson cargnin
    Feb 27, 2006 at 4:43 pm
    Feb 28, 2006 at 3:51 pm
  • I have been trying to figure out why my query below would not return any hits. I use two custom analyzers for indexing and searching. The one I use for indexing uses this: public TokenStream ...
    Mufaddal Khumri
    Feb 23, 2006 at 9:17 pm
    Feb 23, 2006 at 10:56 pm
  • Hello, Before I learned about filters in lucene I was building my initial query as a stringbuffer and then I use that with a queryparser. Is there any difference/advantage to separating out the ...
    John Powers
    Feb 21, 2006 at 3:33 pm
    Feb 21, 2006 at 7:49 pm
  • Hi, I have used Lucene in my application and am just indexing and searching on some documents. The code that indexes the documents was working fine till yesterday and suddenly stopped working. I get ...
    Shivani Sawhney
    Feb 16, 2006 at 4:28 am
    Feb 17, 2006 at 6:39 pm
  • I am looking for a comparison between the theoretical Vector Space Model and the theoretical Probabilistic Model in Information Retrieval. I know that concrete implementations do differ from that. ...
    Karl Koch
    Feb 16, 2006 at 6:29 pm
    Feb 17, 2006 at 2:47 pm
  • Hi, I would like to implement the Okapi BM25 weighting function using my own Similarity implementation. Unfortunately BM25 requires the document length in the score calculation, which is not provided ...
    Trieschnigg, R.B. (Dolf)
    Feb 16, 2006 at 10:41 am
    Feb 17, 2006 at 10:52 am
Group Overview
group: java-user @

147 users for February 2006

Chris Hostetter: 40 posts; Erik Hatcher: 36 posts; Otis Gospodnetic: 29 posts; Mufaddal Khumri: 22 posts; Daniel Noll: 18 posts; Michael D. Curtin: 16 posts; Yonik Seeley: 16 posts; Eric Jain: 11 posts; Grant Ingersoll: 9 posts; John Powers: 9 posts; Xing jiang: 9 posts; Leon Chaddock: 8 posts; Doug Cutting: 7 posts; Greg Gershman: 6 posts; Jeff Rodenburg: 6 posts; Paul Elschot: 6 posts; Shivani Sawhney: 6 posts; Anton Potehin: 5 posts; Chun Wei Ho: 5 posts; Dmitry Goldenberg: 5 posts