99 discussions - 453 posts

  • Hi, I am using Lucene's "IndexSearcher" to search the given XML, which contains Farsi information, by keyword. While searching I use ranges like آ-ث | ج-خ | د-ژ | س-ظ | ع-ق | ک-ل | م-ی When I do search ...
    Apr 30, 2008 at 7:22 am
    May 11, 2008 at 6:32 pm
  • Hi, I have been trying to bind my lucene instance (JVM - Sun Hotspot*) to a particular core so as to improve the performance. Is there a way to do so or is there support in lucene to explicitly ...
    Apr 21, 2008 at 7:02 am
    Apr 25, 2008 at 2:51 pm
  • Hi, We are using Lucene 2.0 to index data stored inside relational database. Like any relational database, our database has quite a few one-to-one and one-to-many relationships. For example, let’s ...
    Rajesh parab
    Apr 10, 2008 at 10:17 pm
    Apr 14, 2008 at 6:06 pm
  • Hi, I have two questions about this GREAT tool (framework, library... "whatever"). Well, I decided to put a spell checker in my applications, and I started to read some papers and "found out" the Lucene ...
    Apr 8, 2008 at 3:11 pm
    Apr 8, 2008 at 6:04 pm
  • Hello everybody, I know tons of words have been written about this issue, but I'm still not clear enough about it. I have these facts: 1. my query is always 1 letter and *, e.g. M* 2. I always want ...
    Joe K
    Apr 10, 2008 at 12:53 pm
    Sep 19, 2008 at 10:51 am
  • Hi All, Does Lucene support billions of data in a single index store of size 14 GB for every search? I have 3 index stores of size 14 GB per index. I need to search these index stores and retrieve the ...
    Apr 30, 2008 at 3:54 am
    May 2, 2008 at 8:36 am
  • Hi all, I am a Lucene newbie :) It seems that Lucene doesn't support distributed indexing :( As some IR research papers mentioned, when the document collection becomes large, the index will be large ...
    Samuel Guo
    Apr 26, 2008 at 6:33 am
    Jun 12, 2008 at 6:10 pm
  • My description in English may not be very clear, sorry :) ----- Original Message ----- From: 王建新 To: Chris Sent: Tuesday, April 22, 2008 9:52 AM Subject: Re: Need addtional info for Field Thank you. ...
    Apr 22, 2008 at 5:06 am
    May 4, 2008 at 11:21 am
  • Is there any way to execute a MoreLikeThis over a subset of documents? I need to retrieve a set of interesting keywords from a subset of documents and not the entire index (imagine that my index has ...
    Jonathan Ariel
    Apr 22, 2008 at 9:29 pm
    Apr 23, 2008 at 2:07 pm
  • Hi everyone, I know that there are packages that support the "Did you mean ... ?" search features with lucene which tries to find the most suited correct-word query.. however, so far I haven't ...
    Marjan Celikik
    Apr 3, 2008 at 1:12 pm
    Apr 13, 2008 at 10:23 am
  • Hello, I'm using lucene within a new project and I'm not sure about how to solve the following problem: My index consists of the two attributes "id" and "searchable". "id" is the id of a product and ...
    Daniel Freudenberger
    Apr 25, 2008 at 3:52 pm
    Apr 26, 2008 at 3:31 pm
  • Benchmark. Hardware Environment: Dedicated machine for indexing: yes; CPU: Dual processor dual core Xeon CPU 3.00GHz; hyperthreading ON for 8 virtual ...
    Glen Newton
    Apr 15, 2008 at 4:26 pm
    Apr 18, 2008 at 3:06 pm
  • I've seen some recent activity on LUCENE-831 "Complete overhaul of FieldCache API" and read that it must be able to cleanly patch to trunk (haven't tried yet). What I'd like to know from people ...
    Apr 17, 2008 at 12:15 pm
    Apr 18, 2008 at 9:34 am
  • We're planning to archive email over many years and have been looking at using DB to store mail meta data and Lucene for the indexed mail data, or just Lucene on its own with email data and structure ...
    Antony Bowesman
    Apr 11, 2008 at 10:20 am
    Apr 15, 2008 at 5:24 am
  • Hi all, Can someone from the experts here explain why Lucene has to get a "rewritten" query for the Searcher - so Phrase or Wildcards queries have to rewrite themselves into a "primitive" query, that ...
    Itamar Syn-Hershko
    Apr 7, 2008 at 9:17 pm
    Apr 10, 2008 at 5:33 pm
  • Dear Fellow Java/Lucene developers: I am writing an application where a user is able to search for keywords from within a single book. When the user conducts a search, he/she should receive a set of ...
    Apr 7, 2008 at 5:13 pm
    Apr 8, 2008 at 2:28 pm
  • writer = new IndexWriter(indexLocation, new StandardAnalyzer(), true); String string = request.getParameter("text"); this.log("Text is "+string); Date date = new Date(); String dateString = ...
    Hasan Diwan
    Apr 24, 2008 at 2:44 am
    Apr 24, 2008 at 5:00 pm
  • Hello, I am building fairly large directories (200-500 MB of disk space) using lucene-java. Sometimes it can take upwards of 10-15 mins to create the documents and write them to disk using my current ...
    Chris McGee
    Apr 10, 2008 at 6:35 pm
    Apr 16, 2008 at 6:22 pm
  • Hi, We are planning to provide search functionality in a web-based application. Can we use Lucene to search data from databases like Oracle and MS-SQL? Thanks and Regards प्रशांत सराफ ...
    Prashant Saraf
    Apr 10, 2008 at 1:58 pm
    Apr 11, 2008 at 5:26 am
  • Hi! I found that when sorting the search result -depending on the amount of data in the field to sort by - this can easily lead to FieldCacheImpl to allocate hundreds of MByte RAM. How does this work ...
    Timo Nentwig
    Apr 13, 2008 at 2:46 pm
    Apr 27, 2008 at 6:58 am
  • Hi Folks, I'm trying to load Google Web 1T 5 Gram into Lucene. (This corpus contains English word n-grams and their observed frequency counts. The length of the n-grams ranges from unigrams (single ...
    Rafael Turk
    Apr 23, 2008 at 11:26 am
    Apr 24, 2008 at 11:02 am
  • Hi! I'm using Lucene Proximity Searches, but I've seen that Lucene only scores documents which contain all the terms in the phrase. I also need to score documents even if they don't contain all those ...
    Ana Rábade
    Apr 3, 2008 at 4:13 pm
    Apr 18, 2008 at 4:00 pm
  • Hi, We are currently using Lucene 2.0 for full-text searches within our enterprise application, which can be deployed in clustered environment. We generate Lucene index for data stored inside ...
    Rajesh parab
    Apr 4, 2008 at 12:20 am
    Apr 10, 2008 at 3:18 am
  • Hello all. Before I ask my question, I'd like to clarify that I've read the manual and searched the archives, and if I'm here, it is because I've either not found a suitable answer, or (most likely) I didn't ...
    João Rodrigues
    Apr 29, 2008 at 5:49 pm
    Apr 30, 2008 at 4:04 pm
  • Hey gang, The finally block was not hiding any exceptions: I'm still seeing the hangs in IndexWriter.close(). I'm appending the relevant output from 'setInfoStream(System.out)'. It looks like after I ...
    Stu Hood
    Apr 21, 2008 at 9:25 pm
    Apr 23, 2008 at 7:53 pm
  • Hi, We have been observing the following problem while tokenizing using Lucene's StandardAnalyzer. The tokens that we get are different on different machines. I suspect it has something to do with ...
    Prashant Malik
    Apr 22, 2008 at 6:45 pm
    Apr 22, 2008 at 9:04 pm
  • Hi all, I'm using Lucene 2.3.1. What I'm trying to do seems straightforward enough (to me), but I just can't find the method to do so. Let's say I'm doing a PhraseQuery of the phrase "apples and ...
    Edwin Lee
    Apr 19, 2008 at 8:43 am
    Apr 22, 2008 at 10:29 am
  • Hi All, I am a little confused about the document ids which we collect using the HitCollector.collect() method. Here is a description of the confusion: First I created a FieldCache of type ...
    Shailendra Mudgal
    Apr 17, 2008 at 5:46 pm
    Apr 18, 2008 at 2:15 pm
  • I'm new to Lucene, and would like to use it to find duplicate (or similar) names in a contact list. Is Lucene a good fit? We have a form where a user enters a company or person's name, and we want ...
    Andy DePue
    Apr 16, 2008 at 4:08 pm
    Apr 17, 2008 at 2:07 pm
  • I'm curious how people are building the "all" Field (for searching "all of the terms at once"). I understand using store=NO, Index=Tokenized is generally the way to add the field, but what if I need ...
    AJ Weber
    Apr 15, 2008 at 2:28 pm
    Apr 15, 2008 at 7:05 pm
  • Hi, Pardon the noob question. But which approach is going to be faster over extremely large document sets. A or B? A) Multiple field values, Stored.NO,TOKENIZED. word: one word: two word: three B) ...
    Apr 15, 2008 at 2:54 pm
    Apr 15, 2008 at 6:12 pm
  • Hi all. I have an index with a set of phrases (one or several words). I need to search for these phrases. I am confused, as I cannot find a good way to search for phrases. For example I need to ...
    Apr 14, 2008 at 2:37 pm
    Apr 15, 2008 at 6:08 pm
  • Hello. We're designing a CMS in Java, and I'm trying to implement a site search function using Lucene. The basic concept is: - Site features numerous objects that we'd like to throw into ...
    Илья Казначеев
    Apr 3, 2008 at 12:24 pm
    Apr 15, 2008 at 2:51 pm
  • Hi all, I am wondering if there can be "holes" in the set of index document ids. To be more specific: is it possible that there exists an integer i between 0 and IndexReader.maxDoc() such that ...
    Wojtek H
    Apr 9, 2008 at 4:49 pm
    Apr 14, 2008 at 10:12 pm
  • Hi:
    John Wang
    Apr 3, 2008 at 2:27 pm
    Apr 3, 2008 at 4:52 pm
  • Hi, I want to reconstruct the field value from the index, the same as the "Reconstruct and Edit" function in the tool "Luke". Any hints are welcome. Thanks in advance. -qi
    Apr 2, 2008 at 11:05 am
    Apr 2, 2008 at 5:33 pm
  • Hi, This is from javadoc of ParallelReader: ====================================================== An IndexReader which reads multiple, parallel indexes. Each index added must have the same number of ...
    Rajesh parab
    Apr 29, 2008 at 5:06 pm
    Apr 30, 2008 at 9:49 pm
  • Hi, I want to use payload to store some kind of object id which is an arbitrary byte array for better performance. But I do need some kind of function like searching against payload value. Also when ...
    Apr 21, 2008 at 9:35 am
    Apr 22, 2008 at 10:49 am
  • Hello all- While adapting some code to use IndexWriter instead of IndexModifier (as indicated by the deprecation warnings), I stumbled upon an issue that I at first thought was a bug, but I'm sure ...
    Ulf Dittmer
    Apr 18, 2008 at 11:02 am
    Apr 18, 2008 at 4:33 pm
  • Hi all, I've been doing some performance testing and found that using QueryWrapperFilter for a location field restriction I have to do allows my search results to approach 5-10ms. This was ...
    Michael Stoppelman
    Apr 16, 2008 at 6:05 pm
    Apr 17, 2008 at 7:07 am
  • Hi all, Snowball stemmers are part of Lucene, but only for a few languages. We have documents in various languages and so need stemmers for many languages (in particular Polish). One of the ideas is to ...
    Wojtek H
    Apr 1, 2008 at 9:59 am
    Apr 15, 2008 at 9:51 am
  • Hello, *Sample code:* SpellChecker spell; RAMDirectory dram = new RAMDirectory(); Dicionario dic = new Dicionario(); //one implementation of spell.Dictionary spell= new SpellChecker(dram); ...
    Apr 10, 2008 at 6:43 pm
    Apr 11, 2008 at 6:33 am
  • Hi, I am a new user of Java Lucene. By default a new index is created every time, which requires me to delete the existing index folder. I want to append to the existing index. ...
    Nitasha Walia (niwalia)
    Apr 7, 2008 at 7:45 pm
    Apr 8, 2008 at 1:16 am
  • Is it true that document and field boosts are omitted together with Field#setOmitNorms? Would setting lengthNorm to 1f in the Similarity for these fields, and not omitting norms, fix it? karl ...
    Karl Wettin
    Apr 5, 2008 at 12:34 am
    Apr 5, 2008 at 5:48 pm
  • Is there any reliable implementation for parsing email mailbox files (mbox format), especially large ( 50MB) archives ? Even after searching lucene mailing list archives, googling around, I couldn't ...
    Subodh Damle
    Apr 3, 2008 at 7:00 pm
    Apr 4, 2008 at 2:13 pm
  • Hi Guys, I ran the following test: I created 2 files, File1.txt with content “apple 2 3 4 pear” and File2.txt with content “pear 2 3 4 apple”. I ran the following search tests: 1. Using Luke ...
    Ivan Vasilev
    Apr 3, 2008 at 10:04 am
    Apr 4, 2008 at 7:43 am
  • Hi, I am a new user of Java Lucene and need to learn how to add a new attribute, such that, given a database of emails, containing sender information, searching for a keyword, results in 1. The ...
    Nitasha Walia (niwalia)
    Apr 2, 2008 at 6:26 pm
    Apr 3, 2008 at 9:48 pm
  • I'm trying to figure out what the best practice is in terms of using sorting or customized scoring. For example, if I want to index some static pages and rank them by how many times a page is ...
    John Xiao
    Apr 2, 2008 at 3:57 pm
    Apr 2, 2008 at 7:04 pm
  • I'm working on a JSP-based, free-form text storage & retrieval system based on lucene. Part of my desired feature set includes the ability to retrieve, edit, and update text comprising the document. ...
    Hasan Diwan
    Apr 28, 2008 at 4:41 am
    Apr 29, 2008 at 5:36 pm
  • Hi Thank you for reading my post. I have to design a system with the following requirements, I think Lucene or one of the projects which are based on Lucene can help me as a base to continue on. Here ...
    Legolas wood
    Apr 27, 2008 at 6:28 pm
    Apr 27, 2008 at 7:25 pm
Group Overview
group: java-user @

121 users for April 2008

Karl Wettin: 27 posts, Erick Erickson: 24 posts, Michael McCandless: 19 posts, Glen Newton: 17 posts, Grant Ingersoll: 17 posts, Mathieu Lecarme: 17 posts, Rajesh parab: 16 posts, Otis Gospodnetic: 12 posts, Chris Hostetter: 10 posts, Leandro: 9 posts, Anshum: 8 posts, John Wang: 8 posts, Jonathan Ariel: 8 posts, Antony Bowesman: 7 posts, Mark Miller: 7 posts, Paul Elschot: 7 posts, Cedric Ho: 6 posts, 王建新: 6 posts, Fang_li: 5 posts, Hasan Diwan: 5 posts