124 discussions - 495 posts

  • Hi All, Before I start reinventing wheels I would like to do a short check to see if anybody else has already tried this. A customer has requested us to look into the possibility to perform a spell ...
    Aad Nales
    Sep 9, 2004 at 7:51 am
    Sep 20, 2004 at 1:07 pm
  • Hi, I am facing an out of memory problem using Lucene 1.4.1. I am re-indexing a pretty large number ( about 30.000 ) of documents. I identify old instances by checking for a unique ID field, delete ...
    Daniel Taurat
    Sep 9, 2004 at 5:47 pm
    Sep 13, 2004 at 1:37 pm
  • Hi! I'm using Lucene for an application which has lots of fields/document, in which the users can specify in their config files what fields they wish to be included by default in a search. I'd been ...
    Bill Janssen
    Sep 8, 2004 at 1:07 am
    Oct 4, 2004 at 10:10 pm
  • Hi All, How do I implement the PhraseQuery API? Please send me some sample code. (How can I handle "java is platform" as a single word?) Regards, Natarajan.
    Sep 14, 2004 at 9:21 am
    Sep 14, 2004 at 2:38 pm
  • Dear all, I saw a post about an attempt to integrate Carrot2 with Lucene. It was a while ago, so I'm curious if any outcome has been achieved. Anyway, as the project coordinator I can offer my help ...
    Dawid Weiss
    Sep 23, 2004 at 11:36 am
    Oct 7, 2004 at 4:07 pm
  • Hi, I think I can reproduce a memory leak problem while reopening an index. The Lucene version tested is 1.4.1; version 1.4 final works OK. My JVM is: $ java -version java version "1.4.2_05" Java(TM) 2 ...
    Jiří Kuhn
    Sep 13, 2004 at 1:06 pm
    Sep 14, 2004 at 7:24 am
  • Hi all, I use PDFBox to parse PDF files into Lucene documents. When I parse Chinese PDF files, PDFBox does not always succeed. Does anyone have some advice? ...
    Sep 8, 2004 at 5:37 am
    Sep 9, 2004 at 12:31 pm
  • I have been investigating a serious memory problem in our web app (using Tapestry, Hibernate, & Lucene) and have reduced it to being the way in which we are using Lucene to search on things. Being a ...
    Bryan Dotzour
    Sep 29, 2004 at 1:11 pm
    Oct 1, 2004 at 4:45 pm
  • Hi all, first, here's how to reproduce the problem: Go to http://www.denic.de/en/special/index.jsp and enter "obscure service" in the search field. You'll get 132 hits. Now enter "obscure service*" - ...
    Ulrich Mayring
    Sep 23, 2004 at 9:09 am
    Sep 24, 2004 at 8:35 am
  • Hi, I have built a nice lucene application on linux with no problems, but when I ported to windows for the customer, I started experiencing problems with the index not closing. This prevents ...
    Fred Toth
    Sep 20, 2004 at 3:19 am
    Sep 20, 2004 at 6:59 pm
  • Hi, I have implemented text-based search using Lucene. It was wonderful playing around with it. Now I want to enhance the application. I have a root folder, and under that I have many other folders, that ...
    Mahaveer jain
    Sep 14, 2004 at 3:22 pm
    Sep 15, 2004 at 7:04 am
  • http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/IndexSearcher.html#close() What is the intent of IndexSearcher.close()? I want to know how, in a web app, one can stop a search ...
    David Spencer
    Sep 8, 2004 at 6:17 pm
    Sep 10, 2004 at 4:11 am
  • Hi all, I have a strange problem with the get and setBoost functions (lucene-1.4.1). I am trying the following code: [...] Document d1 = new Document(); Field f1 = Field.Text("field", "word"); ...
    Bastian Grimm [Eastbeam GmbH]
    Sep 22, 2004 at 4:44 pm
    Sep 29, 2004 at 7:27 pm
  • Hello, I can successfully index and search the PDF documents; however, I am not able to highlight the searched text in my original PDF file (i.e., like dtSearch highlights on the original file). I took a ...
    Balasubramanian Vijay
    Sep 20, 2004 at 9:54 pm
    Sep 27, 2004 at 6:37 pm
  • Hi, I was hoping it wouldn't come to this: I've got unicode in my source HTML. In particular, within meta tags, and it's getting broken by the indexer. Note that I'm not trying to query on any of ...
    Fred Toth
    Sep 24, 2004 at 5:57 pm
    Oct 1, 2004 at 8:31 am
  • Hi, I know this is probably a common question and I've found a couple of posts about it in the archive but none with a complete answer. If there is one please point me to it! The question is that I ...
    Sep 22, 2004 at 7:22 am
    Sep 22, 2004 at 1:29 pm
  • I have a question regarding QueryParser and lucene-1.4.1.jar: When using lucene-1.3-final.jar, a query of the form: Field:(A AND -(B)) was parsed into +Field:A -Field:B (using QueryParser.parse()). ...
    Polina Litvak
    Sep 15, 2004 at 7:55 pm
    Sep 17, 2004 at 1:26 pm
  • Anyone know of any reliable parsers out there for PDF, Word, Excel, or PowerPoint?
    Sep 9, 2004 at 1:48 pm
    Sep 13, 2004 at 7:38 am
  • Hi, I noticed a behavior with wildcard searches and would like to clarify it. According to http://www.jguru.com/faq/view.jsp?EID=538312 on jGuru, the Analyzer is not used for wildcard queries. In my case I have a document ...
    Honey George
    Sep 9, 2004 at 11:41 am
    Sep 9, 2004 at 3:24 pm
  • Hello, I'm seeing in my index directory some segment files that are not included in the segments or deletable files. These segment files show their last modified date to be anywhere between a couple ...
    Edwin Tang
    Sep 29, 2004 at 4:25 pm
    Oct 4, 2004 at 9:21 pm
  • I was wondering what the best way was to go about returning, say, 1,000,000 results, divided up into, say, 50-element sections, and then accessing them via the first 50, second 50, etc. Is there a way ...
    Chris Fraschetti
    Sep 21, 2004 at 7:33 pm
    Sep 22, 2004 at 3:39 pm
  • Hello there, Because the [# TO #] range search works lexicographically, I am forced to build a rather large boolean query to get range data from my index. I have an ID field that contains ...
    Shawn Konopinsky
    Sep 20, 2004 at 4:26 pm
    Sep 20, 2004 at 8:30 pm
  • Hi, I'm currently developing a search engine for a few websites and would like to use Lucene to do so. After reading some docs, a post on jGuru states that some concurrent operations are forbidden ...
    Daniel CHAN
    Sep 15, 2004 at 9:53 am
    Sep 17, 2004 at 12:31 am
  • Hi, This might be more of a question related to the PorterStemmer algorithm than to Lucene, but if anyone has the knowledge please share. I am using the PorterStemFilter that comes with ...
    Honey George
    Sep 14, 2004 at 5:57 pm
    Sep 15, 2004 at 5:27 am
  • This doesn't work either! Let's concentrate on the first version of my code. I believe that the code should run endlessly (I have said it before: in version 1.4 final it does). Jiri. -----Original ...
    Jiří Kuhn
    Sep 13, 2004 at 3:50 pm
    Sep 13, 2004 at 9:55 pm
  • My application currently uses Lucene with an index living on the filesystem, and it works fine. I'm moving to a clustered environment soon and need to figure out how to keep my indexes together. ...
    Ben Sinclair
    Sep 7, 2004 at 8:35 pm
    Sep 8, 2004 at 3:06 pm
  • Hi all, I want to discuss a little problem, lucene doesn't support *Term like queries. I know that this can bring a lot of results in the memory and therefore it is restricted. I think that allowing ...
    Sergiu gordea
    Sep 8, 2004 at 10:24 am
    Oct 7, 2004 at 9:54 am
  • I am new to Lucene, and trying to perform a sorted query on a list of people's names. Lucene seems unable to properly sort on the name field of my indexed documents. If I sort by the other (shorter) ...
    Daly, Pete
    Sep 28, 2004 at 7:46 pm
    Sep 30, 2004 at 1:41 pm
  • Hi all, I'm trying to understand what's going on with the query parser and keyword fields. I've got a large subset of my documents which are "publications". So as to be able to query these, I've got ...
    Fred Toth
    Sep 24, 2004 at 4:25 pm
    Sep 29, 2004 at 4:06 pm
  • Can someone assist me in building, or deny the possibility of, combining a range query and a standard query? Say, for instance, I have two fields I'm searching on... one being a field with an epoch ...
    Chris Fraschetti
    Sep 20, 2004 at 3:44 am
    Sep 20, 2004 at 7:27 am
  • I sent out an email to this list a few weeks ago about how to fix a corrupt index. I basically edited the segments file with a hex editor removing the entry for the missing file and decremented the ...
    Sep 7, 2004 at 3:48 pm
    Sep 8, 2004 at 1:40 am
  • I want to sort a result set but perform a group by as well, i.e., remove duplicate items. Is this possible with the new API? Seems like a huge drawback to Lucene right now. Kevin -- Please reply using ...
    Kevin A. Burton
    Sep 5, 2004 at 8:16 am
    Sep 7, 2004 at 4:51 pm
  • Hi, Does anyone know if there is free software to crawl internet sites (a web crawler)? I know Lucene currently does not have this feature, according to the official Lucene FAQ. Thanks very much for your help, ...
    Zhang, Lisheng
    Sep 29, 2004 at 5:38 pm
    Sep 30, 2004 at 10:57 am
  • Hi, I'm trying to learn the Scoring mechanism of Lucene. I want to fetch each parameter value individually as they are collectively dumped out by Explanation. I've managed to pull out TF and IDF ...
    Zia Syed
    Sep 28, 2004 at 7:27 pm
    Sep 29, 2004 at 5:15 pm
  • I am having trouble reindexing. Basically what I want to do is: 1. Delete the old index 2. Write the new index. The environment: The index is searched by a web app running from the Orion App Server. This ...
    Sep 29, 2004 at 1:46 am
    Sep 29, 2004 at 3:54 pm
  • Hello, I want to administer two indexes, one for online searches and another for the indexing process. I want the users to search a complete index; if I let the users search over the same index that ...
    Ernesto De Santis
    Sep 27, 2004 at 3:14 pm
    Sep 28, 2004 at 7:23 am
  • I am working on extending Lucene to support documents with special islands of an XML language, and I want to index the islands differently from the text. My current plan is to break the document's ...
    Greg Langmead
    Sep 23, 2004 at 9:10 pm
    Sep 23, 2004 at 10:57 pm
  • Hi, I've been working with the HTML parser demo that comes with Lucene and I'm trying to understand why it's multi-threaded, and, more importantly, how to exit gracefully on errors. I've discovered ...
    Fred Toth
    Sep 23, 2004 at 2:43 am
    Sep 23, 2004 at 7:47 pm
  • Is there a limitation in Lucene when it comes to wildcard search? Is it a problem if we use fewer than 3 characters along with a wildcard (*)? It gives me an error if I try using 45*, *34, *3, etc. Too ...
    Raju, Robinson (Cognizant)
    Sep 21, 2004 at 4:50 am
    Sep 23, 2004 at 4:12 am
  • Hi everyone, I am trying to use the Lucene + BDB integration from the sandbox (http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/db/). I installed C Berkeley DB 4.2.52 and I have ...
    Christian Rodriguez
    Sep 20, 2004 at 10:37 pm
    Sep 21, 2004 at 5:16 pm
  • Hi, I was looking through the score computation when running search, and think there may be a discrepancy between what is _documented_ in the org.apache.lucene.search.Similarity class overview ...
    Ken McCracken
    Sep 14, 2004 at 12:20 am
    Sep 20, 2004 at 6:53 pm
  • Hi, I'm having a problem with a range query. I have a field in my documents called "adzer". In at least one of those documents, the value is: "-000000009999999993" (without the quotes). I know this ...
    Derek Baker
    Sep 17, 2004 at 5:38 pm
    Sep 17, 2004 at 7:36 pm
  • Christoph, Just curious - how are you currently using Term Vectors? They seem to be a neat feature with lots of future promise, but I'm not sure how to best use them now. Regards, Terry ----- ...
    Terry Steichen
    Sep 16, 2004 at 11:18 am
    Sep 17, 2004 at 2:57 pm
  • What is the best resource for beginners looking to understand Lucene's functionality, i.e., its use of fields, documents, the index reader and writer, etc.? Is there any web resource that goes into details ...
    Ian McDonnell
    Sep 15, 2004 at 4:52 pm
    Sep 16, 2004 at 7:38 am
  • Luceners, My search looks up whole entities. My entities are accounts, contacts, tasks, etc. My search looks up a group of each entity's fields. This works fine, even though I haven't indexed any ...
    Wermus Fernando
    Sep 15, 2004 at 5:12 pm
    Sep 15, 2004 at 6:14 pm
  • Hi guys, apologies. My task is to build the index folder using Lucene and a simple Build.xml for Ant. The problem: the same Build.xml should be used for different OSes [ Win / ...
    Karthik N S
    Sep 13, 2004 at 8:58 am
    Sep 14, 2004 at 3:08 pm
  • Hi all, I was wondering if anyone could tell me what the expected behaviour is for calling an explain() without calling a search() first on a particular query. Would it effectively do a search and ...
    Minh Kama Yie
    Sep 8, 2004 at 4:23 am
    Sep 13, 2004 at 1:32 am
  • Is it safe to change the compound file format option at any time during the life of an index? Can I build an index with it off, then turn it on, and call optimize, and have a compound file formatted ...
    Armbrust, Daniel C.
    Sep 8, 2004 at 7:01 pm
    Sep 8, 2004 at 8:05 pm
  • Hi all, I ran into the following problem with the Lucene demo: each time I create a Lucene index, I have to first stop Tomcat, and restart Tomcat after the index is created. The reason is: the index is ...
    Hui liu
    Sep 7, 2004 at 8:34 pm
    Sep 7, 2004 at 10:48 pm
  • Hi, I am new to Lucene. Can anyone tell me where I can download a free Lucene book? Thanks and regards, E. Faisal
    Ebrahim Faisal
    Sep 7, 2004 at 6:58 am
    Sep 7, 2004 at 12:38 pm
Group Overview
group: java-user @

111 users for September 2004

  • David Spencer: 29 posts
  • Otis Gospodnetic: 25 posts
  • Doug Cutting: 20 posts
  • Erik Hatcher: 19 posts
  • Sergiu Gordea: 18 posts
  • Morus Walter: 15 posts
  • Daniel Naber: 14 posts
  • Wermus Fernando: 13 posts
  • Honey George: 11 posts
  • Kevin A. Burton: 11 posts
  • Paul Elschot: 11 posts
  • Chris Fraschetti: 10 posts
  • Cocula Remi: 10 posts
  • Erik Hatcher: 10 posts
  • Fred Toth: 10 posts
  • Aad Nales: 8 posts
  • Aviran: 8 posts
  • Daniel Taurat: 8 posts
  • Will Allen: 8 posts
  • Jiří Kuhn: 7 posts