116 discussions - 464 posts

  • I've got an application that will be doing constant updates to an index. I've looked into batching those updates, however, based on the way the application works, the updates can't be batched. (Well, ...
    Roy Klein
    Apr 14, 2005 at 1:40 pm
    Jul 7, 2005 at 9:55 am
  • Hi all. I'm running Lucene.NET in a Windows/ASP.NET environment. We are searching a 300meg index in a web environment, where the IndexSearcher is cached. Every 10-30 minutes, a separate process ...
    Monsur Hossain
    Apr 28, 2005 at 10:10 pm
    Apr 29, 2005 at 8:19 pm
  • In my application, by default I display all documents that are in the index. I sort them either using a "time modified" or "time created". If I have a newly created empty index, I find I get an error ...
    Bill Tschumy
    Apr 11, 2005 at 7:28 pm
    Apr 15, 2005 at 9:48 pm
  • Hi, I am sure this question must have been raised before and maybe it has even been answered. I would be grateful if someone could point me in the right direction or give their thoughts on this topic. The ...
    Mufaddal Khumri
    Apr 19, 2005 at 6:10 pm
    Apr 22, 2005 at 9:33 pm
  • Is the source code of Lucene 2.0 accessible? I have looked on the site, but I couldn't find a link. And where are the archived mailing lists? They were of great value to me. Kind regards, ...
    Peter Veentjer - Anchor Men
    Apr 25, 2005 at 12:08 pm
    May 2, 2005 at 7:29 am
  • Hi, Seems like an odd request I'm sure. However, my application relies on an index, and should the index become unusable for some unfortunate reason, I'd like my app to gracefully cope with this ...
    Andy Roberts
    Apr 19, 2005 at 8:18 pm
    Apr 21, 2005 at 8:41 am
  • I am writing a document management system for my company, and many of our feature names are in Hungarian notation (PowerQuery, TransactionManager, etc.). This can make it hard to find some things ...
    Paul Smith
    Apr 12, 2005 at 11:41 pm
    Apr 30, 2005 at 8:18 pm
  • Hi Volodymyr, About the trick you described about wildcard search replacement, you mentioned: as sequence of terms, each of containing single digit from needed value. (For example I have “123214213” ...
    Aalap Parikh
    Apr 19, 2005 at 7:22 pm
    Apr 27, 2005 at 5:17 pm
  • I can't connect to svn.apache.org. It seems that apache.org is down.
    Volodymyr Bychkoviak
    Apr 20, 2005 at 4:12 pm
    Apr 20, 2005 at 5:45 pm
  • hello, this is my first posting to this thread, and i haven't played with the libraries as of yet. i'm curious whether people have been using lucene/nutch to convert results to rss and what would be ...
    Apr 13, 2005 at 3:57 pm
    Apr 13, 2005 at 7:54 pm
  • Hello, I have really been pulling out hair for a while over this. The problem only occurs using FSDirectory on some Linux systems (some Debian, Suse and Redhat - but not all) and never under Windows, ...
    Kristian Ottosen
    Apr 2, 2005 at 8:29 am
    Apr 13, 2005 at 7:47 am
  • Hello, If I don't know the language of the input terms, how can I use different analyzer to search it ? For example, the input box accepts UTF-8 search text, they can be anything, such as Chinese, ...
    Eric Chow
    Apr 11, 2005 at 1:21 am
    Apr 12, 2005 at 12:10 pm
  • I had a customer report a corrupted Lucene index. He had copied the index to backup storage, reformatted his drive, and then restored the data. After that Lucene has trouble opening the index. Here ...
    Bill Tschumy
    Apr 8, 2005 at 5:26 pm
    Apr 11, 2005 at 6:09 pm
  • Are there any Lucene extensions that can do simple stemming, i.e. just for plurals? Or is the only stemming package available Snowball? Cheers -- Miles Barr <miles@runtime-collective.com Runtime ...
    Miles Barr
    Apr 1, 2005 at 5:16 pm
    Apr 2, 2005 at 10:40 am
  • I doubt many are on this list, yet. But your question is probably best asked on the java-user@lucene list rather than here. I'll CC java-user this time to loop those folks in. That's pretty much how ...
    Erik Hatcher
    Apr 26, 2005 at 1:00 pm
    May 4, 2005 at 1:00 am
  • Any ideas on this? I have purchased your book, Lucene in Action, which is quite good. To make things easier, consider the example on p212. In item 4, when you combine the queries, what happens you ...
    Kipping, Peter
    Apr 1, 2005 at 4:14 pm
    Apr 27, 2005 at 4:00 pm
  • I have an index of around 3 million records, and typical queries can result in result sets of between 1 and 400,000 results. We have indexed "dateTime" fields in the form 20050415142, that is, to ...
    James Levine
    Apr 22, 2005 at 1:27 am
    Apr 22, 2005 at 8:02 pm
  • According to "Lucene in Action" it is possible to get synonyms indexed together with a word by putting multiple words with the same position-id in the term vector. My problem is, however, that some ...
    Peter Hotm. Nørregaard
    Apr 11, 2005 at 1:36 pm
    Apr 12, 2005 at 2:54 pm
  • Hi everybody, We have been using Lucene for about one year now with great success. Recently though the index has grown noticeably and so has the number of searches. I was wondering if anyone would ...
    Daniel Herlitz
    Apr 6, 2005 at 10:39 pm
    Apr 7, 2005 at 4:05 pm
  • Oops, sorry. First went to dev by accident. ---------- Forwarded message ---------- I know Lucene is very scalable in many ways, but how about number of fieldnames? We have an index using around 6000 ...
    Yonik Seeley
    Apr 4, 2005 at 9:38 pm
    Apr 6, 2005 at 4:59 pm
  • Hi, I was wondering whether anyone has any experience of multithreaded updates to indexes. In the web app I am working on there are additions, updates and deletes that need to happen to the index ...
    Lee Turner
    Apr 5, 2005 at 7:40 am
    Apr 6, 2005 at 12:06 pm
  • I'm writing a little application and therefore I've implemented unit tests. There I have a method to test my removeindex method; the problem is that I can't delete the cfs file. When I try to delete it ...
    Gusenbauer Stefan
    Apr 2, 2005 at 2:13 pm
    Apr 5, 2005 at 6:23 pm
  • Hi all, I have indexed a field that describes the "category" of the document. Thus, I want to know how many categories have a specific term. Could someone help me to get this with good performance? ...
    Pablo Gomes Ludermir
    Apr 24, 2005 at 2:05 pm
    May 11, 2005 at 9:03 pm
  • Hi, I am working on a program to index/search chemical element/compound. Say I write an analyzer to filter out chemical terms, such as H2O. I noticed that I can specify a token's type. Can I ...
    Apr 16, 2005 at 2:21 am
    Apr 22, 2005 at 10:19 am
  • Hi, Lucene users: Does anyone know how to add line numbers from the original source content to the Lucene search results? For example: I have a file "Test.java" which is indexed by Lucene. When I search ...
    Cerberus yao
    Apr 11, 2005 at 9:50 am
    Apr 11, 2005 at 5:55 pm
  • The wiki appears to have undergone some style changes recently, the layout is a lot different now (and in my opinion: cleaner) but a side effect seems to be that some page formatting which used to ...
    Chris Hostetter
    Apr 5, 2005 at 9:56 pm
    Apr 6, 2005 at 6:02 pm
  • Hello, java-user. I have documents with a tokenized, indexed and stored field. This field usually contains one or two words. I need to be able to search for exact matches on two words. For example search ...
    Yura Smolsky
    Apr 4, 2005 at 8:34 pm
    Apr 6, 2005 at 11:39 am
  • Hi guys Apologies.......... Using a MultiFieldQueryParser for a query like below for a CUSTOM SEARCH (+(+KEY3:camera +KEY3:photo) -KEY3:accessories -KEY3:studio -KEY3:cleaners -KEY3:film -KEY3: ...
    Karthik N S
    Apr 4, 2005 at 8:54 am
    Apr 4, 2005 at 5:31 pm
  • -----Original message----- From: Peter Veentjer - Anchor Men Sent: Tuesday, 26 April 2005 15:44 To: 'Daniel Naber' Subject: RE: CVS Lucene 2.0 -----Original message----- From: Daniel ...
    Peter Veentjer - Anchor Men
    Apr 26, 2005 at 1:45 pm
    Apr 26, 2005 at 2:13 pm
  • Hello Everyone, I need to be able to iterate through the entire set of documents within the index to perform some auditing. I originally tried the following code snip: int ndoc = idxReader.numDocs(); ...
    Tomcat Programmer
    Apr 19, 2005 at 9:11 pm
    Apr 21, 2005 at 3:21 am
  • Hi, I am working on an index to search XML data in a fixed format that I master well... The idea is that the XML content (which I have as JDOM object) actually carries the semantic which would be ...
    Paul Libbrecht
    Apr 19, 2005 at 7:55 pm
    Apr 20, 2005 at 7:51 pm
  • Hi, currently I'm writing my bachelor's thesis about Lucene. I searched for theoretical information, for example about the IR model Lucene uses, but I couldn't find anything so I had to figure it out on ...
    Barbara Krausz
    Apr 20, 2005 at 4:12 pm
    Apr 20, 2005 at 7:14 pm
  • Hello all, We're currently evaluating search tools to cover the following requirement: We have an NTFS file server with 2 TB of files (word, excel, pdf, txt, etc). We would like to index all these ...
    Maher Martin
    Apr 13, 2005 at 10:39 am
    Apr 14, 2005 at 9:13 am
  • Hello, I am new to Lucene. I have the following problem. When I execute a search I receive the list of document Hits. I can also get the content of the documents without a problem: for (int i = 0; i < ...
    Patricio Galeas
    Apr 11, 2005 at 12:27 pm
    Apr 12, 2005 at 2:33 pm
  • I'm forced to keep dates down to the millisecond. The reason is simple: I get at least a couple of new messages per second, and if all of them are stamped with the same time, the retrieval order is undefined, i.e. ...
    Iouli Golovatyi
    Apr 6, 2005 at 12:36 pm
    Apr 7, 2005 at 4:16 pm
  • I will soon create some tests for this scenario, but wanted to run this by the list as well.... What performance differences would be seen between a query like this: a AND b AND c AND d and this one: ...
    Erik Hatcher
    Apr 1, 2005 at 4:15 pm
    Apr 6, 2005 at 7:49 am
  • Hello all, Is it possible to skip the first "xx" words while indexing a document? For instance, in the code below, I would like to skip the first "xx" words of "file" in the "CONTENTS_FIELD". Is ...
    Pablo Gomes Ludermir
    Apr 29, 2005 at 11:50 am
    Apr 29, 2005 at 1:51 pm
  • First of all, a big thanks to all the Lucene hackers - I've only been using your product for a couple of weeks, and I've been very impressed by what I've seen. Here's my question: I have an index ...
    Mike Baranczak
    Apr 19, 2005 at 11:44 pm
    Apr 20, 2005 at 5:47 pm
  • I have a bunch of documents in my index, some of which have values for a certain field while others don't. I'd like the ones that do have a value to always show up before the ones that don't when ...
    Martin May
    Apr 14, 2005 at 8:21 pm
    Apr 14, 2005 at 8:53 pm
  • Hi, I am currently evaluating the need for an elaborate query data-structure (to be exchanged over XML-RPC) as opposed to working with plain strings. One thing that would heavily vote for strings ...
    Paul Libbrecht
    Apr 14, 2005 at 3:32 pm
    Apr 14, 2005 at 4:55 pm
  • Hello all, I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: <Term, Doc1, Freq>, <Term, Doc2, ...
    Pablo Gomes Ludermir
    Apr 14, 2005 at 3:16 pm
    Apr 14, 2005 at 3:33 pm
  • Hello, I am a beginner in using Lucene. My files contain different languages (English, Chinese, Portuguese, Japanese and some other Asian, non-Latin languages). They always contain in one ...
    Eric Chow
    Apr 11, 2005 at 9:55 am
    Apr 11, 2005 at 10:11 am
  • Hi, Am new to Lucene. I found the following page: http://lucene.apache.org/java/docs/queryparsersyntax.html. At the bottom of the page there is a section that in order to escape special characters ...
    Mufaddal Khumri
    Apr 7, 2005 at 6:22 am
    Apr 7, 2005 at 9:09 am
  • I've got a situation where I'm searching over a number of different repositories, each containing a different set of documents. I'd like to run searches over, say, 4 different indices, then combine ...
    Bill Janssen
    Apr 5, 2005 at 12:55 am
    Apr 5, 2005 at 1:50 am
  • Hi, Maybe this query has been answered before. My first email to this user group did not generate any response. I had forwarded it to the following email ids : java-user-info@lucene.apache.org ...
    Apr 23, 2005 at 9:46 pm
    May 16, 2005 at 3:23 pm
  • Hi, I am trying to index 20349 records. When I index using the FSDirectory I get 20349 documents - this is correct. Now when I use the RAMDirectory to create my index and write all documents from the ...
    Mufaddal Khumri
    Apr 28, 2005 at 8:54 pm
    Apr 28, 2005 at 10:24 pm
  • Hi folks, I have a question about boosting fields in a Query. Suppose we have documents like this in the index: fieldA:String fieldB:String fieldC:Date fieldD:Number And the query is like that: ...
    Apr 27, 2005 at 3:03 pm
    Apr 27, 2005 at 3:36 pm
  • Hello everyone, In the project I'm currently involved we are using lucene (+ Digester) to index a small number of XML files. To be able to perform the searches I want, I should need to query the ...
    Victor Abeytua
    Apr 25, 2005 at 2:40 pm
    Apr 26, 2005 at 8:29 am
  • My machine is pretty good and fairly new. The disk for sure is not slow and also I am not indexing large Documents; 27 fields with each field value being a string with no more than 15-20 characters ...
    Aalap Parikh
    Apr 21, 2005 at 11:47 pm
    Apr 22, 2005 at 5:24 pm
  • Hi, I am looking to get Lucene to participate in a JTA transaction. What would be the best way to do this? I am thinking maybe use a message queue that feeds an indexing thread/message driven bean ...
    Peter Gelderbloem
    Apr 21, 2005 at 1:43 pm
    Apr 21, 2005 at 4:01 pm
Group Overview
group: java-user

130 users for April 2005

  • Erik Hatcher: 56 posts
  • Chris Hostetter: 20 posts
  • Peter Veentjer - Anchor Men: 17 posts
  • Yonik Seeley: 17 posts
  • Doug Cutting: 15 posts
  • Chuck Williams: 13 posts
  • Otis Gospodnetic: 13 posts
  • Paul Libbrecht: 12 posts
  • Andy Roberts: 11 posts
  • Aalap Parikh: 10 posts
  • Volodymyr Bychkoviak: 10 posts
  • Bill Tschumy: 8 posts
  • Gusenbauer Stefan: 8 posts
  • Monsur Hossain: 8 posts
  • Paul Elschot: 8 posts
  • Daniel Naber: 7 posts
  • Eric Chow: 7 posts
  • Mufaddal Khumri: 7 posts
  • Andrzej Bialecki: 6 posts
  • Chris Lamprecht: 6 posts