126 discussions - 493 posts

  • Hello, I have a problem and I tried everything I could think of to solve it. To understand my situation, I create indexes on several computers on our network and they are copied to one server. There, ...
    Trezzi Michael
    Aug 10, 2005 at 7:48 am
    Aug 19, 2005 at 11:31 am
  • Hi, I wrote an index update where the IndexReader first deletes all changed files from the index, then adds documents that are not in the index. The deletion alone takes so long, because I have ...
    Aug 22, 2005 at 1:30 pm
    Aug 29, 2005 at 3:01 pm
  • Is there any way to do a case-sensitive search? Thanks Tareque ControlDOCS
    Aug 18, 2005 at 7:50 pm
    Aug 22, 2005 at 8:43 pm
  • Hi everybody, Has anyone tried to code a solution like Google's "Did you mean?" in Lucene? I would be very happy to hear your ideas, approaches, suggestions. Best, Martin ...
    Martin Rode
    Aug 29, 2005 at 8:50 am
    Sep 2, 2005 at 4:40 pm
  • Are there any other books (besides "Lucene in Action") perhaps written from a different perspective (e.g. different applications or problem areas)? Karl
    Karl Koch
    Aug 17, 2005 at 6:50 pm
    Aug 30, 2005 at 1:46 pm
  • Hello, Is there a way I can control the IO bandwidth utilized by Lucene? Here is my scenario. RAMDirectory is used to build an in-memory index and finally the index size approaches a limit, the ...
    Gopikrishnan Subramani
    Aug 2, 2005 at 8:05 pm
    Sep 1, 2005 at 10:57 pm
  • When I try to parse a query with an escaped backslash character like this (using Lucene 1.4.3): -id:20677 +(addr:Street143 AND zip:\\) the QueryParser throws an exception: Encountered "<EOF>" at line ...
    Matt Magoffin
    Aug 12, 2005 at 2:30 pm
    Aug 15, 2005 at 7:49 pm
  • Hi All, I have been using Lucene in my application to search over 4 million records updated daily. I am currently using a single index with 21 fields. Some of my fields contain numbers that are ...
    kapilChhabra (sent by Nabble.com)
    Aug 24, 2005 at 7:46 am
    Aug 31, 2005 at 5:10 am
  • All, First, my thanks to those who've contributed to the current best practices for highlighting. We use your code! However, after reviewing recent discussions about highlighting, and struggling with ...
    Fred Toth
    Aug 25, 2005 at 2:50 am
    Aug 26, 2005 at 4:25 pm
  • Hi! I've been having problems with Lucene's QueryParser; apparently it is not thread-safe. That means I can't parse queries in threads where the QueryParser object is created once and reused for each ...
    Aug 23, 2005 at 4:47 pm
    Aug 24, 2005 at 4:09 pm
  • Hey lucene guys. I know for a fact that a bunch of you have been curious about language categorization for a long time now and Java has lacked a solid way to solve this problem. Anyway. This new ...
    Kevin Burton
    Aug 20, 2005 at 2:38 pm
    Aug 22, 2005 at 7:51 am
  • Why do I get this error when writing, for example: Field senderNameField = Field.Text("senderName", senderName); Field subjectField = Field.Text("subject", subject); subjectField.setBoost(1.2); as in the ...
    Riccardo Daviddi
    Aug 4, 2005 at 11:34 am
    Aug 11, 2005 at 10:05 am
  • Is it possible to write a custom sort for a query such that the first N documents that match a certain additional criteria get pushed to the top of the sort? For instance say you sort your query ...
    raymondcreel (sent by Nabble.com)
    Aug 29, 2005 at 8:18 pm
    Sep 9, 2005 at 12:17 am
  • Does anyone have experience using lots of indexes simultaneously with the MultiSearcher? I'm looking to index 15 distinct objects for searching, and was thinking of creating 15 distinct indexes for ...
    Friedland, Zachary (EDS - Strategy)
    Aug 30, 2005 at 10:35 pm
    Sep 2, 2005 at 5:08 pm
  • Hi, I've about five different projects which would need to access a lucene index for searching. The projects are completely unrelated to each other however it's all about the same: indexing HTML ...
    Markus Fischer
    Aug 31, 2005 at 9:56 am
    Sep 1, 2005 at 11:59 am
  • Hello, I've been using Lucene for a few weeks now in a small project and just ran into a problem. My index contains words that contain one or more underscores, e.g. XYZZZY_DE_SA0001 or XYZZZY_AT0001. ...
    Is, Studcio
    Aug 29, 2005 at 2:45 pm
    Aug 30, 2005 at 2:01 pm
  • Greets, As part of my attempt to speed up Plucene and establishing index compatibility between Plucene and Java Lucene, I'm porting InputStream and OutputStream to XS (the C API for accessing Perl's ...
    Marvin Humphrey
    Aug 26, 2005 at 11:51 pm
    Aug 28, 2005 at 10:24 pm
  • I'm adding files to an index over time, so after some time I'm likely to see the same file more than once. I would like to be able to search for the information about that particular instance of the ...
    Chris D
    Aug 10, 2005 at 5:54 pm
    Aug 22, 2005 at 9:18 pm
  • Hi Luceners, apologies. I have seen forms using 'DEFAULT_OPERATOR_AND' with something like below. How do I use DEFAULT_OPERATOR_AND when using MultiFieldQueryParser as below: "query = ...
    Karthik N S
    Aug 17, 2005 at 8:19 am
    Aug 19, 2005 at 11:14 am
  • Not sure if this is appropriate or not, but I just put live a web site that I have been working on for over a year, and it uses Lucene for all its searching. I have 46 million documents in 15 Lucene ...
    Robert Schultz
    Aug 7, 2005 at 10:17 pm
    Aug 8, 2005 at 9:31 pm
  • I am using the Lucene 1.4.3 API. After building the index over 150000 documents (~250 MB data), Lucene does not free the memory that is used during indexing. The searcher runs as a servlet under ...
    Jan Philipp Seng
    Aug 2, 2005 at 1:31 pm
    Aug 3, 2005 at 5:05 pm
  • Hi: I posted a bug (36147) a few days ago and didn't hear anything, so I thought I'd try my luck on this list. The idea is to avoid score calculations on documents to be filtered out anyway. (e.g. ...
    John Wang
    Aug 17, 2005 at 2:46 am
    Sep 21, 2005 at 6:16 pm
  • I have the following strange behavior for an index. The index has been optimized and has no deletions. It's in compound file format. Using Luke 0.6 I can browse by Term and find my term ...
    Dan Climan
    Aug 25, 2005 at 1:06 am
    Aug 26, 2005 at 5:33 pm
  • Does anyone have solutions for handling intraword delimiters (case changes, non-alphanumeric chars, and alpha-numeric transitions)? If the source text is Wi-Fi, we want to be able to match the ...
    Yonik Seeley
    Aug 15, 2005 at 10:16 pm
    Aug 16, 2005 at 4:06 am
  • Hello! I am using Lucene 1.4.3. I'm building a Lucene index that will have about 25 million documents when it is done. I'm adding 250,000 at a time. Currently there are about 1.2 million in there, and ...
    Robert Schultz
    Aug 1, 2005 at 3:19 am
    Aug 1, 2005 at 8:34 pm
  • A simple question and I guess it may have been asked before. Does the order of queries in a BooleanQuery affect search speed? By this I mean if the first clause of a BooleanQuery only returns a few ...
    Paul Illingworth
    Aug 26, 2005 at 3:56 pm
    Dec 5, 2005 at 6:39 am
  • I'm pleased to announce that for about a month now, CNET's "Product Listing" pages are powered by Lucene 1.4.3. These pages not only allow users to browse CNET's catalog of tech products by category, ...
    Chris Hostetter
    Aug 31, 2005 at 2:25 am
    Aug 31, 2005 at 10:54 pm
  • Hello, I have a set of index files that I'd like to distribute with my Java application. The only way this seems practical is to place the index files in a jar file. I tried this, but the search ...
    Thomas Lepkowski
    Aug 29, 2005 at 3:43 pm
    Aug 30, 2005 at 9:12 pm
  • Hi All, Currently, Documents cannot contain other documents. I have a Graph of Objects (Documents) to search in. I could flatten them and search but... Is there any nice way to do it? Rohit
    Rohit Lodha
    Aug 22, 2005 at 6:27 am
    Aug 28, 2005 at 2:23 pm
  • Are there any documents or plans to integrate Lucene with Apache Derby (database)?
    Mag Gam
    Aug 13, 2005 at 10:15 pm
    Aug 16, 2005 at 10:55 am
  • Hi, we are in the process of planning a search feature of a product and we are having quite a hard time figuring out the "right" way to do it. The requirements for our app are the following: 1) Large ...
    Benjamin Reitzammer
    Aug 4, 2005 at 8:54 am
    Aug 15, 2005 at 5:17 am
  • I am using the QueryParser to search for names. If I search for: john j* I'd expect to get everybody called john j-something: john johnson, john joe doe etc. Instead I just get all john and joes. In many ...
    Lasse L
    Aug 12, 2005 at 7:11 pm
    Aug 13, 2005 at 9:23 pm
  • The StandardAnalyzer should work just fine with it. It will break the search string into 5 search terms. HTH Aviran http://www.aviransplace.com From: Karthik N S Sent: Friday, August 05, 2005 1:57 AM ...
    Mordo, Aviran (EXP N-NANNATEK)
    Aug 5, 2005 at 2:28 pm
    Aug 8, 2005 at 12:55 pm
  • Hi, I currently pass the search tokens as a Vector to my query function and construct the string to pass to QueryParser.parse() by hand. StringBuffer qStr = new StringBuffer(); qStr.append("title:" ...
    Markus Fischer
    Aug 29, 2005 at 1:05 pm
    Jun 1, 2007 at 6:16 am
  • Hi, my site has a large database of television and movie titles in English and Spanish. The movie data runs from 1928 to date for selected studios like MGM, Disney etc. The site user ...
    Seema pai
    Aug 30, 2005 at 4:47 am
    Aug 31, 2005 at 1:04 am
  • Hey all. We just relaunched our search feature over here at Xanga.com; the Blogs, Metros and Blogrings sections are powered by Lucene.NET! You can check it out here: http://search.xanga.com/ This is ...
    Monsur Hossain
    Aug 24, 2005 at 2:27 am
    Aug 25, 2005 at 10:45 am
  • I have a custom Analyzer which performs normalization on all of the terms as they pass through. It does normalization like the following: trees - tree Sometimes my normalizer returns multiple words ...
    Dan Armbrust
    Aug 22, 2005 at 7:54 pm
    Aug 22, 2005 at 9:28 pm
  • In case subscribers to this list missed it, my article on how to add a "did you mean" facility to Lucene searches was published last week: ...
    Tom White
    Aug 18, 2005 at 8:18 am
    Aug 18, 2005 at 1:29 pm
  • Hello, I am just posting this question out here since this might be a common problem and some of you might have good pointers. Are there algorithms/APIs built into Lucene that would help de-pluralize ...
    Mufaddal Khumri
    Aug 5, 2005 at 12:08 am
    Aug 17, 2005 at 5:58 pm
  • Hi all, I'm running Lucene 1.9-rc with JDK 1.5/5.0 on JBoss 3.6 with Tomcat 5.0. I'm using JMeter to do my load testing. I'm getting several different exceptions (NullPointer, ArrayIndexOutOfBounds ...
    Andrew Boyd
    Aug 15, 2005 at 3:53 pm
    Aug 15, 2005 at 4:30 pm
  • Hi all, this is a slightly long email. Pardon me. As Lucene does not allow for updating an existing document in the index, the only option is to delete and reindex the message. When you have too many ...
    John Smith
    Aug 11, 2005 at 6:12 pm
    Aug 12, 2005 at 2:26 am
  • maybe this is a stupid question, maybe not... hits.id returns an int.. which would lead me to assume the obvious limitations of the size of the index (size meaning number of docs) ... assuming I ...
    Chris Fraschetti
    Aug 10, 2005 at 10:47 pm
    Aug 11, 2005 at 6:09 am
  • I'm currently attempting to index the distinct list of terms found in a Lucene index using the TermEnum. I'm creating a document with each list and indexing the document of terms. It appears there's ...
    Tim Johnson
    Aug 10, 2005 at 7:29 pm
    Aug 10, 2005 at 10:33 pm
  • Hello I am looking at the RemoteSearchable code for inspiration on how to do remote searches (I will probably use something like SEDA to implement the rpc to avoid heavy thread creation issues of ...
    Ali Rouhi
    Aug 10, 2005 at 6:38 am
    Aug 10, 2005 at 7:43 pm
  • Hi, The "front-end" guys working on my application need a way to pass me complex queries. I was thinking that it'd be pretty straightforward to hand them a package which helps them to create a DOM ...
    Roy Klein
    Aug 10, 2005 at 2:09 pm
    Aug 10, 2005 at 6:52 pm
  • Hi all, I am a new user of Lucene. This query is posted at least once on almost all Lucene mailing lists. The query being about handling of date fields. In my case I need to find documents with dates ...
    Santo santo
    Aug 9, 2005 at 6:27 am
    Aug 9, 2005 at 4:48 pm
  • Hi, Just an idea to make Lucene work with databases more easily. When I communicated with Shay Banon(Compass' author), it came to me that maybe Lucene can be wrapped around JDBC drivers. Let's say ...
    Chris Lu
    Aug 8, 2005 at 11:10 pm
    Aug 9, 2005 at 10:56 am
  • I know there used to be a webpage that gave the algorithm used by Lucene for scoring, along with some info on what each variable controlled, to some extent... I was looking to brush up on what the ...
    Dan Armbrust
    Aug 8, 2005 at 8:05 pm
    Aug 8, 2005 at 9:06 pm
  • hello all, lucene is already pretty fast, but i was wondering if you guys have experience with using gcj (on linux). how much faster is it for indexing? personally i have best performance with ...
    Martin Rode
    Aug 4, 2005 at 2:44 pm
    Aug 8, 2005 at 11:33 am
  • Hi list, Let me describe my issue using a simpler model. Let's say I were to build a blog which allows each post to have multiple keywords. I want to provide a search over the posts but restricted to ...
    N. C. Deepak Ramesh
    Aug 6, 2005 at 4:05 am
    Aug 6, 2005 at 5:57 pm
Group Overview
Group: java-user @

134 users for August 2005

  • Erik Hatcher: 34 posts
  • Otis Gospodnetic: 33 posts
  • Chris Hostetter: 24 posts
  • Derya Kasapoglu: 14 posts
  • Mordo, Aviran (EXP N-NANNATEK): 13 posts
  • Karl Koch: 12 posts
  • Chris Lu: 11 posts
  • Paul Elschot: 11 posts
  • Daniel Naber: 10 posts
  • Jian chen: 10 posts
  • Andrew Boyd: 9 posts
  • Tareque: 8 posts
  • Dan Armbrust: 8 posts
  • Karthik N S: 8 posts
  • Riccardo Daviddi: 8 posts
  • Will (sent by Nabble.com): 8 posts
  • Marvin Humphrey: 7 posts
  • Miles Barr: 7 posts
  • Rajesh Munavalli: 7 posts
  • Yonik Seeley: 7 posts