78 discussions - 334 posts

  • Hi all, I've recently started using lucene and I'm running into the same issue with the query parser. I'd like to use queries that contain dashes in the field name, but as far as I can tell it seems ...
    Jon Pipitone
    May 12, 2003 at 3:23 pm
    Jul 27, 2003 at 12:36 pm
  • List, I'm having problems using an absolute path to the index directory when my web application is deployed in a WAR file. The absolute path changes depending on the server. Is there a way to either ...
    Jason Cox
    May 2, 2003 at 3:40 pm
    May 26, 2003 at 1:50 am
  • Hi list, I'm experiencing a high number of files in the Lucene index, even after running optimize I still have over 600 files in my Lucene index. Now the scary thing is that's about the same number ...
    Victor Hadianto
    May 2, 2003 at 3:32 pm
    May 6, 2003 at 1:02 pm
  • Is there a query object which can match all documents? As an example, suppose my search supports zero or more search criteria. Each criteria, if specified, is required to match the document. If no ...
    May 9, 2003 at 11:49 pm
    May 23, 2003 at 4:35 am
  • Hi all, I have a rather nice html parser that I got from SourceForge. Does anyone know of any good parsers for pdf and Microsoft Office Suite (.doc, .ppt, .xls, etc), any help would be much ...
    Pete Lewis
    May 28, 2003 at 10:48 am
    May 29, 2003 at 2:14 pm
  • Hi - I have created an index of 1.8 million documents, each document containing 5-10 fields. When I run a search, that I know has a small number of hits, it works great. However, if I run a search ...
    Cory Albright
    May 27, 2003 at 6:30 pm
    May 29, 2003 at 4:08 am
  • Hi All, I have a couple of questions regarding using Lucene in an EJB environment. There seem to be a number of problems with respect to the EJB spec and Lucene. The spec says that EJBs shouldn't (amongst other ...
    Leslie Hughes
    May 22, 2003 at 7:16 am
    May 27, 2003 at 1:59 pm
  • Hi, is it possible to get a list of subqueries (array of query objects) after a QueryParser has finished parsing? Thanks for your help Günter
    Günter Kukies
    May 19, 2003 at 2:50 pm
    May 27, 2003 at 6:54 pm
  • Hi, please have a look at the FuzzyTermEnum class in Lucene. There is an impressive implementation of Levenshtein distance there that you can use; simply set the fuzzy distance higher than 0.5 (0.75 ...
    Karsten Konrad
    May 30, 2003 at 7:06 pm
    Jun 5, 2003 at 7:14 am
  • Hello all, As far as I have understood, lucene does not allow search queries starting with wildcards. I have a file database indexed by content and also by filename. It would be nice if the user ...
    Andrei Melis
    May 28, 2003 at 6:52 am
    May 30, 2003 at 12:51 am
  • Hi, I am using Lucene 1.3. I want the 'default' search field to point to multiple fields (actually, all available fields). Is there API support to accomplish this ? thanks for your help, vikas. ...
    Ramrakhiani, Vikas
    May 13, 2003 at 7:35 am
    May 14, 2003 at 9:51 am
  • If I use my search engine is possible to highlight the string in an excel file and open it? Michel -----Original Message----- From: Shoba Ramachandran Sent: Monday, May 12, 2003 8:59 PM To: Lucene ...
    May 12, 2003 at 7:07 pm
    May 14, 2003 at 12:13 am
  • Hello I just started using Lucene, and I'm writing a simple program (swing interface) that adds files to an index (I'm not searching that index yet). In my main frame, I click a button, a FileChooser ...
    Guilherme Barile
    May 21, 2003 at 1:59 pm
    May 21, 2003 at 8:52 pm
  • Using 1.3-RC1, I've got an index where a keyword field contains the primary key value of a database row (an int), and when a user updates the data for the row, I delete the document from the index ...
    Doug Kirk
    May 21, 2003 at 1:33 pm
    May 21, 2003 at 4:04 pm
  • I have an indexer that reads data from a database and indexes the data. foreach(db_row) { Document doc = new Document(); doc.add(Field.Text("Product", productName)); doc.add(Field.Text("Description", ...
    Venkatraman, Shiv
    May 31, 2003 at 2:37 pm
    May 31, 2003 at 5:14 pm
  • Hi, I was just wondering what the rationale is behind lowercasing wildcard queries produced by QueryParser? It's just that my data is all upper case and my analyser doesn't lowercase so it seems a ...
    Leslie Hughes
    May 30, 2003 at 5:24 am
    May 31, 2003 at 1:12 am
  • Hi folks, I have a requirement to find documents similar to another. Can that be accomplished using a PhraseQuery, or some other way? Thanks, Rick ...
    Wirthlin, Rick - Workstream
    May 29, 2003 at 8:31 pm
    May 30, 2003 at 10:44 am
  • Hey, I successfully made an index of the content of a database; the document is constructed as follows: Document doc = new Document(); doc.add(Field.Text("VOORNAAM", voornaam)); ...
    May 15, 2003 at 1:40 pm
    May 15, 2003 at 3:04 pm
  • Hi, I did it, but I use only lucene. You need to create an IndexWriter with SimpleAnalyzer, an InputStream as new FileInputStream, create Document with two Fields: one contains the file path and one ...
    May 1, 2003 at 5:25 am
    May 6, 2003 at 2:43 pm
  • Hi, does anybody know the best way to implement in Lucene a functionality (that Google has) like this: Search text- notebok Answer- Did you mean: notebook ? Thanks, Dario ...
    Dario Dentale
    May 30, 2003 at 9:13 am
    May 30, 2003 at 5:16 pm
  • After deleting a document from the index, and then adding a document to the index (same doc with updated info), it seems that the IndexSearcher doesn't find the updated document. Whether I specify no ...
    Doug Kirk
    May 22, 2003 at 5:34 pm
    May 23, 2003 at 3:38 am
  • Hi, I have a jsp which has five text box. The user puts in some text in 1 or more text boxes. I want to conduct a multi field wildcard search. Eg: for textBox1 the user enters "hello" for textBox2 ...
    Subhrajyoti Moitra
    May 17, 2003 at 9:14 am
    May 21, 2003 at 11:14 am
  • Hello, I use the following filters: public TokenStream tokenStream(String fieldName, Reader reader) { TokenStream result = new StandardTokenizer(reader); result = new StandardFilter(result); result = ...
    Günter Kukies
    May 15, 2003 at 12:43 pm
    May 19, 2003 at 10:07 am
  • Hi all, In our application we are indexing a massive amount of documents in parallel over 16 machines. Our problem is that we have duplicate documents in the input and the same document is indexed ...
    Victor Hadianto
    May 9, 2003 at 12:59 am
    May 9, 2003 at 7:58 am
  • Instead of Lucene, xml.apache.org uses Google. What are the reasons for not using Lucene? Andreas ...
    Andreas Kuckartz
    May 16, 2003 at 3:13 pm
    May 17, 2003 at 12:35 am
  • Good morning all, Hats off to the folks involved in this project, you guys and gals are the true programmers. Writing a search engine is a dirty job and I am glad that this community is available to ...
    Bryan LaPlante
    May 9, 2003 at 11:24 am
    May 12, 2003 at 12:29 pm
  • I am having a tough time locating the source of my problem. If anyone is interested in using the taglib for lucene and would care to interact with me on this, a second pair of eyes would be greatly ...
    Bryan LaPlante
    May 21, 2003 at 8:39 pm
    May 28, 2003 at 8:54 pm
  • What is wrong with Document.setBoost(float) /** Sets a boost factor for hits on any field of this document. This value * will be multiplied into the score of all hits on this document. */ If you know ...
    Eric Isakson
    May 16, 2003 at 4:00 pm
    May 17, 2003 at 1:04 am
  • Hi all, I've got a question regarding how Lucene prioritizes complex --like "yoggy drop the honey pot"-- query results. I wanted lucene to show first those results having ALL terms. By default lucene ...
    Xavier Guardiola
    May 16, 2003 at 8:09 am
    May 16, 2003 at 9:41 pm
  • Can someone help me with using date fields? I need to perform searching on date fields. I am able to add date fields to index using DateField.dateToString(messageDate) - where messageDate is a Date ...
    Sushma Sinha
    May 1, 2003 at 9:33 am
    May 1, 2003 at 2:42 pm
  • Hi list, I am having a weird problem. I have a PDF document. Using PDFBox library i get the contents of the PDF file as a String. This string gets indexed using the StandardAnalyser. This document ...
    Subhrajyoti Moitra
    May 28, 2003 at 7:10 am
    May 28, 2003 at 7:27 am
  • Is there a (better) way that I can use to figure out which field in a document caused the document to be returned from a query? Currently, after I do a search across all of my fields and documents, I ...
    Armbrust, Daniel C.
    May 27, 2003 at 6:17 pm
    May 27, 2003 at 6:58 pm
  • What do you mean, one-user-at-a-time? I don't think either Lucene or JCA has that limitation. -----Original Message----- From: Guilherme Barile Sent: Thursday, May 22, 2003 10:04 AM To: Lucene Users ...
    Lichtner, Guglielmo
    May 22, 2003 at 2:23 pm
    May 22, 2003 at 3:48 pm
  • Has anyone tried this? Its home page states: CLucene is faster than lucene as it is written in a C++. Otis --- otis@apache.org wrote:
    Otis Gospodnetic
    May 21, 2003 at 9:38 pm
    May 22, 2003 at 3:33 pm
  • Hi, I am trying to write a delete method using delete(int docNum) from the IndexReader class. The problem is that I don't know how to get the docNum parameter. Can you please help me. My idea was to ...
    Marie-Hélène Forget
    May 22, 2003 at 2:52 pm
    May 22, 2003 at 3:22 pm
  • I have just downloaded the Lucene 1.2 distribution. After running through the demo, I decided to index (using IndexFiles that is used to load the demo documents) about 100 XML files that I have ...
    Joe Paulsen
    May 21, 2003 at 3:41 pm
    May 21, 2003 at 6:53 pm
  • Hello: I'm using lucene (lucene-1.3-dev1) and tomcat (4.1) for my company's websites. The largest number of documents contained in any index is approx. 800,000. I'm getting OutOfMemoryErrors ...
    May 2, 2003 at 9:30 pm
    May 21, 2003 at 8:40 am
  • Erm, isnt "PageRank" trademarked by Google and covered by a patent (6285999) ?? So we wouldn't want to implement it and infringe would we ? ;-) ...
    Leslie Hughes
    May 19, 2003 at 2:32 am
    May 19, 2003 at 3:33 pm
  • Hello, How do I create my own Analyzer that implements PorterStemFilter + StopFilter + LowerCaseFilter? Thank you Andy ...
    Andy Nauli
    May 7, 2003 at 3:06 am
    May 7, 2003 at 5:29 am
  • hello, I am just starting looking at lucene for my project. Before I proceed, I would like to know if it's a good idea to use lucene for creating index and also performing statistical analysis on the ...
    Andy Nauli
    May 2, 2003 at 9:33 am
    May 2, 2003 at 5:29 pm
  • We toString() and reparse queries like the following with no apparent problems in version 1.2 but something undesirable could have crept into 1.3 RC-1. +kot:"projects" +added:[19990101-null] ...
    Hoad, Richard (AFIS)
    May 1, 2003 at 7:05 am
    May 1, 2003 at 3:12 pm
  • I found some references to an SQLDirectory class in the mailing list archives but I was unable to actually locate the package anywhere in the CVS (I looked in both the primary and the sandbox) nor ...
    Anthony Eden
    May 31, 2003 at 3:27 am
    Jun 2, 2003 at 8:10 am
  • Hello, Is the cost of opening an IndexSearcher proportional to anything, e.g. physical index size, number of segments? Thanks. -- Herman ...
    Herman Chen
    May 29, 2003 at 9:23 am
    May 29, 2003 at 9:52 am
  • I think it's possible, but I'm not sure how Scorers work. I just want to place the most recent hits at the front and the oldest ones at the back (where "date" is a field in the documents). Is there a ...
    David Weitzman
    May 28, 2003 at 12:32 am
    May 28, 2003 at 1:42 pm
  • Hi, I am having a problem using MultiSearcher and I want to ask if I am using it properly. Every other run of my jsp page throws an exception on the msearcher.search(query); line of code, otherwise it ...
    Bryan LaPlante
    May 23, 2003 at 9:02 pm
    May 24, 2003 at 8:22 am
  • Hello, [Joe Paulsen [joseph.paulsen@verizon.net]]: Check out: <http://jakarta.apache.org/lucene/docs/api/index.html> - class org.apache.lucene.index.IndexWriter - description of maxFieldLength: " The ...
    Materna, Wolf-Dietrich (empolis B)
    May 21, 2003 at 3:56 pm
    May 21, 2003 at 6:56 pm
  • Hi guys, I am new to Lucene but I like it a lot. I am trying to develop a search engine. I have two problems: First, I wrote a program for indexing files; here is the code: public void indexFile(String ...
    May 20, 2003 at 7:07 pm
    May 20, 2003 at 7:13 pm
  • All, I've got a query of the form "A or B or ( C or D or E )" where the results must contain the terms A or B, but should be rated higher if they contain C,D, or E as well. Is there a good way to do ...
    Tom Eskridge
    May 19, 2003 at 4:56 pm
    May 19, 2003 at 5:43 pm
  • Is it possible to search using a query like price:<15.00 or something similar? This does not seem possible to me since '<' and '>' are not query operators. Also, the index created for field 'price' ...
    Song, Xuekai
    May 15, 2003 at 3:30 pm
    May 15, 2003 at 3:37 pm
  • Hey, is it possible to load a FSDirectory at the start of my program into a RAMDirectory and write it back to a FSDirectory when I'm closing down?
    May 15, 2003 at 1:35 pm
    May 15, 2003 at 1:42 pm
Group Overview
Group: java-user

96 users for May 2003

Otis Gospodnetic: 27 posts Guilherme Barile: 13 posts Rob Outar: 13 posts Eric Jain: 11 posts Shoba Ramachandran: 11 posts Victor Hadianto: 10 posts Aviran Mordo: 9 posts Bryan LaPlante: 9 posts Leslie Hughes: 9 posts Terry Steichen: 9 posts David Medinets: 8 posts Leo Galambos: 8 posts Tatu Saloranta: 8 posts David_birthwell: 7 posts Doug Kirk: 7 posts Kelvin Tan: 7 posts D-Fuse: 6 posts Mmachado: 5 posts Andy Nauli: 5 posts Cory Albright: 5 posts