86 discussions - 327 posts

  • I am trying to index a set of data, storing only a "primary key". This primary key I left un-indexed. There is one "text" field, that I indexed and tokenized. The others I neither want to store or ...
    Cecil, Paula New
    Nov 20, 2001 at 2:47 am
    Nov 29, 2001 at 2:28 am
  • hi to all, please help! I think I mixed my brain up already with this stuff... I'm trying to index about 29 textfiles where the biggest one is ~700Mb and the smallest ~300Mb. I achieved once to run ...
    Chantal Ackermann
    Nov 28, 2001 at 12:51 pm
    Nov 29, 2001 at 10:09 pm
  • funny...I was just about to write something along the same lines.. I have 700.000 entries, in all about 1 gig of data. And when I search I have to allocate at least 150meg to the java-process or ...
    Anders Nielsen
    Nov 9, 2001 at 1:13 am
    Nov 13, 2001 at 1:30 pm
  • Hi Since upgrading to 1.2 we've started getting the following error when creating an index in a directory with a large amount of files. Previous versions of Lucene were quite happy to index this ...
    Daryl Thachuk
    Nov 2, 2001 at 12:49 am
    Nov 9, 2001 at 5:05 pm
  • I am attempting to build the example code which is located in the \lucene-1.2-rc2\src\demo\org\apache\lucene directory in the distribution. Specifically, I'm trying to get IndexFiles.java, ...
    Joshua O'Madadhain
    Nov 14, 2001 at 12:23 am
    Nov 17, 2001 at 9:23 am
  • Hello. Does anyone know of a way to sort search results other than by score? It seems like it would be very useful to be able to sort by date or maybe even by any field that has been indexed (which I ...
    Jeff Kunkle
    Nov 16, 2001 at 10:03 pm
    Nov 19, 2001 at 8:25 pm
  • If I use a swedish national character as the first character of a search term, for example: öl (which is beer in Swedish :)) I get the following error: com.lucene.queryParser.TokenMgrError: Lexical ...
    Mikael Andersson
    Nov 13, 2001 at 1:04 pm
    Nov 15, 2001 at 8:46 am
  • This is a repost of a question posted to jGuru Lucene Forum. Didn't get a response there so I'm trying my luck here... What's the recommended way of performing multi-field searches? a.. ...
    Kelvin Tan
    Nov 12, 2001 at 10:05 am
    Nov 20, 2001 at 10:37 pm
  • Hello, We have been using PDFHandler - a pdf parser provided by websearch, to search in pdf files. We are trying to get the contents using pdfHandler.getContents() to arrive at a context-sensitive ...
    Nov 23, 2001 at 8:40 am
    Nov 25, 2001 at 8:13 pm
  • Hello all- I'm currently having a problem overwriting an old index. Every night, the contents of a database I'm using get updated, so the lucene indexes are also recreated every night. The technique ...
    Tom Barrett
    Nov 16, 2001 at 4:10 pm
    Jan 15, 2002 at 10:22 am
  • Hi all, Thanks for all your continuing help! I have got the go ahead to build a production-level prototype of my project. I have to be able to serve several 100s of queries a second (on big boxes), ...
    Winton Davies
    Nov 14, 2001 at 11:11 pm
    Nov 15, 2001 at 6:25 pm
  • These are mostly things that I wrote years ago when first developing Lucene, e.g., to test the index code before the search code was written, etc. They were mostly never really standalone test ...
    Doug Cutting
    Nov 1, 2001 at 11:23 pm
    Nov 2, 2001 at 3:04 am
  • What exactly is the error that you are getting? Is it an Ant error or a Lucene error? Does Ant know to convert forward slashes to back slashes on Windows? Have you tried using \ instead of / in those ...
    Otis Gospodnetic
    Nov 2, 2001 at 3:12 am
    Nov 2, 2001 at 11:04 pm
  • In the FAQ in Searching question 41, it says about index modification: "The problems are only when you add documents or optimize an index, and then search with an IndexReader that was constructed ...
    Avi Drissman
    Nov 27, 2001 at 9:47 pm
    Nov 28, 2001 at 11:03 am
  • Hi, Does anyone know what's going on with JavaCC? I'm trying to find the source code, as I was under the impression it's Free Software? However, I seem to browse around and around various sites but ...
    Lee Mallabone
    Nov 12, 2001 at 1:42 pm
    Nov 12, 2001 at 4:14 pm
  • Hi, I ran into a problem earlier this week, whereby an index of 8 million small documents resulted in an index file of 2GB. It turns out this is a common file system limit (some say it might be a ...
    Winton Davies
    Nov 2, 2001 at 9:24 pm
    Nov 9, 2001 at 10:47 pm
  • How could I make a search with something like "tes*" for test, testing, ...? Currently, I use a StandardAnalyzer for indexing and searching (with query parser), and it doesn't work. Do I need to use ...
    Nov 6, 2001 at 5:10 pm
    Nov 7, 2001 at 3:38 pm
  • Is range searching working in RC2? I have the following documents in my index (using StandardAnalyzer): id:doc1 age:30 FirstName:John id:doc2 age:40 FirstName:Wendy The following queries do not ...
    Paul Friedman
    Nov 1, 2001 at 2:46 pm
    Nov 1, 2001 at 10:14 pm
  • I have started to create a set of generic lucene document types that can be easily manipulated depending on the fields. I know others have generated Documents out of PDF. Is there some place we can ...
    Nov 29, 2001 at 5:03 pm
    Nov 30, 2001 at 5:03 pm
  • Hi all, I have a doubt. I know that lucene can index html and text documents, but can it index other type of documents like pdf,docs, and xls documents? if it can, how can I implement it? Perhaps can ...
    Antonio Vazquez
    Nov 29, 2001 at 3:41 pm
    Nov 30, 2001 at 12:00 pm
  • I've used Verity in Cold Fusion to index Databases. Is this possible with Lucene? From recent posts, it looks like I would have to write a custom parser to convert each row into a text document. Am I ...
    Weaver, Scott
    Nov 29, 2001 at 4:13 pm
    Nov 29, 2001 at 10:35 pm
  • Hi, has anyone done anything to autodetect the language of an HTML document which will be indexed by Lucene? I will use Lucene to index a multilingual portal and want to filter the hits by language. ...
    Strittmatter Stephan (external)
    Nov 23, 2001 at 1:56 pm
    Nov 28, 2001 at 9:20 am
  • Hi! I am trying to use Lucene with russian texts. I created an index of xml documents (UTF-8 encoded), but when I am trying to search an index with a query from a servlet, it seems, that Lucene just ...
    Philipp Chudinov
    Nov 12, 2001 at 10:15 pm
    Nov 13, 2001 at 12:08 am
  • My search works for At&t with the ampersand in the middle. However it doesn't work for e-commerce with the dash in the middle. Anything I have to do with the analyzers/filters to fix this? Thanks. ...
    Nov 9, 2001 at 12:42 am
    Nov 9, 2001 at 3:39 am
  • Are there any ready made implementations of Lucene like i know 1 of http://www.i2a.com/ please let me know if there are others
    Amit Roy (Yahoo)
    Nov 6, 2001 at 1:04 pm
    Nov 7, 2001 at 6:35 am
  • Hi, prefix query seems to be working only with lowercase words. Example: indexed word = Albanien alb* finds word Alb* doesn't find word I'm using StopAnalyzer(). manfred -- Manfred Schäfer ...
    Manfred Schäfer
    Nov 23, 2001 at 5:30 pm
    Dec 5, 2001 at 12:56 pm
  • Hi Steven, thanks for your reply. Here is my scenario: 1) I want to grab the website "xy.com" to the local disc at C:\xy 2) While exporting I want to index the content to C:\xy\index 3) I put ...
    Christoph Breidert
    Nov 26, 2001 at 12:08 pm
    Nov 29, 2001 at 5:46 pm
  • I have posted several questions/examples regarding QueryParser throwing exceptions when the query string contains various punctuation characters. I have been downloading the nightly builds hoping ...
    Paul Friedman
    Nov 26, 2001 at 3:56 pm
    Nov 27, 2001 at 8:10 pm
  • Hi there, I just started playing around with lucene and I was wondering if there is a simple method to determine the line number of a match within a file. The documentation did not give me any ...
    Lind Jürgen
    Nov 22, 2001 at 5:02 pm
    Nov 25, 2001 at 10:52 am
  • Hello, This is from a thread from about 2 weeks ago. What is the answer to this question? If data is written to disk only when IndexWriter's close() is called, wouldn't the sample code below be as ...
    Otis Gospodnetic
    Nov 22, 2001 at 6:41 am
    Nov 22, 2001 at 7:24 pm
  • Can anyone tell me the advantages of Lucene indexer for indexing data located in a database or LDAP server over creating another index in the database or from an LDAP console? There got to be some ...
    Emmanuel Bridonneau
    Nov 20, 2001 at 10:03 pm
    Nov 21, 2001 at 7:06 pm
  • This is my first message to this list. I have successfully created several little tests of the Lucene api. In my last test, I am trying to index "data records". Only the "primary key" needs to be ...
    Cecil, Paula New
    Nov 18, 2001 at 11:12 pm
    Nov 20, 2001 at 1:19 am
  • Hello, Is it possible to enumerate ALL the documents in a Lucene index, say for house-keeping purposes. Thanks in advance, Paul Cunningham
    Paul Cunningham
    Nov 17, 2001 at 11:18 am
    Nov 18, 2001 at 11:13 am
  • Can somebody please help, this is a BIG issue for the project I'm working on? I have attached a test case showing that QueryParser (using StandardAnalyzer) throws a TokenMgrError when parsing a query ...
    Paul Friedman
    Nov 14, 2001 at 3:26 pm
    Nov 16, 2001 at 10:16 pm
  • I am trying to construct a term-term correlation matrix from the data stored in the index, for an extension to the vector model that I am researching. In case my terminology is unfamiliar, what I ...
    Joshua O'Madadhain
    Nov 16, 2001 at 7:44 am
    Nov 16, 2001 at 7:15 pm
  • The method DateField.dateToString(Date date) throws an exception if the long value of Date.getTime() is negative. Therefore, it is not possible to 'encode' dates that are prior to 1970. I think a ...
    Ogren, Philip V.
    Nov 14, 2001 at 2:08 pm
    Nov 14, 2001 at 10:05 pm
  • Hi, I'm using Lucene in an archived e-mail full-text search system and I'm agreeing with it's flexibility and speed, but I've got a doubt about querying capabilities. In one of Lucene's site linked ...
    Alessio Fiore
    Nov 14, 2001 at 8:27 am
    Nov 14, 2001 at 3:35 pm
  • While comparing RAMDirectory vs FSDirectory indexing performance I ran across some strange behavior. If I try to add 1000 documents using the RAMDirectory and then write it to disk, the search fails ...
    Paul Friedman
    Nov 9, 2001 at 10:26 pm
    Nov 12, 2001 at 2:03 pm
  • Paul, Thanks for the nice test case! This bug was fixed a week or so ago. Try the latest nightly release from: http://jakarta.apache.org/builds/jakarta-lucene/nightly/ Using that, I get the desired ...
    Doug Cutting
    Nov 9, 2001 at 11:13 pm
    Nov 10, 2001 at 2:13 am
  • I am unable to find the expected documents when I execute a query on a keyword whose value contains punctuation. Here is the test case: import org.apache.lucene.queryParser.*; import ...
    Paul Friedman
    Nov 7, 2001 at 10:45 pm
    Nov 9, 2001 at 1:13 am
  • Hi, I am new to Lucene. I would like to know if there is an archive of this group anywhere. Thanks in advance, vyas
    Duggirala Vedavyas
    Nov 8, 2001 at 6:46 am
    Nov 8, 2001 at 9:49 am
  • To whoever maintains the javadocs... The javadoc for QueryParser.parse(...) says that it throws a ParseException if an error occurs, in fact, it throws an org.apache.lucene.queryParser.TokenMgrError. ...
    Paul Friedman
    Nov 6, 2001 at 4:42 pm
    Nov 7, 2001 at 2:06 pm
  • Hi, I've a queryParser error if the first character is a special character. ie : ' @ ç à ... It doesn't appear with lucene old version (<1.1). Is it a bug or something has changed in the way to call ...
    Nov 6, 2001 at 4:26 pm
    Nov 7, 2001 at 11:19 am
  • How difficult would it be to get BooleanQuery to do a standalone NOT, do you suppose? That would be very useful in my case. Scott
    Scott Ganyo
    Nov 1, 2001 at 12:37 pm
    Nov 2, 2001 at 12:20 pm
  • [ and ] are used for RangeQuery. They indicate an inclusive range. For example: "name:[adam-scott]"
    Scott Ganyo
    Nov 1, 2001 at 12:35 pm
    Nov 1, 2001 at 2:25 pm
  • I noticed a post discussing term vectors on the lucene-dev list. It is great that people are working on adding term vectors to Lucene. Stored term vectors per field (or document) are a great way to ...
    Nov 30, 2001 at 2:10 pm
    Nov 30, 2001 at 3:35 pm
  • I have noticed that when I kill/interrupt an indexing process, that it leaves a "lock" file, preventing further indexing. This raises a couple of questions: a. When I simply delete the file and ...
    New, Cecil (GEAE)
    Nov 29, 2001 at 5:05 pm
    Nov 29, 2001 at 5:12 pm
  • Hi all, I got the lucene source files. When I started to compile all packages again in some order, it is giving some error saying some class not found... the order in which I compiled is given below. ...
    Srinivasa v
    Nov 28, 2001 at 3:32 pm
    Nov 28, 2001 at 3:58 pm
  • Hi, I am trying to learn firstly how to post messages thanks! Ferdinand Damon
    Ferdinand Damon
    Nov 28, 2001 at 11:04 am
    Nov 28, 2001 at 11:21 am
  • hi while testing the SqlDirectory, I found some really strange thing: scenario is concurrent writer and searcher: 1. an IndexWriter is started and creates a write.lock until the close method is ...
    Marc Kramis
    Nov 27, 2001 at 10:08 pm
    Nov 27, 2001 at 10:23 pm
Group Overview
group: java-user

89 users for November 2001

  • Doug Cutting: 29 posts
  • Winton Davies: 29 posts
  • Paul Friedman: 19 posts
  • Ian Lea: 16 posts
  • Otis Gospodnetic: 14 posts
  • Steven J. Owens: 11 posts
  • Kelvin Tan: 10 posts
  • Cecil, Paula New: 9 posts
  • Anders Nielsen: 8 posts
  • Karl Øie: 8 posts
  • Scott Ganyo: 8 posts
  • Brian Goetz: 7 posts
  • David Bonilla: 7 posts
  • Emmanuel Bridonneau: 6 posts
  • Jeff Trent: 6 posts
  • Carlson: 5 posts
  • Daryl Thachuk: 5 posts
  • Joshua O'Madadhain: 5 posts
  • Amit Roy (Yahoo): 4 posts
  • Christophe: 4 posts