FAQ

72 discussions - 228 posts

  • Hello! I have a problem, a big one. I have successfully indexed 600 MB of XML data, but the search can't give any results if the field contains any '-' characters. For example: compound@cgx-code:[2 ...
    Hermit
    Feb 3, 2003 at 7:49 am
    Apr 7, 2003 at 6:35 pm
  • My first question: I tried to write a phrase query; below is my attempt. When I do a search, the search content is in " " but it does not work. Any idea what is wrong? I'm using the index created by the ...
    Alex wong
    Feb 24, 2003 at 6:43 pm
    Feb 25, 2003 at 5:26 pm
  • Hi! Has anyone tried running Lucene successfully in WebLogic? If so, kindly guide me, as I am new to WebLogic 6.0. Thanks, Nellai...
    Nellai
    Feb 14, 2003 at 7:54 am
    Feb 18, 2003 at 10:22 pm
  • Hello, writer.addDocument(doc) is throwing a NullPointerException. The stack trace from the caught exception is only one line, "NullPointerException", without anything else. I open the IndexWriter ...
    Günter Kukies
    Feb 21, 2003 at 12:43 pm
    Feb 24, 2003 at 5:03 pm
  • Hi, I use PorterStemmer with my analyzer for indexing the documents. And I have been using the same analyzer for searching too. When I search for a phrase like "security" AND database, I would like ...
    Mailing Lists Account
    Feb 12, 2003 at 9:26 am
    Feb 14, 2003 at 9:54 pm
  • Rob, We ran into this problem too, and our solution was to use a native PDF text extractor (PDFBox just can't seem to handle large PDFs well). Basically, we try to parse with the native app first, ...
    Matt Tucker
    Feb 18, 2003 at 10:42 pm
    Feb 25, 2003 at 10:37 pm
  • Hi, I have successfully built Lucene using ant (thank you for the help!), and I am now trying to follow the demo given on the page http://jakarta.apache.org/lucene/docs/demo.html. I have set my Classpath ...
    Butleree
    Feb 13, 2003 at 1:02 pm
    Feb 14, 2003 at 7:45 am
  • That actually brings me to another question... 1) To what extent are wildcards supported by Lucene? 2) What are the known bugs, and where can I find the relevant documentation pertaining to them? 3) ...
    Rishabh Bajpai
    Feb 1, 2003 at 4:32 am
    Feb 2, 2003 at 6:50 pm
  • Hi! I am trying to parse some JSP files and I am trying to change the HTMLParser.jj code to accommodate this. As mentioned in the FAQ, I created the third comment tag type in the void CommentTag() :, ...
    Pinky Iyer
    Feb 28, 2003 at 5:04 pm
    Mar 2, 2003 at 2:05 pm
  • Hi all, I'm trying to gather information about my non-searched (i.e. not used for the search) fields. Let's take an index with 2 fields: 'artist' (for the artist name) and 'type' (for his type of ...
    Brisbart Franck
    Feb 4, 2003 at 8:12 am
    Apr 8, 2003 at 8:42 am
  • Hello friends, If it is not much of a trouble, I would like to ask as many of you as possible, to post some statistics. This would preferably include 1. Size of the index. 2. No of documents indexed. ...
    Samir Satam
    Feb 27, 2003 at 5:22 pm
    Feb 28, 2003 at 4:10 am
  • Hey there, is there an existing FrenchAnalyser with a Porter algorithm and a STOP_WORDS list? Or do I have to create one? Thanks for your help ;) Pierre Lacchini Consultant développement PeopleWare ...
    Pierre Lacchini
    Feb 27, 2003 at 2:17 pm
    Feb 27, 2003 at 2:37 pm
  • Hi, my application uses a GermanAnalyzer for tokenizing a search string and constructing Query classes: Analyzer an = new org.apache.lucene.analysis.de.GermanAnalyzer(); TokenStream ts = ...
    Volker Luedeling
    Feb 17, 2003 at 12:00 pm
    Feb 17, 2003 at 6:44 pm
  • I have an index which, when searched with this query ("cloning clone animal") produces 1103 hits. A different, more narrow query ("(cloning clone) AND animal") produces only 19 hits. What's puzzling ...
    Terry Steichen
    Feb 15, 2003 at 9:08 pm
    Feb 16, 2003 at 3:04 pm
  • Hi, I am using Lucene to index a collection of XML documents. Each XML document has a relative pathname to a DTD file in the DOCTYPE tag. The problem is that the relative path names are resolved relative ...
    Icewind
    Feb 14, 2003 at 8:08 pm
    Feb 14, 2003 at 9:27 pm
  • Hello, I'm trying to index HTML files with Lucene. Do you know what the best HTML parser in Java is? The most powerful? I need to extract meta tags and many other different text fields... Thanks for your ...
    Pierre Lacchini
    Feb 24, 2003 at 1:28 pm
    Feb 25, 2003 at 6:48 pm
  • Hello, Is there any simple way to get the information from the search results on which of the query terms contributed the most to the document's score? I'm working on an application which could use ...
    Andrzej Bialecki
    Feb 24, 2003 at 3:10 pm
    Feb 24, 2003 at 5:47 pm
  • Hello, why does the QueryParser have problems interpreting this query: doctypeid:0001? doctypeid is a Keyword field. Maybe numbers are not allowed here, but how can I escape them? I use the StopAnalyzer. ...
    Günter Kukies
    Feb 18, 2003 at 5:18 pm
    Feb 20, 2003 at 8:40 am
  • Hi All, I've spent the weekend mulling over a problem I had with a RangeQuery over a Price field. The problem was that prices can range from 1 to many digits (no cents, just dollars). My searches ...
    Host unknown
    Feb 17, 2003 at 2:31 pm
    Feb 17, 2003 at 10:05 pm
  • I might be missing something, but I didn't find any Document.setBoost(float) {is it float?} either in the javadocs (http://jakarta.apache.org/lucene/docs/api/index.html), or in the Lucene source ...
    Rishabh Bajpai
    Feb 6, 2003 at 6:54 am
    Feb 7, 2003 at 2:03 pm
  • Hi, is there any way to filter the search based on the modified date? For example, I need to fetch only those documents whose modified date or < or between. Can anyone help me solve this? Thanks ...
    Nellai
    Feb 3, 2003 at 10:10 am
    Feb 3, 2003 at 2:59 pm
  • Could it be that microsoft is found because the search is not case-sensitive (text), and ct is not found because there the search is case-sensitive (Keyword)? Did you try +state:CT ...
    Kalthoff, Marcus
    Feb 28, 2003 at 6:17 am
    Feb 28, 2003 at 2:48 pm
  • Hi! I don't want to give write permission to the index directory while searching. But lucene needs write permission on index directory so that it can create locks while searching. So, I tried to use ...
    Giri, Sandeep
    Feb 13, 2003 at 8:12 pm
    Feb 28, 2003 at 8:00 am
  • Hi, I can understand that you need additional + before the search-words to get a "real" AND, but what confuses me is that 10060 && 10040 (which should be the same as AND) does a real boolean AND like ...
    Kalthoff, Marcus
    Feb 27, 2003 at 1:40 pm
    Feb 27, 2003 at 3:07 pm
  • http://sourceforge.net/projects/weblucene/ WebLucene: a Lucene Web interface that uses XML as a lightweight protocol. Developers convert a data source (text, DB, MS Word, PDF... etc) into standard XML format ...
    Che Dong
    Feb 20, 2003 at 3:24 am
    Feb 21, 2003 at 6:20 am
  • Hello, I'm using Lucene, and I need to index an XML database (Tamino). How can I do that? Do I have to use an XML parser such as Digester? I'm kind of a newbie with Lucene, and I really need help ;) Thanks ...
    Pierre Lacchini
    Feb 14, 2003 at 9:28 am
    Feb 17, 2003 at 12:34 am
  • Hello all, Any tips on how to index the following? xml / xschema files in both the filesystem and inside jars in the filesystem? I would only want to index based on certain markers like <description ...
    Hanasaki JiJi
    Feb 12, 2003 at 1:51 am
    Feb 12, 2003 at 9:41 pm
  • Hello all! What's score(i) in the Hits object? Is it used to calculate the % of occurrence of the search parameter? Can anyone tell us the exact use of score()? Thanks a ton, Nellai...
    Nellai
    Feb 10, 2003 at 12:47 pm
    Feb 11, 2003 at 10:38 am
  • Hi all, My Lucene IndexSearcher returns too few hits when I use some extended query syntax. I'll give examples of my query/hits pairs at the bottom. I'm indexing a database table: <CODE //creating a ...
    Marcel Stör
    Feb 5, 2003 at 3:01 pm
    Feb 6, 2003 at 4:33 pm
  • G'day one and all, I'm having some problems getting Lucene to do case-insensitive searches. I have looked through the list archives and found that using the StandardAnalyzer should do the trick, ...
    Stephen Eaton
    Feb 4, 2003 at 2:15 pm
    Feb 4, 2003 at 3:57 pm
  • Hi! Should I use document.setBoost() only when searching, or can I set it while indexing? The idea is that some documents can have higher priority than others, so it would be great if I could set ...
    Philipp Chudinov
    Feb 2, 2003 at 10:34 pm
    Feb 2, 2003 at 11:05 pm
  • It is a web app, but I am absolutely sure it is correct. I just wrote a log.debug() to prove this. I suppose lucene simply ignores AND and OR, because OR is default anyway. But it does not ignore && ...
    Kalthoff, Marcus
    Feb 27, 2003 at 2:39 pm
    Feb 27, 2003 at 3:31 pm
  • I've had fairly good experience with Jtidy! But HTMLParser http://htmlparser.sourceforge.net/ seems to have the lighter looking API. It is Event based and I might need to parse some large HTML ...
    Nestel, Frank IZ/HZA-IC4
    Feb 26, 2003 at 9:00 am
    Feb 26, 2003 at 8:51 pm
  • Is it possible to perform a complex query? On the front end, I execute a search based on 2 fields... Is it possible to combine those 2 different queries? For example: searching the word "Peace" in ...
    Pierre Lacchini
    Feb 25, 2003 at 9:20 am
    Feb 25, 2003 at 3:07 pm
  • I made a small mistake in my example. My application converted all characters to lowercase while indexing. When I comment this out, "Etagenwohnung" remains unchanged after stemming. So, my example is ...
    Volker Luedeling
    Feb 24, 2003 at 12:22 pm
    Feb 24, 2003 at 2:29 pm
  • Can BooleanClause be made serializable? I am constructing a compound query object in a servlet, and sending it to my back-end server which performs a lucene search and returns the results. All of the ...
    Newsham
    Feb 11, 2003 at 11:36 pm
    Feb 23, 2003 at 6:49 am
  • Hi, I am writing an application that constructs Lucene searches from XML queries. Each item from the XML is represented by a Query of the corresponding type. I have a problem when I try to search for ...
    Volker Luedeling
    Feb 14, 2003 at 9:58 am
    Feb 16, 2003 at 3:38 am
  • Hi, I am using Lucene right now to index several semi-structured documents. I recently had to implement a method 'getFrequencyVector()' to simply return a mapping of keyword - frequency from the ...
    Ext
    Feb 14, 2003 at 2:10 pm
    Feb 14, 2003 at 10:01 pm
  • Hi, I am new to Lucene, at the moment I'm trying to get it to work. I am using Windows 98 and I have installed ant and set my classpaths etc, however on running ant from c:\lucene\lucene-1.2-src I am ...
    Butleree
    Feb 13, 2003 at 12:36 am
    Feb 13, 2003 at 12:58 am
  • Hello, This is already done. You have to use the latest developer version from the CVS. Look at <http://jakarta.apache.org/site/cvsindex.html> for how to get it. Regards, Wolf-Dietrich -- Wolf-Dietrich ...
    Materna, Wolf-Dietrich (empolis B)
    Feb 12, 2003 at 9:44 am
    Feb 12, 2003 at 7:03 pm
  • Hi! Can anyone tell me how to calculate the % of relevance using Lucene? Thanks in advance, Nellai...
    Nellai
    Feb 11, 2003 at 2:48 pm
    Feb 11, 2003 at 3:14 pm
  • Redirecting to lucene-user. No, not out of the box. You have to get the whole field of a document and look for the pieces of query in it yourself. Otis --- marco scibetta wrote: telefonino
    Otis Gospodnetic
    Feb 4, 2003 at 3:27 pm
    Feb 4, 2003 at 4:26 pm
  • From: Stone, Timothy Subject: Schreiber's TermHighlighter/LuceneTools utility and the Demo... Date: Wed, 11 Dec 2002 13:00:28 -0800 Has anyone successfully implemented Mark Schreiber's ...
    Massimo Mannino
    Feb 4, 2003 at 10:12 am
    Feb 4, 2003 at 3:21 pm
  • We are currently running lucene 1.2 on Sun Java 1.3.06_01. We have moved a number of our applications to 1.4.1_01. Will lucene 1.2 work under 1.4.1_01 by just recompiling? Thanks, Justin -- Justin ...
    Justin Greene
    Feb 4, 2003 at 5:15 am
    Feb 4, 2003 at 5:25 am
  • Is there an existing API that allows you to conduct a search such that only hits with a score greater than X are returned? Regards, Terry
    Terry Steichen
    Feb 3, 2003 at 9:38 pm
    Feb 3, 2003 at 10:07 pm
  • Hello all! Why am I not able to use wildcards in a phrase? Something like this doesn't work: "Th* is a phrase." This works properly: "This is a phrase." Can you help me? I guess you can ;-) -- Lukas ...
    Lukas Zapletal
    Feb 2, 2003 at 4:52 pm
    Feb 2, 2003 at 5:46 pm
  • Hi, I'm running into an issue with the way the query parameters are parsed in my application. When searching for the word "sync-up", it looks like the QueryParser interpreted that as a search for : ...
    Sylvain Puccianti
    Feb 1, 2003 at 1:41 am
    Feb 1, 2003 at 7:21 am
  • Hi, I was away and when I read what I missed, well...ehm... have you read http://sustainability.open.ac.uk/gary/papers/netique.htm? i.e., see "Caution when quoting other messages while replying to ...
    Leo Galambos
    Feb 28, 2003 at 9:26 pm
    Feb 28, 2003 at 9:26 pm
  • Yes, it is possible. Instead of catching an Exception you can do anything else, e.g. try { ...} catch (MyException e) { System.err.println(e.getClass().getName()); } But this is off-topic here, it's an ...
    Borkenhagen, Michael (ofd-ko zdfin)
    Feb 25, 2003 at 7:31 am
    Feb 25, 2003 at 7:31 am
  • I prefer JTidy http://lempinen.net/sami/jtidy/. Michael -----Original Message----- From: Otis Gospodnetic Sent: Monday, February 24, 2003 15:03 To: Lucene Users List; pl@peopleware.lu ...
    Borkenhagen, Michael (ofd-ko zdfin)
    Feb 25, 2003 at 7:24 am
    Feb 25, 2003 at 7:24 am
Group Overview
group: java-user @
categories: lucene
discussions: 72
posts: 228
users: 75
website: lucene.apache.org

75 users for February 2003

  • Otis Gospodnetic: 17 posts
  • Terry Steichen: 15 posts
  • Tatu Saloranta: 13 posts
  • Doug Cutting: 11 posts
  • Günter Kukies: 11 posts
  • Pinky Iyer: 11 posts
  • Andrzej Bialecki: 7 posts
  • Nellai: 7 posts
  • Newsham: 6 posts
  • Christoph Kiehl: 6 posts
  • Rishabh Bajpai: 6 posts
  • Mailing Lists Account: 5 posts
  • Pierre Lacchini: 5 posts
  • Rob Outar: 5 posts
  • Borkenhagen, Michael (ofd-ko zdfin): 4 posts
  • Che Dong: 4 posts
  • Kalthoff, Marcus: 4 posts
  • Materna, Wolf-Dietrich (empolis B): 4 posts
  • Volker Luedeling: 4 posts
  • Butleree: 3 posts