
57 discussions - 192 posts

  • Hi guys, just wondering if Lucene indexes empty strings and, if so, how to search for them using the query language? Regards, Minh Kama Yie ...
    Minh Kama Yie
    Dec 16, 2002 at 11:26 pm
    Dec 20, 2002 at 6:59 pm
  • I've been running some scalability tests on Lucene over the past couple of weeks. While there may be some flaws with some of my methods, I think they will be useful for people that want an idea as to ...
    Armbrust, Daniel C.
    Dec 20, 2002 at 4:57 pm
    Dec 24, 2002 at 7:19 pm
  • Hello everyone, I wish to implement a TokenFilter that will remove accented characters so, for example, 'é' will become 'e'. As I would rather not reinvent the wheel, I've tried to find something on ...
    Stephane vaucher
    Dec 10, 2002 at 7:58 pm
    Dec 12, 2002 at 7:41 pm
  • I am a total newbie to Lucene. We are developing using a Component-Based Development (CBD) approach (j2ee, oracle, linux) where our app is built using separate stand-alone components. The standalone ...
    Cohan, Sean
    Dec 10, 2002 at 10:38 pm
    Dec 13, 2002 at 12:34 pm
  • This may be of use to people who want to make lucene index faster. Also, I'm curious as to what JVM most people run Lucene under, and if anyone else has seen results like this: I'm using the class ...
    Armbrust, Daniel C.
    Dec 5, 2002 at 10:46 pm
    Dec 6, 2002 at 5:17 pm
  • I have a keyword field that has a value like: "/path/to/something". Is there a way I can use QueryParser to get documents that have that field value? It seems the Analyzer is kicking in and ...
    Erik Hatcher
    Dec 30, 2002 at 4:54 pm
    Jan 1, 2003 at 1:36 am
  • Hello, I'm in the process of creating the "about" page for my app and I was wondering what are the requirements to get included in the "Powered by Lucene" page? The app is a desktop application... ...
    Dec 20, 2002 at 7:31 pm
    Dec 27, 2002 at 5:32 pm
  • One final Lucene question for the day... Is it possible for me to retrieve all the values of a particular field that exists within an index, across all documents? I don't really need (or want) to ...
    Erik Hatcher
    Dec 30, 2002 at 5:24 pm
    Dec 30, 2002 at 7:16 pm
  • Hello all, I'm using Lucene to write a search method, but when I import classes such as import org.apache.lucene.document; import org.apache.lucene.index; import org.apache.lucene.search; import ...
    Dec 21, 2002 at 11:58 pm
    Dec 23, 2002 at 7:15 pm
  • I tried to use IndexHTML (demo) and Lucene 1.2 for indexing *.CZ, but Lucene often falls into a never-ending loop. I've analyzed my data, so I know which file(s) brought Lucene down. I don't see anything ...
    Leo Galambos
    Dec 3, 2002 at 7:32 pm
    Dec 9, 2002 at 8:16 am
  • Hello List I am using PDFBox to index some of the PDF documents. The parser works fine and I can read the summary. But the contents are displayed as java.io.InputStream. When I try the following: ...
    Suhas Indra
    Dec 27, 2002 at 6:47 am
    Jan 2, 2003 at 10:44 am
  • I tested StandardAnalyzer (which uses StandardTokenizer) by inputting a set of strings which produced the following results: "aa/bb/cc/dd" was tokenized into 4 terms: aa, bb, cc, dd "aa/bb/cc/d1" ...
    Terry Steichen
    Dec 26, 2002 at 9:41 pm
    Dec 30, 2002 at 10:13 pm
  • Hi, We use Lucene to index and search HTML Documents. We extract all text content from the html documents and index it. While searching the documents, we found in several instances that search terms ...
    Mailing Lists Account
    Dec 30, 2002 at 10:59 am
    Dec 30, 2002 at 7:11 pm
  • So, I have tried this with Lucene: 1) original JavaCC LL(k) HTML parser 2) SWING's HTML parser In case of (1) I could process about 300K of HTML documents. In case of (2) more than 400K. But I cannot ...
    Leo Galambos
    Dec 12, 2002 at 7:13 pm
    Dec 12, 2002 at 9:13 pm
  • Hi, Would it be possible for Lucene to provide package information? Basically all the java.lang.Package attributes... Things like implementation vendor, name, version and so on... This would make it ...
    Dec 20, 2002 at 8:04 pm
    Dec 20, 2002 at 8:49 pm
  • Hi, I'd like to index .doc files, but I don't know how to do it. If anyone could tell me where I could find some information (URL) about it, I would be very grateful. Thanks a lot ...
    Diego Gutierrez Alonso
    Dec 17, 2002 at 3:00 pm
    Dec 18, 2002 at 7:46 am
  • Hi I started to deploy Lucene 1.3-dev1 from CVS very recently and noticed that the "score" is kind of different. In the case of Lucene1.2 I received scores such as for instance 3.45345234 * 10e-1 In ...
    Michael Wechner
    Dec 16, 2002 at 9:34 pm
    Dec 16, 2002 at 10:01 pm
  • Hey guys, We have a question about how the QueryParser class optimizes searches, if it does at all. We have some searches that are taking an abnormal amount of time. Our search string specifies 3-4 ...
    Dec 10, 2002 at 3:59 pm
    Dec 10, 2002 at 4:32 pm
  • I'm keeping an IndexWriter open so new documents can be indexed as they arrive. I open a new IndexSearcher every time a user runs a search. It seems that search results don't include all documents ...
    Ashley Collins
    Dec 10, 2002 at 10:12 am
    Dec 10, 2002 at 1:07 pm
  • Hi all, does Lucene do stemming of words? If yes, can anyone say how to do it in Java using the Lucene API? Thanks, rgds, srinivas
    M Srinivas Rao
    Dec 10, 2002 at 12:20 pm
    Dec 10, 2002 at 12:58 pm
  • Hi all, I have a rather large file system that I'm indexing (php/html files actually). I'm reindexing on a daily basis, however I don't want/need to reindex 95+% of my files since they're not going ...
    Host unknown
    Dec 9, 2002 at 9:10 pm
    Dec 9, 2002 at 10:09 pm
  • Has anyone out there successfully implemented LARM with Lucene? I have been poring over the LARM source (since there's no external documentation) with little success getting it to behave properly ...
    Host unknown
    Dec 9, 2002 at 4:42 pm
    Dec 9, 2002 at 5:14 pm
  • Is it possible to stop keyword fields contributing to a document's score? Leaving only text fields? Is the best way to boost the terms I know are keyword fields by small numbers? e.g. ...
    Ashley Collins
    Dec 6, 2002 at 4:15 pm
    Dec 9, 2002 at 9:16 am
  • Currently, I use the following procedure to update an index incrementally: 1. Build document 2. Open index reader 3. Delete any previous version of the document using a key field 4. Close index ...
    Eric Jain
    Dec 4, 2002 at 12:04 pm
    Dec 9, 2002 at 9:04 am
  • I'm using Lucene to index MIME messages and have a couple of questions. 1) What is the best way to handle keyword fields which are repeated? Like "recipient" for example. At the moment I have a for ...
    Ashley Collins
    Dec 6, 2002 at 10:12 am
    Dec 6, 2002 at 10:43 am
  • Is there any special reason why TokenMgrError extends java.lang.Error rather than java.lang.Exception? From the Java API docs: "An Error is a subclass of Throwable that indicates serious problems ...
    Eric Jain
    Dec 5, 2002 at 1:58 pm
    Dec 5, 2002 at 5:05 pm
  • Hello, If you want to update a set of documents, you can remove their previous version first and then add them after that. In the meantime, documents of this set are temporarily unavailable. If you ...
    Materna, Wolf-Dietrich (empolis B)
    Dec 5, 2002 at 8:37 am
    Dec 5, 2002 at 12:17 pm
  • Quick question. I have installed and extracted the lucene-1.2.zip file on my XP system. I have gone through the getting started guide, and walked into the simple demo section. I have added the two ...
    Enright, Todd
    Dec 3, 2002 at 6:13 pm
    Dec 3, 2002 at 6:37 pm
  • I have packaged some source code I wrote earlier as an extension to the Lucene project. I hope it can be added to the sandbox so I can communicate more with others who are also interested in the following issues, including: 1 ...
    车 东
    Dec 30, 2002 at 3:34 pm
    Jan 26, 2003 at 5:33 am
  • Is it possible to get a collection of documents based on whether they have a particular field (regardless of value)? I'm indexing HTML documents, and want to pull out some information that may or may ...
    Erik Hatcher
    Dec 30, 2002 at 4:58 pm
    Dec 30, 2002 at 6:09 pm
  • My firewall does not allow me to download via CVS program. Can someone please email me the latest LARM package? Thank you.
    TJ Tee
    Dec 27, 2002 at 2:00 am
    Dec 27, 2002 at 8:49 pm
  • Hello, I use Lucene with Tomcat and I can now index and search all HTML documents. But I would like to index other documents such as PDF or Word (.doc). I hope that someone can help me! ...
    Friaa Nafaa
    Dec 19, 2002 at 4:11 pm
    Dec 19, 2002 at 8:27 pm
  • Hello, Is it possible to search PDF, Excel, Word, RTF files in Lucene? Could you please give me a simple example? Best regards, Eric ...
    Eric Chow
    Dec 19, 2002 at 2:21 am
    Dec 19, 2002 at 2:23 am
  • I'm having trouble with date ranges...It seems that if the lower bound in the range is left blank, the upper bound becomes the first value of the parsed query. Which means, that I get results after ...
    Ashley Collins
    Dec 18, 2002 at 1:50 pm
    Dec 18, 2002 at 3:29 pm
  • Hi All..... I'm out of ideas on how to get the PhraseQuery to return any results. I'm guessing I might not be indexing properly when the document data is being stored. Is there any particular Field ...
    Host unknown
    Dec 13, 2002 at 6:24 pm
    Dec 13, 2002 at 6:34 pm
  • Hi all, I have used the Lucene demo to index my own files by typing java org.apache.lucene.demo.IndexFiles c:\docs. I have a simple editor and wish to integrate this demo with the editor so that ...
    Dec 10, 2002 at 6:44 pm
    Dec 11, 2002 at 4:30 pm
  • Hello, All! When I index a large document (110000 symbols) and then try to search using words that are at the very end of that document, I can't find anything... :(( Is this a feature of Lucene or I am ...
    Andrey Grishin
    Dec 11, 2002 at 3:27 pm
    Dec 11, 2002 at 3:49 pm
  • Hi all, I was running the demo java org.apache.lucene.demo.IndexFiles {full-path-to-lucene}/src and it says it will produce a subdirectory called "index:" but I can't find it. Does anyone know where it ...
    Dec 9, 2002 at 10:05 pm
    Dec 10, 2002 at 5:14 am
  • For Asian languages (Chinese, Korean, Japanese), bigram-based word segmentation is an easy way to solve the word segmentation problem. Bigram-based word segmentation is: C1C2C3C4 = C1C2 C2C3 C3C4 (C# is single CJK ...
    Che Dong
    Dec 31, 2002 at 6:59 am
    Dec 31, 2002 at 6:59 am
  • Hi Cutting: Today, while cleaning up my documents, I found some articles Cutting wrote on search engines. I downloaded these articles from www.lucene.com almost one and a half years ago. I think these articles ...
    车 东
    Dec 30, 2002 at 2:30 pm
    Dec 30, 2002 at 2:30 pm
  • All - I'm proud (and worried about the support e-mails! :) to announce the near-final release of a project demonstrating Ant, XDoclet, Struts, JUnit, Cactus, and Lucene. It's called JavaDevWithAnt as ...
    Erik Hatcher
    Dec 28, 2002 at 2:07 am
    Dec 28, 2002 at 2:07 am
  • Some reading for those long winter nights ;-) "You Are Here" Still lost? A cadre of new companies want to show you the way. http://www.newarchitectmag.com/documents/s=7766/na0103a/index.html Happy ...
    Dec 24, 2002 at 10:28 am
    Dec 24, 2002 at 10:28 am
  • Has anyone successfully integrated the LARM crawler with Lucene on the Windows 2000 platform? Thank you.
    TJ Tee
    Dec 24, 2002 at 1:31 am
    Dec 24, 2002 at 1:31 am
  • Hi all, This may be a very trivial question for the gurus. But can I use Field.Text() for indexing the title of the document so that when a user searches on any word, the result would yield ...
    Dec 23, 2002 at 1:56 pm
    Dec 23, 2002 at 1:56 pm
  • I am having problems indexing our website using Lucene 1.2. To parse our HTML pages, I use the HTMLParser in lucene-demo-1.2.jar. At the end of my creation script, I receive numerous "pipe ...
    Dec 23, 2002 at 12:19 pm
    Dec 23, 2002 at 12:19 pm
  • Not too long ago someone on the list mentioned a small program that you could run to check how your tokenizer and analyzer were treating some test queries. I believe they retrieved it from an earlier ...
    Terry Steichen
    Dec 22, 2002 at 11:27 pm
    Dec 22, 2002 at 11:27 pm
  • Hey folks, They've put an apache wiki at nagoya. I took the liberty of writing a paragraph about Lucene, please feel free to completely revise it :-). ...
    Steven J. Owens
    Dec 22, 2002 at 3:30 am
    Dec 22, 2002 at 3:30 am
  • Hi all, I'm having a problem searching for phrases (example: "bucky badger"). I can search for the terms individually (using AND or OR searches (BooleanQuery)), but can't seem to do a PhraseQuery ...
    Host unknown
    Dec 12, 2002 at 4:49 pm
    Dec 12, 2002 at 4:49 pm
  • Has anyone successfully implemented Mark Schreiber's TermHighlighter interface in the demo? I'm seeking some pointers/samples. Thanks in advance, Tim ...
    Stone, Timothy
    Dec 11, 2002 at 8:59 pm
    Dec 11, 2002 at 8:59 pm
  • (note: running on Solaris 9, Java 1.4, TomCat 4.04, Cocoon 2.0.3, Lucene 1.2) I'm learning how to use Lucene within my Cocoon application, and have two questions...need pointers to RTFMs, examples, ...
    Harry Foxwell
    Dec 6, 2002 at 1:45 am
    Dec 6, 2002 at 1:45 am
Group Overview
group: java-user
period: Dec 2002

58 users for December 2002

  • Doug Cutting: 13 posts
  • Otis Gospodnetic: 13 posts
  • Petite_abeille: 13 posts
  • Erik Hatcher: 11 posts
  • Armbrust, Daniel C.: 10 posts
  • Stephane vaucher: 8 posts
  • Ashley Collins: 7 posts
  • Eric Isakson: 7 posts
  • 车 东: 7 posts
  • Eric Jain: 6 posts
  • Host unknown: 6 posts
  • Leo Galambos: 6 posts
  • Peter Carlson: 6 posts
  • Alex: 5 posts
  • Terry Steichen: 5 posts
  • Cohan, Sean: 4 posts
  • Joshua O'Madadhain: 4 posts
  • Minh Kama Yie: 4 posts
  • Jonathan Reichhold: 3 posts
  • Kelvin Tan: 3 posts