69 discussions - 276 posts

  • Hi, Actually I have already added some thousand documents for indexing. Now I need to include one more file for indexing. So if I recreate again, then it will take more time. So how to include this ...
    Nellaiyappan Gomathinayagam
    Mar 4, 2003 at 4:31 pm
    Mar 10, 2003 at 9:58 am
  • Thx René that's very helpful !!! But I got an error in the code : String s = stemmer.stem(token.termText()); The stem method uses a boolean argument, and not a string... any idea ? -----Original ...
    Pierre Lacchini
    Mar 21, 2003 at 9:26 am
    Mar 25, 2003 at 2:49 pm
  • Hi, we are currently evaluating lucene. The data we'd like to index consists of ~ 80 collections of documents (a few hundred up to 200000 documents per collection, ~ 1.5 million documents total; ...
    Morus Walter
    Mar 19, 2003 at 8:42 am
    Mar 21, 2003 at 1:14 pm
  • I'm interested in using the textmining/textextraction utilities using Apache POI, that Ryan was discussing. However, I'm having some difficulty determining what the insertion point would be to ...
    Eric Anderson
    Mar 5, 2003 at 11:26 am
    Mar 6, 2003 at 6:32 pm
  • I have a very simple problem: I need to get a list of the words that will result in a hit if searched on. Should be simple, but I'm not quite sure where to start. Thanks, Jon ...
    Jcrowell
    Mar 31, 2003 at 5:42 pm
    Apr 17, 2003 at 6:45 pm
  • Caveat: I have not yet installed Lucene or begun to experiment with it. I have scanned the FAQ, but don't see anything that addresses this question. Pardon the somewhat slow buildup to the ...
    Gary H Merrill
    Mar 27, 2003 at 8:33 pm
    Mar 28, 2003 at 12:34 pm
  • Hi all, I have a problem with indexing and then searching docs written in non-Latin languages and encoded in UTF-8 (Russian, for example). I have a web application, with a simple form to search in the ...
    MERCIER ALEXANDRE
    Mar 18, 2003 at 4:36 pm
    Mar 19, 2003 at 3:53 am
  • Hello, I have tried downloading the LARM source in the lucene-sandbox but there appears to be nothing there? any suggestions [or simply emailing me the source] would be helpful. thanks. John
    John Bresnik
    Mar 21, 2003 at 9:45 pm
    Mar 25, 2003 at 6:56 pm
  • I've successfully used Lucene to do indexing of about 50-100K files, and have been keeping the index on a local disk. It's time to move up, and now I'm planning to index from 100-500K files. I'm ...
    Avi Drissman
    Mar 19, 2003 at 4:44 pm
    Mar 20, 2003 at 12:29 am
  • Hello I have written an Analyzer for Swedish. Compound words are common in Swedish, therefore my Analyzer tries to split the compound words into their parts. For example the Swedish word fotbollsmatch ...
    Magnus Johansson
    Mar 11, 2003 at 10:05 am
    Mar 14, 2003 at 5:13 am
  • Hello, Would anyone be interested in ability to use Lucene search on the data from a database? I've written a small framework that allows to create Lucene index files out of the database data, and ...
    Tom Szymanski
    Mar 10, 2003 at 3:38 pm
    Mar 11, 2003 at 5:00 pm
  • What order does Lucene sort in? In my application the results returned are in ascending order which doesn't seem logical. --------------------------------------------------------------------- To ...
    Rick Baker
    Mar 7, 2003 at 5:41 pm
    Mar 10, 2003 at 4:45 pm
  • I would like to announce the next release of PDFBox. PDFBox allows for PDF documents to be indexed using lucene through a simple interface. Please take a look at ...
    Ben Litchfield
    Mar 5, 2003 at 11:51 pm
    Mar 10, 2003 at 2:38 am
  • I've got a versioning content system where I want to replace documents in a lucene repository. To do so, according to the FAQ and the mailing list archives, I need to open an IndexReader, look for ...
    Joseph Ottinger
    Mar 5, 2003 at 5:06 pm
    Mar 5, 2003 at 6:18 pm
  • Hi all, There is something I don't understand about the wildcard queries. I have values like 'REGENERATION GAS DISTRIBUTION' in the table. When I make a query like descr: Gas I receive 31 hits. The ...
    Test2 Schwab
    Mar 27, 2003 at 1:35 pm
    Mar 30, 2003 at 2:03 am
  • I have not been able to install Lucene correctly (Apache Tomcat 4.1), the demo only works in the lucene directory executing some commands, but the web version is not working!!! I have been reading a ...
    Elsa Hernandez
    Mar 7, 2003 at 11:21 pm
    Mar 9, 2003 at 6:59 am
  • Hi, --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: ...
    Rende Francesco, CS
    Mar 28, 2003 at 2:02 pm
    Mar 28, 2003 at 9:32 pm
  • Heya all, I'm looking for a full French Analyser, containing a FrenchPorterStemmer... Does anyone know where i can find one ? And if I wanna create my own FrenchAnalyser - I have the STOP_WORDS list ...
    Pierre Lacchini
    Mar 19, 2003 at 2:40 pm
    Mar 19, 2003 at 9:20 pm
  • Well guys, here's my (silly) question : I got 2 Fields in my Index, for example Title and Author... If i want to perform a complex query like : search "Williams" in fields "Author" AND "Sword" in ...
    Pierre Lacchini
    Mar 17, 2003 at 9:24 am
    Mar 17, 2003 at 2:53 pm
  • I have seen some previous postings about "Escape woes" and "Hyphens not matching", but I haven't seen any resolutions to an issue I've been trying to work out. I don't want my search field to be case ...
    Sieretzki, Dionne R, SOLGV
    Mar 13, 2003 at 3:16 pm
    Mar 13, 2003 at 4:44 pm
  • Dear lucene-user group: in the lucene site,there are: " Now you're ready to roll. In your browser set the url to "http://localhost:8080/luceneweb" enter "test" and the number of items per page and ...
    Tian LUO
    Mar 11, 2003 at 9:38 pm
    Mar 12, 2003 at 12:08 am
  • Hi, Can somebody help us to figure out how to build queries (or tune Lucene) to return the result in a specific order? When search against multiple fields, It seems like Lucene will give partial ...
    Ching-Pei Hsing
    Mar 11, 2003 at 3:06 am
    Mar 11, 2003 at 7:09 pm
  • I have a project for which I want to characterize Lucene query performance on different size archives of my XML files. I have created archives and indices of 1000, 2000, 4000, 8000, and 16000 XML ...
    Harry Foxwell
    Mar 2, 2003 at 2:40 am
    Mar 3, 2003 at 6:27 pm
  • Hi, is it intentional that '?' matches exactly one character within wildcard terms but one or zero characters at the end of wildcard terms? That is: r?? matches r ra rab ... whereas r?b matches rab ...
    Morus Walter
    Mar 25, 2003 at 11:00 am
    Apr 16, 2003 at 6:15 am
  • Probably tokenized 1234 as a string and treated '-' as a separator. See previous discussion on "query". Regards, Terry ----- Original Message ----- From: "Lixin Meng" <lixin@fulldegree.com To: ...
    Terry Steichen
    Mar 26, 2003 at 3:42 am
    Mar 28, 2003 at 4:36 pm
  • Hi, Is it possible to index a document with a field repeated several times? For example, I've a photograph and I need to index the published dates. <PublishDate=20030303 <PublishDate=20030305 ...
    Jose Galiana
    Mar 27, 2003 at 6:47 pm
    Mar 27, 2003 at 8:30 pm
  • Hi everyone, I have indexed a table in the database. The table has a column named TagNr. It contains values like 25-XX8569, 41-VL451 etc. When indexing the table I use the factory method ...
    Test2 Schwab
    Mar 25, 2003 at 10:41 am
    Mar 25, 2003 at 4:46 pm
  • Are there any parsers for the following formats: doc, xls, ppt, pdf? Thanks for help Daniel
    Daniel Hunziker
    Mar 21, 2003 at 3:48 am
    Mar 21, 2003 at 12:02 pm
  • Your syntax seems to be wrong; try Author:Williams AND Title:Sword - Title:House or Author:Williams AND Title:Sword NOT Title:House Michael -----Original Message----- From: Pierre Lacchini ...
    Borkenhagen, Michael (ofd-ko zdfin)
    Mar 17, 2003 at 11:00 am
    Mar 17, 2003 at 11:29 am
  • are we sure of this?? i was under the impression that Lucene does "first-found-first-returned", and as a result I ended up writing a sorting method on the results? so can i actually do away with ...
    Rishabh Bajpai
    Mar 8, 2003 at 11:31 am
    Mar 8, 2003 at 10:16 pm
  • Hi there. Consider the following examples that I do while searching: fun - 19 results fun () - 0 results fun "" - ParseException Help! I really don't want to get ParseException thrown. (I am using ...
    Stray Toaster
    Mar 7, 2003 at 12:52 pm
    Mar 7, 2003 at 4:22 pm
  • One of my clients is asking for an old-style boolean query search on my keywords fields. A string might look like this: "oracle admin*" and java and oracle and ("8.1.6" or "8.1.7") and ("solaris" or ...
    Shah, Vineel
    Mar 28, 2003 at 10:49 pm
    Mar 29, 2003 at 12:28 am
  • Hi all, I have a question. I have 2 indexes (1 - continually growing, never deleted archive index. 2 - an index that is wiped and recreated daily. These are completely disjoint sets of data) I ...
    Host unknown
    Mar 27, 2003 at 2:15 pm
    Mar 27, 2003 at 6:44 pm
  • Hi, In an index I have documents with a field that has been constructed using Field.UnIndexed(). Now I want to switch to Field.Keyword() so I can search for those fields, too. Does it cause any harm ...
    Maik Schreiber
    Mar 27, 2003 at 5:19 pm
    Mar 27, 2003 at 5:26 pm
  • Can some one please help me with the command to get O/P from PDFBox on command line or into streams rather that dumping it into a text file. thanks, vikas. ...
    Ramrakhiani, Vikas
    Mar 25, 2003 at 2:17 pm
    Mar 25, 2003 at 2:23 pm
  • Hi, 1- If a stop word is the first term of an AND operator, an ArrayIndexOutOfBoundsException is raised. The word "use" being in my stopword list, the query below fails : QueryParser parser = new ...
    René Ferréro
    Mar 23, 2003 at 9:03 pm
    Mar 23, 2003 at 10:40 pm
  • Heya, as u can see, I want to create my own french Analyzer, using the snowball's FrenchStemmer... But i don't really know how to proceed... Does anyone know where I can find a tutorial, or a clear ...
    Pierre Lacchini
    Mar 21, 2003 at 10:33 am
    Mar 21, 2003 at 3:15 pm
  • Howdy All, I am interested in several things to improve the speed of my indexing. First would be to find out if it's possible (as well as how) to merge lucene indexes of similarly structured (same ...
    Vince Taluskie
    Mar 21, 2003 at 3:40 am
    Mar 21, 2003 at 4:28 am
  • Any quick easy way to index static files (html/pdf/doc/<point to an http URL/...) and provide web search interface like: google http://www.htdig.org/ ???? ...
    Hanasaki JiJi
    Mar 19, 2003 at 8:16 pm
    Mar 20, 2003 at 12:30 am
  • Robert, I'm moving this to lucene-user, which is a more appropriate list for this type of a problem. You are not saying whether you are using some of those handy -X (-Xms -Xmx) command line switches ...
    Otis Gospodnetic
    Mar 19, 2003 at 3:19 pm
    Mar 19, 2003 at 6:11 pm
  • Recently someone posted a link to Oracle in this list. They maintain stop word list for different languages. Marcel --------------------------------------------------------------------- To ...
    Marcel Stör
    Mar 18, 2003 at 3:59 pm
    Mar 18, 2003 at 10:27 pm
  • HI! When running lucene i get this error with certain searches, does anyone know what might be the cause of this? java.io.IOException: Bad file descriptor at java.io.RandomAccessFile.seek(Native ...
    Eoghan S
    Mar 16, 2003 at 3:47 pm
    Mar 17, 2003 at 2:54 am
  • Hi, I am getting a long value between 1(included) and 0(excluded-I think), and it makes sense to me logically as well - I wouldnt know what a value of greater than 1 would mean, and why should a term ...
    Rishabh Bajpai
    Mar 14, 2003 at 4:44 am
    Mar 14, 2003 at 11:16 pm
  • Hello all, I have an exception in Lucene v1.2 final where I try to use PorterStemmer compiled using JIKES: This seems like a serious bug in JIKES! Has anyone already reported it to the Jikes community? Shall I do ...
    Lukas Zapletal
    Mar 12, 2003 at 2:14 pm
    Mar 12, 2003 at 4:54 pm
  • hi! i am currently in my final year of a software engineering degree, for my project i have built a distributed search engine and file sharing system using Sun's JXTA technology and lucene. i am ...
    Eoghan S
    Mar 10, 2003 at 7:12 pm
    Mar 10, 2003 at 7:18 pm
  • Hi Serge Knystautas, I need exactly the same functionality. Thanks for the information. And if you don't mind, can you please send me the sample code for implementing the stuff. Thanks a ton Nellai.... ...
    Nellaiyappan Gomathinayagam
    Mar 5, 2003 at 1:23 pm
    Mar 8, 2003 at 12:42 am
  • I personally believe that we should take the conceptual (design) point of view as the exact method signatures will be looked up in javadoc anyway once the decision to subclass has been made. You bet! ...
    Marcel Stor
    Mar 7, 2003 at 10:15 am
    Mar 7, 2003 at 8:06 pm
  • Hello, that is what I know about indexing international documents: 1. I have a language ID 2. with this ID I choose a special Analyzer for that language 3. I can use one index for all languages But ...
    Günter Kukies
    Mar 6, 2003 at 7:18 am
    Mar 6, 2003 at 2:58 pm
  • Hi, We are incorporating Lucene in a CMS. It does some quite fancy matching and searching of documents and uses Lucene as one of its components. We would like to influence the scoring of search terms ...
    Marc Worrell
    Mar 4, 2003 at 5:03 pm
    Mar 5, 2003 at 5:01 am
  • Amit, When you emailed me privately I suggested using lucene-user list, not lucene-dev. I'm moving this thread to lucene-user. My guess is that your problem has nothing to do with index size (40 MB ...
    Otis Gospodnetic
    Mar 31, 2003 at 3:57 pm
    Mar 31, 2003 at 3:57 pm
Group Overview
period: Mar 2003
group: java-user
categories: lucene
discussions: 69
posts: 276
users: 93
website: lucene.apache.org

93 users for March 2003

  • Otis Gospodnetic: 32 posts
  • Samuel Alfonso "Velázquez" "Díaz": 15 posts
  • Eric Anderson: 11 posts
  • Tatu Saloranta: 11 posts
  • Leo Galambos: 9 posts
  • Morus Walter: 8 posts
  • Andrzej Bialecki: 7 posts
  • Doug Cutting: 7 posts
  • Pierre Lacchini: 7 posts
  • René Ferréro: 7 posts
  • Borkenhagen, Michael (ofd-ko zdfin): 6 posts
  • David Spencer: 6 posts
  • John Bresnik: 6 posts
  • Pinky Iyer: 6 posts
  • Marcel Stör: 5 posts
  • Terry Steichen: 5 posts
  • Ben Litchfield: 4 posts
  • Elsa Hernandez: 4 posts
  • Joseph Ottinger: 4 posts
  • Rishabh Bajpai: 4 posts