Search Discussions
-
Hi, Actually already i have added some thousan documents for indexing. Now i need to include one more file for indexing. So if i recreate again, then it will take more time. So how to include this ...
Nellaiyappan Gomathinayagam
Mar 4, 2003 at 4:31 pm
Mar 10, 2003 at 9:58 am -
Thx René that's very Helpfull !!! But I got an error in the code : String s = stemmer.stem(token.termText()); The stem method uses a boolean argument, and not a string... any Idea ? -----Original ...
Pierre Lacchini
Mar 21, 2003 at 9:26 am
Mar 25, 2003 at 2:49 pm -
Hi, we are currently evaluating lucene. The data we'd like to index consists of ~ 80 collections of documents (a few hundred up to 200000 documents per collection, ~ 1.5 million documents total; ...
Morus Walter
Mar 19, 2003 at 8:42 am
Mar 21, 2003 at 1:14 pm -
I'm interested in using the textmining/textextraction utilities using Apache POI, that Ryan was discussing. However, I'm having some difficulty determining what the insertion point would be to ...
Eric Anderson
Mar 5, 2003 at 11:26 am
Mar 6, 2003 at 6:32 pm -
I have a very simple problem: I need to get a list of the words that will result in a hit if searched on. Should be simple, but I'm not quite sure where to start. Thanks, Jon ...
Jcrowell
Mar 31, 2003 at 5:42 pm
Apr 17, 2003 at 6:45 pm -
Caveat: I have not yet installed Lucerne or begun to experiment with it yet. I have scanned the FAQ, but don't see anything that addresses this question. Pardon the somewhat slow buildup to the ...
Gary H Merrill
Mar 27, 2003 at 8:33 pm
Mar 28, 2003 at 12:34 pm -
Hi all, I've a matter with indexing then searching docs written in non-latin languages and encoded in utf-8 (Russian, by example). I have a web application, with a simple form to search in the ...
MERCIER ALEXANDRE
Mar 18, 2003 at 4:36 pm
Mar 19, 2003 at 3:53 am -
Hello, I have tried downloading the LARM source in the lucene-sandbox but there appears to be nothing there? any suggestions [or simply emailing me the source] would be helpful. thanks. John
John Bresnik
Mar 21, 2003 at 9:45 pm
Mar 25, 2003 at 6:56 pm -
I've successfully used Lucene to do indexing of about 50-100K files, and have been keeping the index on a local disk. It's time to move up, and now I'm planning to index from 100-500K files. I'm ...
Avi Drissman
Mar 19, 2003 at 4:44 pm
Mar 20, 2003 at 12:29 am -
Hello I have written an Analyzer for swedish. Compound words are common in swedish, therefore my Analyzer tries to split the compound words into its parts. For example the swedish word fotbollsmatch ...
Magnus Johansson
Mar 11, 2003 at 10:05 am
Mar 14, 2003 at 5:13 am -
Hello, Would anyone be interested in ability to use Lucene search on the data from a database? I've written a small framework that allows to create Lucene index files out of the database data, and ...
Tom Szymanski
Mar 10, 2003 at 3:38 pm
Mar 11, 2003 at 5:00 pm -
What order does Lucene sort in? In my application the results returned are in ascending order which doesn't seem logical. --------------------------------------------------------------------- To ...
Rick Baker
Mar 7, 2003 at 5:41 pm
Mar 10, 2003 at 4:45 pm -
I would like to announce the next release of PDFBox. PDFBox allows for PDF documents to be indexed using lucene through a simple interface. Please take a look at ...
Ben Litchfield
Mar 5, 2003 at 11:51 pm
Mar 10, 2003 at 2:38 am -
I've got a versioning content system where I want to replace documents in a lucene repository. To do so, according to the FAQ and the mailing list archives, I need to open an IndexReader, look for ...
Joseph Ottinger
Mar 5, 2003 at 5:06 pm
Mar 5, 2003 at 6:18 pm -
Hi all, There ist something I don't understand about the wildcard queries. I have values like 'REGENERATION GAS DISTRIBUTION' in the table. when I make a query like descr: Gas I recieve 31 hits. The ...
Test2 Schwab
Mar 27, 2003 at 1:35 pm
Mar 30, 2003 at 2:03 am -
I have not been able to install Lucene correctly (Apache Tomcat 4.1), the demo only works in the lucene directory executing some commands, but the web version is not working!!! I have been reading a ...
Elsa Hernandez
Mar 7, 2003 at 11:21 pm
Mar 9, 2003 at 6:59 am -
Hi, --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: ...
Rende Francesco, CS
Mar 28, 2003 at 2:02 pm
Mar 28, 2003 at 9:32 pm -
Heya all, I'm looking for a full French Analyser, containing a FrenchPorterStemmer... Does anyone know where i can find one ? And if I wanna create my own FrenchAnalyser - I have the STOP_WORDS list ...
Pierre Lacchini
Mar 19, 2003 at 2:40 pm
Mar 19, 2003 at 9:20 pm -
Well guys, here's my (silly) question : I got 2 Fields in my Index, for example Title and Author... If i want to perform a complex query like : search "Williams" in fields "Author" AND "Sword" in ...
Pierre Lacchini
Mar 17, 2003 at 9:24 am
Mar 17, 2003 at 2:53 pm -
I have seen some previous postings about "Escape woes" and "Hyphens not matching", but I haven't seen any resolutions to an issue I've been trying to work out. I don't want my search field to be case ...
Sieretzki, Dionne R, SOLGV
Mar 13, 2003 at 3:16 pm
Mar 13, 2003 at 4:44 pm -
Dear lucene-user group: in the lucene site,there are: " Now you're ready to roll. In your browser set the url to "http://localhost:8080/luceneweb" enter "test" and the number of items per page and ...
Tian LUO
Mar 11, 2003 at 9:38 pm
Mar 12, 2003 at 12:08 am -
Hi, Can somebody help us to figure out how to build queries (or tune Lucene) to return the result in a specific order? When search against multiple fields, It seems like Lucene will give partial ...
Ching-Pei Hsing
Mar 11, 2003 at 3:06 am
Mar 11, 2003 at 7:09 pm -
I have a project for which I want to characterize Lucene query performance on different size archives of my XML files. I have created archives and indices of 1000, 2000, 4000, 8000, and 16000 XML ...
Harry Foxwell
Mar 2, 2003 at 2:40 am
Mar 3, 2003 at 6:27 pm -
Hi, is it intentional that '?' matches exactly one character within wildcard terms but one or zero characters at the end of wildcard terms? That is: r?? matches r ra rab ... whereas r?b matches rab ...
Morus Walter
Mar 25, 2003 at 11:00 am
Apr 16, 2003 at 6:15 am -
Probably tokenized 1234 as a string and treated '-' as a separator. See previous discussion on "query". Regards, Terry ----- Original Message ----- From: "Lixin Meng" <lixin@fulldegree.com To: ...
Terry Steichen
Mar 26, 2003 at 3:42 am
Mar 28, 2003 at 4:36 pm -
Hi, It´s possible index a document with a field repeated several times?. For example, I´ve a photograph and I need to index the published dates. <PublishDate=20030303 <PublishDate=20030305 ...
Jose Galiana
Mar 27, 2003 at 6:47 pm
Mar 27, 2003 at 8:30 pm -
2
query
Hi everyone, I have indexed a table in the database. the table has a column named TagNr. It contains values like 25-XX8569, 41-VL451 ect.... By indexing the table I use the factory method ...Test2 Schwab
Mar 25, 2003 at 10:41 am
Mar 25, 2003 at 4:46 pm -
2
parser
Are there any parser for the following format - doc - xls - ppt - pdf Thanks for help DanielDaniel Hunziker
Mar 21, 2003 at 3:48 am
Mar 21, 2003 at 12:02 pm -
Your Syntax seems to be wrong; try Author:Williams AND Title:Sword - Title:House or Author:Williams AND Title:Sword NOT Title:House Michael -----Ursprüngliche Nachricht----- Von: Pierre Lacchini ...
Borkenhagen, Michael (ofd-ko zdfin)
Mar 17, 2003 at 11:00 am
Mar 17, 2003 at 11:29 am -
are we sure of this?? i was under the impression that Lucene does "first-found-first-returned", and as a result I ended up writing a sorting method on the results? so can i actually do away with ...
Rishabh Bajpai
Mar 8, 2003 at 11:31 am
Mar 8, 2003 at 10:16 pm -
Hi there. Consider the following examples that I do while searching: fun - 19 results fun () - 0 results fun "" - ParseException Help! I really don't want to get ParseException thrown. (I am using ...
Stray Toaster
Mar 7, 2003 at 12:52 pm
Mar 7, 2003 at 4:22 pm -
One of my clients is asking for an old-style boolean query search on my keywords fields. A string might look like this: "oracle admin*" and java and oracle and ("8.1.6" or "8.1.7") and ("solaris" or ...
Shah, Vineel
Mar 28, 2003 at 10:49 pm
Mar 29, 2003 at 12:28 am -
Hi all, I have a question. I have 2 indexes (1 - continually growing, never deleted archive index. 2 - an index that is wiped and recreated daily. These are completely disjoint sets of data) I ...
Host unknown
Mar 27, 2003 at 2:15 pm
Mar 27, 2003 at 6:44 pm -
Hi, In an index I have documents with a field that has been constructed using Field.UnIndexed(). Now I want to switch to Field.Keyword() so I can search for those fields, too. Does it cause any harm ...
Maik Schreiber
Mar 27, 2003 at 5:19 pm
Mar 27, 2003 at 5:26 pm -
Can some one please help me with the command to get O/P from PDFBox on command line or into streams rather that dumping it into a text file. thanks, vikas. ...
Ramrakhiani, Vikas
Mar 25, 2003 at 2:17 pm
Mar 25, 2003 at 2:23 pm -
Hi, 1- If a stop word is the first term of AND operator, ArrayIndexOutOfBounsException is raised. The word "use" being in my stopword list, the query below fails : QueryParser parser = new ...
René Ferréro
Mar 23, 2003 at 9:03 pm
Mar 23, 2003 at 10:40 pm -
Heya, as u can see, I want to create my own french Analyzer, using the snowball's FrenchStemmer... But i don't really know how to proceed... Does anyone know where I can find a tutorial, or a clear ...
Pierre Lacchini
Mar 21, 2003 at 10:33 am
Mar 21, 2003 at 3:15 pm -
Howdy All, I am interested in several things to improve the speed of my indexing. First would be to find out if it's possible (as well as how) to merge lucene indexes of similarly structured (same ...
Vince Taluskie
Mar 21, 2003 at 3:40 am
Mar 21, 2003 at 4:28 am -
Any quick easy way to index static files (html/pdf/doc/<point to an http URL/...) and provide web search interface like: google http://www.htdig.org/ ???? ...
Hanasaki JiJi
Mar 19, 2003 at 8:16 pm
Mar 20, 2003 at 12:30 am -
Robert, I'm moving this to lucene-user, which is a more appropriate list for this type of a problem. You are not saying whether you are using some of those handy -X (-Xms -Xmx) command line switches ...
Otis Gospodnetic
Mar 19, 2003 at 3:19 pm
Mar 19, 2003 at 6:11 pm -
Recently someone posted a link to Oracle in this list. They maintain stop word list for different languages. Marcel --------------------------------------------------------------------- To ...
Marcel Stör
Mar 18, 2003 at 3:59 pm
Mar 18, 2003 at 10:27 pm -
HI! When running lucene i get this error with certain searches, does anyone know what might be the cause of this? java.io.IOException: Bad file descriptor at java.io.RandomAccessFile.seek(Native ...
Eoghan S
Mar 16, 2003 at 3:47 pm
Mar 17, 2003 at 2:54 am -
Hi, I am getting a long value between 1(included) and 0(excluded-I think), and it makes sense to me logically as well - I wouldnt know what a value of greater than 1 would mean, and why should a term ...
Rishabh Bajpai
Mar 14, 2003 at 4:44 am
Mar 14, 2003 at 11:16 pm -
Hello all, I have an exception in Lucene v1.2 final where I try to use PorterStemmer compiled using JIKES: This seems like a serious bug in JIKES! Anyone already reported Jikes comminity? Shall I do ...
Lukas Zapletal
Mar 12, 2003 at 2:14 pm
Mar 12, 2003 at 4:54 pm -
hi! i am currently in my final year of a software engineering degree, for my project i have built a distributed search engine and file sharing system using Sun's JXTA technology and lucene. i am ...
Eoghan S
Mar 10, 2003 at 7:12 pm
Mar 10, 2003 at 7:18 pm -
Hi Serge Knystautas, Exactly i need the same functionality. Thanks for the information. And if you don't mind, can u please send me the sample code of implemeting the stuff. Thanks a ton Nellai.... ...
Nellaiyappan Gomathinayagam
Mar 5, 2003 at 1:23 pm
Mar 8, 2003 at 12:42 am -
I personally believe that we should take the conceptual (design) point of view as the exact method signatures will be looked up in javadoc anyway once the decision to subclass has been made. You bet! ...
Marcel Stor
Mar 7, 2003 at 10:15 am
Mar 7, 2003 at 8:06 pm -
Hello, that is what I know about indexing international documents: 1. I have a language ID 2. with this ID I choose an special Analzer for that language 3. I can use one index for all languages But ...
Günter Kukies
Mar 6, 2003 at 7:18 am
Mar 6, 2003 at 2:58 pm -
Hi, We are incorporating Lucene in a CMS. It does some quite fancy matching and searching of documents and uses Lucene as one of its components. We would like to influence the scoring of search terms ...
Marc Worrell
Mar 4, 2003 at 5:03 pm
Mar 5, 2003 at 5:01 am -
Amit, When you emailed me privately I suggested using lucene-user list, not lucene-dev. I'm moving this thread to lucene-user. My guess is that your problem has nothing to do with index size (40 MB ...
Otis Gospodnetic
Mar 31, 2003 at 3:57 pm
Mar 31, 2003 at 3:57 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 69 |
posts | 276 |
users | 93 |
website | lucene.apache.org |
93 users for March 2003
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)