-
Hi friends, What do you think about using Lucene for searching in the Semantic Web? I am trying to use Lucene for searching documents with ontological annotations. But I do not get a better model to ...
Jason
Jan 17, 2006 at 2:35 pm
Jan 20, 2006 at 10:01 am -
Hi, I want to find the intersection of two Hits objects. Please help me. Hits hits1 = Searcher.search(strQuery1); Hits hits2 = Searcher.search(strQuery2); Now hits1 contains record numbers 3,4 ...
Ravi
Jan 18, 2006 at 1:18 pm
Jan 19, 2006 at 11:04 am -
Hello, Is it possible to retrieve the top 'n' most often appearing words within a search criteria? I've seen the High Frequency Terms code in the sandbox but it works across the whole index. To put ...
Chris Brown
Jan 9, 2006 at 4:28 pm
Jan 11, 2006 at 6:30 pm -
I would like to do queries that are negative. I mean a query with only negative terms and phrases. For example, retrieve all documents that do not contain the term "apple". For now, I have a limited ...
Beady Geraghty
Jan 6, 2006 at 5:05 pm
Jan 7, 2006 at 5:46 pm -
I'm throwing myself at the mercy of the lucene community, I'm a bit brain dead today after looking after a screaming 3 month old baby for 4 hours last night... We have a 'title' field indexed as ...
Paul Smith
Jan 5, 2006 at 10:09 pm
Jan 6, 2006 at 12:11 pm -
I'm having some trouble coming up with a good search strategy for geographical data. e.g., given: [1] city: London, United Kingdom [2] city: London, Ontario, Canada [3] city: Ontario, California, ...
Colin Young
Jan 27, 2006 at 4:09 pm
Feb 11, 2006 at 9:47 am -
Hey, I have a bit of a complex problem: I need to group results received in a result set. For example: my result set returns 10,000 results, there are about 10 fields in each result document, and I need to ...
Zzzzz shalev
Jan 29, 2006 at 12:56 pm
Jan 31, 2006 at 9:26 am -
I'm having a problem with keyword fields and how they're treated by QueryParser. At indexing time, I index my documents, as follows: Content - tokenized, indexed field (the default field) DocType - ...
Dmitry Goldenberg
Jan 24, 2006 at 11:47 pm
Feb 17, 2006 at 9:55 am -
9
BTree
Does Lucene use a BTree kind of structure for storing the index (at least in memory), or is it just a list? Based on the file format in the index directory (wherein the terms are ...
Shailesh kumar
Jan 11, 2006 at 12:23 pm
Jan 13, 2006 at 12:03 am -
I'm using the .net port of highlighter (1.5) and I notice it doesn't highlight range or prefix queries.. Is this consistent with the java version? Only I note my standard reference of ...
Gwyn Carwardine
Jan 24, 2006 at 9:32 pm
Jan 26, 2006 at 1:53 pm -
Is there an efficient way to determine if two or more terms frequently appear next to each other in sequence? For a query like: a b c one or more of the following suggestions could be generated: "a b c" ...
Eric Jain
Jan 10, 2006 at 11:10 pm
Jan 12, 2006 at 8:11 pm -
Hi, Does anyone know if it is possible to show related searches with lucene, for example if someone searched for "car insurance" you could bring back the results and related searches like these ...
Leon Chaddock
Jan 30, 2006 at 12:05 pm
Feb 1, 2006 at 11:05 am -
7
encoding
Hello, I have a problem with data I try to index with Lucene. I browse a directory and index text from different types of files through parsers. For text files, data could be in different languages so ...
Arnaudbuffet
Jan 26, 2006 at 8:56 am
Jan 28, 2006 at 4:20 pm -
I am attempting to prune an index by getting each document in turn and then checking/deleting it: IndexReader ir = IndexReader.open(path); for(int i=0;i<ir.numDocs();i++) { Document doc = ...
Chun Wei Ho
Jan 26, 2006 at 8:15 am
Jan 27, 2006 at 8:24 am -
I am periodically getting "Too many open files" error when searching. Currently there are over 500 files in my Lucene directory. I am attempting to run optimize( ) to reduce the number of files. ...
Steve Rajavuori
Jan 23, 2006 at 5:18 pm
Jan 23, 2006 at 7:16 pm -
The similarity measure is term based, tf*idf weighted in its simple form. A further enhancement would be a "weighting" of nodes, e.g. based on information content (see e.g. Rodriguez, M.A. & Egenhofer, ...
Mathias Lux
Jan 19, 2006 at 12:36 pm
Jan 19, 2006 at 5:15 pm -
Folks, I'm a Lucene newbie, and I've been searching awhile today to answer this question. Googled, read Lucene FAQ, looked at Javadoc for Document and Hits, etc. How would you implement "snippets" ...
Dan Frankowski
Jan 4, 2006 at 11:04 pm
Jan 8, 2006 at 5:38 pm -
Hello, I have a couple of instances of Lucene. I just altered one implementation and now it's not keeping a segments file. While indexing occurs, there is a segment file, but once it's done, there isn't. ...
John Powers
Jan 27, 2006 at 3:10 am
Jan 30, 2006 at 6:44 am -
Hi, Apologies if this question has been asked before on this list. I am working on an application with a Lucene index whose performance (response time for a query) has started degrading as its size ...
Ori Schnaps
Jan 24, 2006 at 12:25 am
Jan 25, 2006 at 7:31 pm -
Two queries about ranges: 1. field:[a TO z] does not return the same as field:[z TO a] I think it should. The standard QueryParser or even the range query should ascertain the lowest and highest and ...
Gwyn Carwardine
Jan 23, 2006 at 3:43 pm
Jan 25, 2006 at 12:00 pm -
Hi Friends, I have a very basic question: 1] Why do we use Lucene for database search like Oracle / Sybase? 2] For that, first we have to convert all records one by one into strings, then build a Lucene ...
Vikas Khengare
Jan 17, 2006 at 10:52 am
Jan 20, 2006 at 5:43 am -
Hi, I am currently looking for a way to limit the number of Hits which are returned by a Query. What I am doing is the following: Searcher s = ...; Query q = QueryParser.parse("...", "...", new ...
Daniel Pfeifer
Jan 19, 2006 at 1:00 pm
Jan 19, 2006 at 8:34 pm -
Hi, I have a problem using Lucene. I wrote a SynonymFilter which can add synonyms from WordNet. Meanwhile, I used the SnowballFilter for term stemming. However, I got a problem when ...
Jason
Jan 16, 2006 at 8:54 am
Jan 17, 2006 at 2:38 pm -
Hi everybody, Let me explain my problem: I am indexing ".txt" files and basically I split each file into paragraphs; I mean, I create a Document for each file and within this Document I add ...
Jairo Sánchez Menéndez
Jan 29, 2006 at 6:18 pm
Jan 30, 2006 at 2:53 pm -
Hi, I want to know how Lucene normalizes the score. I see the Hits class has a function to get each document's score. But I don't know how Lucene calculates the normalized score and in the "Lucene ...
Xing jiang
Jan 27, 2006 at 7:16 am
Jan 28, 2006 at 6:04 am -
Hi, I am trying to find a way to create scores with a custom formula based on the initial score from Lucene and field values from each document, e.g. for each document: finalScore = searchScore * ...
Nick Vincent
Jan 24, 2006 at 5:42 pm
Jan 24, 2006 at 11:14 pm -
I would like to store large source documents ( 10MB) in the index in their original form, i.e. as text for text documents or as byte[] for binary documents. I have no difficulty adding the source ...
George Washington
Jan 20, 2006 at 4:52 am
Jan 22, 2006 at 6:42 am -
Hello dear Lucene users! Is there an easy way to check whether an index is optimized or not? Best regards, Max
Maxim Patramanskij
Jan 11, 2006 at 1:25 pm
Jan 12, 2006 at 10:17 am -
Lucene seems to prefer matches in shorter documents. Is it possible to influence the scoring mechanism to have matches in shorter fields score higher instead? For example, a query for "europe" should ...
Eric Jain
Jan 9, 2006 at 9:35 am
Jan 10, 2006 at 11:06 pm -
Question: Run attached java class and see the also attached when I ran it. I assumed the second writer should get an IOException in getWriter (in first TESTCASE). However this does not happen! This ...
Dick de Jong
Jan 8, 2006 at 4:54 am
Jan 10, 2006 at 10:05 am -
Hi, I want to update a document in the lucene index. As mentioned in the documentation, I tried to delete the document using IndexReader.delete method. But even after I delete the document, I am able ...
Harini Raghavan
Jan 8, 2006 at 4:11 pm
Jan 9, 2006 at 3:22 pm -
Hello, I found in Lucenebook that searching always reads the first 100 documents first. I took a look at the Java source code and found that in Hits.java. I cannot find any easy way to forbid this ...
Leos Literak
Jan 7, 2006 at 3:11 pm
Jan 9, 2006 at 8:36 am -
Hi all. I am a newbie to Lucene. Does Lucene provide any way to do indexing, searching and deleting simultaneously? I hope we could do searching and indexing which means there can be multiple ...
K.A.Hussain Ali
Jan 5, 2006 at 3:00 pm
Jan 5, 2006 at 6:36 pm -
Hi, We use Lucene without any problems even for German text, but with Chinese text nothing is found. What is the best way to index and search Chinese text? Zsolt ...
Zsolt
Jan 28, 2006 at 8:27 am
Jan 31, 2006 at 11:54 am -
4
SoundEx
Hi, Does anyone know if there is a SoundEx analyzer implementation available for Lucene? Thanks. Steve
Steven Pannell
Jan 18, 2006 at 8:27 am
Jan 26, 2006 at 2:02 pm -
I have ~5 million documents that are in categories and subcategories. Let us say that my query is for search terms in one top-level category and it returns a large amount of documents and I want to ...
Mike Austin
Jan 25, 2006 at 6:58 pm
Jan 26, 2006 at 8:28 am -
Hi A database is used as our primary data store. Our lucene index is then created and updated from this database. We store the value of the database primary key in the lucene index as we need to be ...
Allan Dewar
Jan 23, 2006 at 3:48 pm
Jan 24, 2006 at 11:02 am -
Hi, there, In "Lucene in action", it mentions in Section 3.2.3 "reading indexes into memory" that, "...RAMDirectory's constructor can be used to read a file system-based index into memory, allowing ...
Kan Deng
Jan 12, 2006 at 1:37 am
Jan 12, 2006 at 7:34 pm -
So far I've succeeded in grabbing the latest source (and I can build with ant), but now I'm trying to open it in Eclipse by creating a new "Java Project from Existing Ant Buildfile," but when I ...
Colin Young
Jan 5, 2006 at 3:08 am
Jan 5, 2006 at 8:34 pm -
hello all - i have a problem with a SpanNearQuery returning incorrect (false positive) results. I am creating the context of a field using tokens which have position increment set to either 1 or 0. ...
Marc Hadfield
Jan 5, 2006 at 2:39 am
Jan 5, 2006 at 6:46 pm -
Hi, I would like to associate information (or labels) with each word or a range of words in a document. Information such as this word is a noun, that word is a verb, this period marks the end of a ...
Dave Kor
Jan 4, 2006 at 6:34 am
Jan 4, 2006 at 2:06 pm -
Normally I wouldn't post this here, but I haven't been able to find any info about how I would go about downloading the latest source from the SVN repository. I've got a bit of experience with CVS, ...
Colin Young
Jan 4, 2006 at 3:45 am
Jan 4, 2006 at 12:27 pm -
Hi, I'm currently using the GermanStemmer and it works well. However, today I've found two words which get stemmed to the same stem. "Suche" and "Sucht" both get stemmed to the same "such" it ...
Markus Fischer
Jan 31, 2006 at 12:50 pm
Jan 31, 2006 at 9:33 pm -
I am curious what the difference would be between searching for a number versus a character. I have a large index consisting of a few fields (So index would look something like: " 123123123 my ...
Aigner, Thomas
Jan 30, 2006 at 9:46 pm
Jan 31, 2006 at 9:29 pm -
Hi, I'm trying to figure out a way to locate tokens which include special characters. The actual text in the file being indexed is something like "function() { statement1; statement2; }" The query ...
Dmitry Goldenberg
Jan 27, 2006 at 10:10 pm
Jan 30, 2006 at 4:59 pm -
Hi, I'm trying to delete duplicate documents from my index; the unique identifier is the document's URL (aka field "url"). My initial thought of how to accomplish this is to open the index via a reader ...
Gekkokid
Jan 28, 2006 at 5:00 pm
Jan 30, 2006 at 12:01 pm -
Hi, Got more questions regarding Lucene and this time it's about performance ;-) We currently are using RAMDirectories to read our Indexes. This has now become a problem since our index has grown to ...
Daniel Pfeifer
Jan 26, 2006 at 3:21 pm
Jan 27, 2006 at 10:43 am -
Today I've been alerted by one of my colleagues that our Lucene-based indexing solution no longer refreshes the searchers and thus we never get any new indexed documents. Since I didn't find anything ...
Daniel Pfeifer
Jan 24, 2006 at 1:52 pm
Jan 27, 2006 at 6:46 am -
Hi, Please forgive me if this comes across as being naïve; however, I've bashed my head against it for a while and can't come up with a solution. Overview: I have the following basic document structure: ...
Ashley Rajaratnam
Jan 20, 2006 at 3:09 pm
Jan 20, 2006 at 10:40 pm -
Hello lucene people! First of all, i would like to thank all of community participants ( developers, users, Erik and Otis for "Lucene in Action" book ) for their great work. As far as i understand ...
Aleksey Serba
Jan 20, 2006 at 1:50 pm
Jan 20, 2006 at 6:32 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 126 |
posts | 544 |
users | 126 |
website | lucene.apache.org |
126 users for January 2006