Hi all, DSpace (www.dspace.org) currently uses Lucene to index metadata (Dublin Core standard) and extracted full-text content of the documents stored in it. Now that the system is being used globally, it ...
Tansley, Robert
May 31, 2005 at 9:09 pm
Jun 7, 2005 at 7:02 am -
Hi, I'm working on a pretty typical web page search system based on lucene. Pretty much everything works great. However, I'm having one problem. I want to have a feature in this system where I can ...
Doug Hughes
May 29, 2005 at 12:30 pm
Jun 21, 2005 at 6:13 am -
I am working on a Document Management System where every document has an Access Control List attached to it. Obviously a search result should only consist of documents that may be viewed by the ...
Markus Wiederkehr
May 30, 2005 at 7:47 am
Jun 4, 2005 at 11:07 pm -
Firstly, the Lucene in Action book is great. It really helped me with implementing search for a project. Sorry if this is the wrong forum, but as you are all search people, I wondered if you could ...
Anna Bing
May 12, 2005 at 10:36 am
May 24, 2005 at 1:37 pm -
I have an index with a date field. I want to quickly find the minimum and maximum values in the index. Is there a quick way to do this? I looked at using TermInfos and finding the first one but how ...
Kevin Burton
May 31, 2005 at 7:07 am
Jun 7, 2005 at 4:35 pm -
Hi all, I use the Lucene highlight package to generate KWIC for our application. Part of the code is as follows: if (text != null) { ...
Yinjin
May 5, 2005 at 12:13 am
May 5, 2005 at 10:51 pm -
Hi, We have a need to present HTML documents with all search terms highlighted. Everything I've seen regarding the Highlighter code seems to point to the typical case of extracting relevant fragments ...
Fred Toth
May 24, 2005 at 7:47 pm
May 27, 2005 at 4:09 pm -
First, I am new to Lucene. Is there anyone out there who has had trouble getting hits when running phrase queries against an index that contains content from PDF files? For PDF documents, I create ...
Thomas X Hoban
May 25, 2005 at 8:59 pm
May 25, 2005 at 11:29 pm -
Hi, does anyone know what exactly Similarity.tf() is? I understood it to be the term frequency in a document. Still, when I search for a string that a document contains, Explain().toString() shows tf=0. ...
M. Mokotov
May 24, 2005 at 1:51 pm
May 25, 2005 at 8:21 am -
NFS
I was wondering about Lucene and NFS. The issue is with locking, correct? Lucene in Action mentions: "... issues with lock files and NFS, choose a directory that doesn't reside on an NFS volume. ..."
Richard Krenek
May 18, 2005 at 1:50 am
May 18, 2005 at 9:39 pm -
Hi! In my application, I index some strings (like filenames) untokenized, meaning via doc.add(new Field(FIELD,VALUE,false,true,false)); When I later take a look at it with Luke, I still get tokens of ...
Max Pfingsthorn
May 27, 2005 at 3:23 pm
May 28, 2005 at 2:11 am -
Hi! I was wondering if Lucene has any sort of functionality to distribute indices so that different fields are stored in separate indices but they still refer to the same document. This would be ...
Max Pfingsthorn
May 20, 2005 at 11:58 am
May 21, 2005 at 8:01 am -
Hi, I catch the TooManyClauses exception in my application, and when I print the exception message I get a null value. I think this behavior is bad; it doesn't help to find the cause of errors. Now I use ...
Ernesto De Santis
May 18, 2005 at 4:24 pm
May 19, 2005 at 12:25 am -
Hello, I'm having a tough time trying to get to the root of an exception I see sometimes on my Lucene 1.4.3 index. The exception is: java.lang.ArrayIndexOutOfBoundsException: 4 at ...
Matt Magoffin
May 5, 2005 at 7:26 pm
May 8, 2005 at 8:07 pm -
Hi Everyone, I've been searching the archive without success to answer this one: is it possible to specify one similarity class per field, just like we can do with an analyzer ? I know I can change ...
Robichaud, Jean-Philippe
May 3, 2005 at 9:58 pm
May 5, 2005 at 5:09 pm -
Hi, I've been using Lucene for 2 months and now I have a big problem. My index contains (for example) 4 documents which contain the word simone. The problem is that Lucene does not find all documents by some ...
Möckl Susanne
May 20, 2005 at 9:04 am
May 20, 2005 at 7:25 pm -
Somebody asked about this today, and I just found this through Simpy: http://www.unine.ch/info/clef/ Scroll half-way through the page, look on the right side: 1,000 most frequent words for several ...
Otis Gospodnetic
May 12, 2005 at 7:59 am
May 12, 2005 at 11:29 am -
Hi all, is it possible, with the RAMDirectory (or another Directory), to "flush" information after each Document is indexed? I tried this, but it seems this "flush" can be made after 2 ...
Rifflard Mickaël
May 10, 2005 at 2:48 pm
May 12, 2005 at 7:37 am -
Hello, I just wanted to let everyone know that we've officially announced that the new SourceForge.net search system is based on Lucene. It's been in operation for over a month now and we're very ...
Chris Conrad
May 25, 2005 at 7:40 pm
May 29, 2005 at 10:32 am -
Hi, I'm getting a TooManyClauses Exception when I try to query for a particular date range. I've around 4 million documents with 21 fields each. The fields to search into are determined by the user - ...
Jayakumar.V
May 19, 2005 at 8:50 pm
May 21, 2005 at 6:07 am -
Hi all, I need to retrieve all terms from a specified field, filtered by another field. For example, Document 1 - <contents, " document 1 content" <language, en Document 2 - <contents, " document 2 ...
Albert Vila
May 18, 2005 at 3:19 pm
May 19, 2005 at 9:06 pm -
Hypothetically I have 100 million records. Each record has 100+ fields. Only 20 of those fields need to be searched on, the rest (including the 20) are just for display purposes. Would it be best to ...
Richard Krenek
May 13, 2005 at 10:31 pm
May 18, 2005 at 3:59 am -
Hi, I'm trying to collect Documents whose (normalized) score is greater than a given threshold, but I don't know the smartest way to do so :) Do I have to subclass (Index)Searcher and ...
Kai Gülzau
May 10, 2005 at 3:09 pm
May 17, 2005 at 7:07 pm -
Hi, I have a project which will be used in order to supply automatic dictionary helps in different languages. I'm using Lucene for indexing, and searching the words in it. It is an open source ...
Ahmet Aksoy
May 11, 2005 at 10:01 pm
May 12, 2005 at 8:19 am -
Hi, I am starting my application in multi-threaded environment, could somebody show me any examples with serialize calls to the IndexWriter.addDocument(Document)? because my idea is to use ...
Sodel Vazquez-Reyes
May 3, 2005 at 6:50 pm
May 10, 2005 at 7:50 pm -
Hi, " We are please to announce the initial release of Compass, a new concept in semantic Search Engine/Object Mapping (OSEM) technology. Compass is a Java framework, built on top of the Lucene ...
Kimchy Compass
May 3, 2005 at 12:17 pm
May 4, 2005 at 9:35 am -
I'm building a search engine that searches multiple document fields by default. Given a query string like "Bruce Lee", I would expect the results list to first show the documents containing both ...
Mike Baranczak
May 1, 2005 at 5:05 pm
May 2, 2005 at 6:39 pm -
Hi, can someone please explain to me how to use the CachingWrapperFilter? I see that it's built as a decorator (its constructor takes another filter and decorates it with caching); still, I ...
M. Mokotov
May 26, 2005 at 8:04 am
Jun 2, 2005 at 5:08 pm -
Hi All, Now that the QueryParser knows about position increments has anyone used this to do stemming at query time and not at indexing time? I suppose one would need a reverse stemmer. Given the ...
Andrew Boyd
May 30, 2005 at 4:54 pm
Jun 1, 2005 at 2:11 pm -
I have a Document with about 15 fields. I only need two of them. How much faster would lucene be if I only fetched the two fields? Each field is a separate file and this would almost certainly slow ...
Kevin Burton
May 28, 2005 at 9:11 am
Jun 1, 2005 at 8:30 am -
Here is the logical structure of the document I'm working with: The 'Document' has two fields: 'includes' - List of terms that provide positive boost 'excludes' - List of terms that provide negative ...
Ryan Skow
May 26, 2005 at 3:59 pm
May 26, 2005 at 5:17 pm -
Hi, I wanted to know what method would be the best way to do something that I am describing below. I am creating an index of all my products and categories. While indexing, I am creating the ...
Mufaddal Khumri
May 20, 2005 at 10:37 pm
May 21, 2005 at 8:46 am -
Hi Lucene community, I'm facing a strange problem, that you'll probably understand as I'm only a newbie to Lucene. When I search "hotliner:such" I get a 0 result. ("such" gets the same) But when I ...
JM Tinghir
May 19, 2005 at 7:53 pm
May 20, 2005 at 7:40 am -
Hi, We have implemented a lucene search like this: registry = LocateRegistry.getRegistry(RMIAddress, RMIPort); searchables = new Searchable[] { (Searchable) registry.lookup(RMIIndexName)}; ...
Lilja, Bjorn
May 10, 2005 at 3:06 pm
May 11, 2005 at 9:25 pm -
Context: our index is currently around 6 gig and takes about an hour just to optimize. Updating it, even in batches, can involve active updating for 15 or more minutes. Index updates are done with ...
Naomi Dushay
May 10, 2005 at 8:45 pm
May 11, 2005 at 3:00 pm -
Hi guys, A friend just asked me for advice about synchronizing lucene indexes across a very large number of servers. I haven't really delved that deeply into this sort of stuff, but I've seen a ...
Steven J. Owens
May 5, 2005 at 6:29 am
May 10, 2005 at 7:45 pm -
Hi All, I'm wanting to do some range queries using latitude and longitude. I have numbers like so: long lat -84.65532 32.74212 What would be the best way to store this in lucene so I can do a range ...
Andrew Boyd
May 8, 2005 at 4:26 pm
May 8, 2005 at 5:49 pm -
Hello all, I know that we can expand a word to get its synonyms with Wordnet. I was wondering if we could reduce the index size by including a synonym instead of a word on the synonym list. For ...
Pablo Gomes Ludermir
May 4, 2005 at 9:39 pm
May 5, 2005 at 2:44 pm -
Hi, I suppose this question has been asked before, but there is no way to search for such a thing in the archive. Anyway, I need to merge two different types of search, but I am not really sure that the ...
Bertrand VENZAL
May 12, 2005 at 7:57 am
Aug 22, 2005 at 11:07 pm -
Hello, I am currently looking for a way to navigate forward and backward among the indexed terms. For example, given a Term t, I would like to be able to get the next 10 terms or the previous 10 ...
Antoine Brun
May 25, 2005 at 8:05 am
Jun 13, 2005 at 9:52 am -
Hi All, By using the carrot demo: http://www.newsarch.com/archive/mailinglist/jakarta/lucene/user/msg03928.html I was able to easily cluster search results based on the fields used by carrot (url, ...
Andrew Boyd
May 30, 2005 at 3:08 pm
Jun 1, 2005 at 2:28 pm -
How would one go about adding additional terms to a field which is not stored literally, but instead has a termFreqVector? For example: If DocumentA was indexed originally with: myTermField: red ...
Ryan Skow
May 30, 2005 at 4:38 pm
May 31, 2005 at 10:03 pm -
I noticed in my lucene index that I had mistakenly indexed some documents multiple times. I wrote the following piece of code to find and eliminate the duplicates, but it did not behave as expected. ...
Dan Climan
May 26, 2005 at 6:51 pm
May 27, 2005 at 3:27 pm -
Hi all, I am new to the Lucene project and would like to get some information: 1) Can we use Lucene as a search engine for a code repository? 2) If yes, how should the code component cataloging ...
Singh, Anurag (Research)
May 27, 2005 at 5:15 am
May 27, 2005 at 7:00 am -
Hi, I am building queries using the query api and when I use } in my fieldname and then call toString on the query, QueryParser throws a ParseException when trying to parse it. How do I fix this? ...
Peter Gelderbloem
May 24, 2005 at 9:19 am
May 25, 2005 at 9:43 am -
Dear Sir/Madam: I am a beginner in IR. I want to use Lucene with the BM25 algorithm, but I don't know how to change its default sort algorithm. Can you give me some advice? Thanks! ...
Luqun lou
May 23, 2005 at 1:30 pm
May 24, 2005 at 1:32 am -
Hi, My company would like to make the following contribution to Lucene (in sandbox?) licensed under the Apache License, Version 2.0. Background: While doing project work on a web-based search engine ...
Maik Schreiber
May 17, 2005 at 10:42 pm
May 18, 2005 at 10:16 am -
Dear all, I would like to know about the maxFieldLength. It says on the Javadocs that it limits "The maximum number of terms that will be indexed for a single field in a document." So, for instance, ...
Pablo Gomes Ludermir
May 17, 2005 at 9:35 pm
May 18, 2005 at 12:49 am -
Suppose there are two fields, "content" and "summary", and I think a query match in the content field should have a higher weight than one in the summary field. How can I do it? I overload the parse function, and add ...
Luqun lou
May 11, 2005 at 3:50 pm
May 16, 2005 at 1:35 am -
1. I am trying to pump in a large number of documents (to the tune of 50000) ... I use multiple threads and I depend on the internal locks of Lucene to synchronize the write access to the index. try { ...
Ramya
May 12, 2005 at 2:20 pm
May 13, 2005 at 4:49 am
Group Overview
group | java-user |
categories | lucene |
discussions | 109 |
posts | 363 |
users | 129 |
website | lucene.apache.org |
129 users for May 2005