-
Hi All, I have successfully used Lucene in the "traditional" way to provide full-text search for various websites. Now I am tasked with developing a data-store to back a web crawler. The crawler can ...
John Evans
Jul 29, 2008 at 1:53 am
Aug 3, 2008 at 3:09 pm -
Hi, I have a number of fields that are used to filter documents from a search. They should not contribute to the score of the document but merely decide which documents are valid. i.e. it doesn't ...
John Patterson
Jul 15, 2008 at 8:08 am
Jul 16, 2008 at 5:33 pm -
Hi Everyone, I am getting the following error when executing Hits hits = searchers.search(query, queryFilter, sort): 18007414-java.io.IOException: Bad file descriptor 18007455- at ...
Jamie
Jul 23, 2008 at 4:20 pm
Jul 28, 2008 at 9:56 am -
Hi all, I am new to Lucene. Is it possible to index different files in different folders in Lucene? For example, I have two folders, a and b, each containing several files. In the Lucene args I wrote: ...
Starz10de
Jul 5, 2008 at 6:42 pm
Jul 15, 2008 at 5:33 pm -
I'd like to shorten the time it takes to optimize my index and am willing to sacrifice search and indexing performance. Which parameters (e.g. merge factor) should I change? Thank you. Stay in touch ...
Dragon Fly
Jul 28, 2008 at 6:00 pm
Jul 30, 2008 at 10:45 pm -
Hi All, Is there any possibility to avoid duplicate records in lucene 2.3.1? -- View this message in context: http://www.nabble.com/How-to-avoid-duplicate-records-in-lucene-tp18543588p18543588.html ...
Sebastin
Jul 19, 2008 at 11:30 am
Jul 23, 2008 at 11:19 pm -
Hi. There's a ThreadLocal field in SegmentReader (it's called termVectorsLocal). Some value is put to it, but it's never cleared. Is it ok? It looks like sometimes this behavior may lead to leaks. ...
Roman Puchkovskiy
Jul 6, 2008 at 8:33 pm
Jul 12, 2008 at 4:38 am -
.fdt file
Hi, I had recently found out that Lucene will retrieve the content of a document from a file ".fdt". I am trying to retrieve the entire file in one go instead of retrieving it based on document ...
Blazingwolf7
Jul 10, 2008 at 1:01 am
Jul 11, 2008 at 9:34 am -
Hi all. This may seem a longish and informal mail, but do correct me if my assumptions are wrong anywhere, otherwise my actual doubt will make no sense. Say I opened an IndexWriter on an initially ...
Java_is_everything
Jul 26, 2008 at 4:18 am
Jul 28, 2008 at 9:18 am -
Hi all, I have a question related to the write locks created by Lucene. I use Lucene 2.3.2. Will this newer version create locks while indexing as older ones did? Or is there any other way that Lucene ...
Sandeep K
Jul 23, 2008 at 6:59 am
Jul 25, 2008 at 12:39 pm -
If I have a SortField with a type of STRING, is there any way to sort in a case-insensitive manner? - Paul --------------------------------------------------------------------- To unsubscribe, ...
Paul J. Lucas
Jul 1, 2008 at 12:59 am
Jul 18, 2008 at 5:33 pm -
I need to perform a query for a term that may or may not have values, and I need to check for the conditions where either no terms are indexed OR any and ALL indexed terms match a wildcard. For ...
Ronald Rudy
Jul 10, 2008 at 9:54 pm
Jul 21, 2008 at 7:06 pm -
Hi, Every time I send a mail to this list, I get the below error. Any idea where is the problem ? It also appears that my mails are actually reaching the list. Any help in rectifying this is ...
Preetam Rao
Jul 15, 2008 at 8:55 am
Jul 17, 2008 at 8:03 am -
Hi, I am indexing content and searching using lucene. It is working fine when I use the simple servlet and jsp mechanism. I am able to search on the indexed content. I tried to implement the same ...
Yugana
Jul 3, 2008 at 4:42 am
Jul 4, 2008 at 3:54 pm -
Can someone explain this to me? After indexing I can see the terms I expect in the top terms using Luke but then when I search I get no results?? This is really bizarre and is blocker for me. Thanks. ...
Samd
Jul 24, 2008 at 6:46 pm
Jul 24, 2008 at 7:15 pm -
Hi, I am indexing lots of text files and need to see how many times a certain word comes up in each text file. Right now I have this constructor for "search": static void search(Searcher searcher, ...
Jnance
Jul 9, 2008 at 1:50 pm
Jul 11, 2008 at 6:40 pm -
According to SVN history on the next version this will be available: LUCENE-1044: IndexWriter with autoCommit=true now commits (such that a reader can see the changes) far less often than it used to. ...
Eric Diaz
Jul 8, 2008 at 3:40 pm
Jul 10, 2008 at 9:45 pm -
I just did an update from lucene 2.2.0 to 2.3.2 and thought I'd give some kudos for the indexing performance enhancements. The lucene indexing portion is about 6-8 times faster. Previously we were ...
Beard, Brian
Jul 9, 2008 at 1:04 pm
Jul 10, 2008 at 3:21 pm -
Hi, I have been using a RAMDirectory for indexing without any problem, but I then moved to a file based directory to reduce memory usage. this has been working fine on Windows and OSX and my version ...
Paul Taylor
Jul 8, 2008 at 8:04 am
Jul 8, 2008 at 4:15 pm -
Hi there, I want to index email address in such a way that i can do WildCard, Phrase and Simple search on those items. for each document i will have email addresses string just like in the case of CC ...
Miztaken
Jul 3, 2008 at 11:31 am
Jul 7, 2008 at 5:23 am -
Hi, I'm implementing a custom IndexDeletionPolicy. An IndexCommit object does not have any information about whether its index is optimized or not. How can an IndexDeletionPolicy know which IndexCommit ...
Shalin Shekhar Mangar
Jul 1, 2008 at 10:48 am
Jul 2, 2008 at 5:39 pm -
Dear fellow Java/Lucene developers: I have a question on creating an index from an XML document for the purpose of searching using the Lucene API in Java. I am searching shakespeare's "Hamlet" which ...
Syedfa
Jul 27, 2008 at 5:59 pm
Jul 29, 2008 at 4:41 am -
Hi, I am writing a class to report on an index. This index has documents updated using the IndexWriter.updateDocument(Term, Document) method. That is, documents were deleted and added again. My aim ...
ನಾಗೇಶ್ ಸುಬ್ರಹ್ಮಣ್ಯ (Nagesh S)
Jul 25, 2008 at 11:18 am
Jul 25, 2008 at 2:57 pm -
Scaling
Is there some sort of listing of scaling strategies available? I think there is a Wiki page missing. What are the typical problems I'll encounter when distributing the search over multiple machines? ...
Karl Wettin
Jul 16, 2008 at 1:42 pm
Jul 18, 2008 at 4:52 pm -
Hi, Sorry if you get this mail second time. Having some trouble with mail box. Is there a query in Lucene which matches sub phrases ? For example if the document text is "new york existing homes *3 ...
Preetam Rao
Jul 14, 2008 at 5:15 pm
Jul 15, 2008 at 10:06 am -
Hi, I have some questions about indexing: 1. Is it possible to open indexes with Multireader+IndexSearcher and add documents to these indexes simultaneously? 2. Is it possible to open indexes with ...
Spring
Jul 13, 2008 at 1:00 pm
Jul 15, 2008 at 9:51 am -
hello - if I make a query and get the document ids and delete with the document id - could there be a side effect? my index is committed periodically, but i can not say when it is committed. best ...
Cam Bazz
Jul 23, 2008 at 8:09 pm
Aug 1, 2008 at 9:18 pm -
Could any one tell me please how to print the content of the document after reading the index. for example if i like to print the index terms then i do : IndexReader ir = IndexReader.open(index); ...
Starz10de
Jul 22, 2008 at 6:53 pm
Jul 30, 2008 at 6:57 pm -
Hi, we have a system to archive mails and are facing some issues with search and indexing performance. The following is what we are currently facing challenges with: we are ...
Mazhar Lateef
Jul 27, 2008 at 8:41 pm
Jul 28, 2008 at 9:34 am -
Hi all, I am searching for a way to ignore XML tags in the input when indexing. Is there a built in functionality in Lucene to get this done? I am sorry if this was discussed before. I searched but ...
Kalani Ruwanpathirana
Jul 24, 2008 at 6:18 am
Jul 25, 2008 at 12:48 pm -
Everything I've read and seen about Lucene is about searching for keywords in documents; I want to do the reverse. I have a huge list of keywords ("big boy", "red ball", "computer") and I have phrases that I ...
Ryan Detzel
Jul 23, 2008 at 7:31 pm
Jul 23, 2008 at 9:17 pm -
If a complicated query is running in a Thread, how does Lucene respond to Thread.interrupt()? I want to be able to interrupt an in-progress query. - Paul ...
Paul J. Lucas
Jul 16, 2008 at 8:44 am
Jul 22, 2008 at 9:52 pm -
If a SpanQuery is constructed from one or more BoostingTermQuery(s), the payloads on the terms are never processed by the SpanScorer. It seems to me that you would want the SpanScorer to score the ...
Peter Keegan
Jul 9, 2008 at 6:56 pm
Jul 19, 2008 at 2:49 pm -
Hi. Currently using Lucene 2.3.2 in a tomcat webapp. We have an action configured that performs reindexing on our staging server. However, our live server can not reindex since it does not have the ...
Christopher Kolstad
Jul 10, 2008 at 1:16 pm
Jul 11, 2008 at 12:42 pm -
The best strategy. Hello. I want to ask your opinion about how to store multiple fields of the same document. I see two possibilities now. 1. Multiple fields in the document 2. One field: for example named ...
Sergey Kabashnyuk
Jul 31, 2008 at 2:37 pm
Aug 1, 2008 at 6:51 pm -
Hello, I've filled an index with 1100 text files with the names "monisys1" to "monisys1100". If I start a WildcardQuery WildcardQuery query = new WildcardQuery(new Term("fileId","monisys*")); Hits ...
Zoeppi
Jul 30, 2008 at 7:31 am
Jul 31, 2008 at 7:05 pm -
FYI -- there is a nasty bug that affects Lucene in Sun's 1.6 hotspot compiler, starting with 1.6.0_04. At least 3 known cases have been seen on this list. Details are here: ...
Michael McCandless
Jul 30, 2008 at 6:10 pm
Jul 31, 2008 at 11:50 am -
I need to execute a boolean query and get back just the bits of all the matching documents. I do additional filtering (date ranges and entitlements) and then do my own sorting later on. I know that ...
Robert Stewart
Jul 22, 2008 at 7:41 pm
Jul 28, 2008 at 2:40 pm -
Hi all, I need to replace some db queries with lucene due to response time issues for sure. In this special case I need to do a range query on a field and a prefix query. I'm trying to prepare and ...
Thomas Becker
Jul 25, 2008 at 8:54 am
Jul 25, 2008 at 9:36 am -
Hi there, I know Lucene is for indexing and not for frequent updates and deletes. But I have been using Lucene to store my matrix as a document. Since with my algorithm the value of matrix can change ...
Miztaken
Jul 8, 2008 at 7:34 pm
Jul 15, 2008 at 7:32 am -
Hi, Can someone point me in the right direction please? How can I trap this situation correctly? I receive user queries like this (quotes included): /from:"fred flintston*"/ Which produces a query ...
Chris Bamford
Jul 3, 2008 at 1:39 pm
Jul 4, 2008 at 4:19 pm -
Hello, I don't have a good understanding of the options for avoiding the corrupted index problem described in LUCENE-1282. It seems to me that I either downgrade JRE from 1.6.0_06 to 1.6.0_03, or wait ...
Dan
Jul 1, 2008 at 8:02 pm
Jul 2, 2008 at 1:23 am -
I seem to recall some discussion about updating a payload, but I can't find it. I was wondering if it were possible to use a payload to implement 'modify' of a Lucene document. For example, I have an ...
Antony Bowesman
Jul 30, 2008 at 11:06 am
Jul 31, 2008 at 9:34 am -
Hello, wasn't there a Lucene delete-by-query feature coming up? I remember something like that, but I could not find any references. Best regards, -c.b.
Cam Bazz
Jul 23, 2008 at 1:53 pm
Jul 24, 2008 at 10:21 am -
Hello all, In my project we are indexing the US states. When we try to search on Oregon (state:OR), the search on OR throws an error. I know OR is a logical operator in Lucene. Is there a way to escape such ...
Aravind Yarram
Jul 22, 2008 at 1:29 pm
Jul 22, 2008 at 2:13 pm -
Hi all, This is the exception raised when I am indexing the records (I have 10 million records, and after indexing 4 million records I got this exception) java.io.IOException: background merge hit ...
Sandyg
Jul 12, 2008 at 10:09 am
Jul 21, 2008 at 10:10 am -
Hi, I have a set of indices in different languages (very small indices: on average each index directory has 10,000 documents, with an overall size of less than 2 MB). I want to know if this is a ...
Mohsen Saboorian
Jul 17, 2008 at 6:30 am
Jul 21, 2008 at 2:05 am -
Hello, Could someone please confirm that calling indexWriter.optimize() is the only way to clean out the deleted documents from the disk? I understand that indexWriter.deleteDocuments() does not ...
Dan
Jul 18, 2008 at 5:48 pm
Jul 18, 2008 at 9:51 pm -
Hi, I'm in the process of trying to optimize searches and avoid the dreaded OutOfMemoryError s. We currently return the entire document from each of the search results and then filter the results ...
Declan Newman
Jul 14, 2008 at 6:47 pm
Jul 18, 2008 at 6:57 pm -
You need to include ISOLatin1AccentFilter in your analyzer. That will convert all accented characters to their non-accented version. ------Original Message------ From: Aamir.Yaseen@globaldatapoint.com To: ...
Anand Jain
Jul 16, 2008 at 9:02 am
Jul 16, 2008 at 1:34 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 101 |
posts | 522 |
users | 116 |
website | lucene.apache.org |
116 users for July 2008