Search Discussions
-
Hi, Doro Cohen Thanks for your reply, but I am facing a small problem over here. As I am using notepad for coding, then in which format the file should be saved. public static final String[] ...
Liaqat Ali
Dec 26, 2007 at 7:37 pm
Dec 27, 2007 at 10:48 am -
hi, i am using lucene for the very first time and want to manipulate the results, by adding some more factors to it, which file should i edit to manipulate the search results.... Thanks Sumit Tyagi ...
Sumittyagi
Dec 21, 2007 at 4:51 am
Feb 21, 2008 at 12:47 am -
Hi, I need to index a Wikipedia dump. I know there is code in contrib/benchmark for indexing *English* Wikipedia for benchmarking purposes. However, I'd like to index a non-English dump, and I ...
Otis Gospodnetic
Dec 12, 2007 at 5:35 am
Jan 3, 2008 at 3:35 am -
I have an index of about 10mb. Since it's so small, I would like to keep it loaded in memory, and reload it about every minute or so, assuming that it has changed on disk. I have the following code, ...
Ruslan Sivak
Dec 11, 2007 at 10:38 pm
Dec 13, 2007 at 5:27 pm -
Hy, I got a large index and when searching for a term I want the newer documents be at the begining of the result set. I dont need a real order by time but lucene should prioritze the newer ...
Dominik Bruhn
Dec 30, 2007 at 12:54 am
Jan 12, 2008 at 1:07 am -
I have the following code for search: BooleanQuery bQuery = new BooleanQuery(); Query queryAuthor; queryAuthor = new TermQuery(new Term(IFIELD_LEAD_AUTHOR, author.trim().toLowerCase())); ...
Sirish Vadala
Dec 17, 2007 at 6:38 pm
Dec 18, 2007 at 9:07 pm -
Suppose I have an index containing the terms impostor, imposter, fraud, and fruad, then presumably regardless of whether I spell impostor and fraud correctly, Lucene SpellChecker will offer the ...
Smokey
Dec 3, 2007 at 3:14 am
Dec 11, 2007 at 7:27 pm -
Hi there I am having a problem in using escape characters with lucene demo code. I used the following code for IndexFiles and SearchFiles. The code works fine for regular searching and also with ...
Baljeet Dhaliwal
Dec 19, 2007 at 10:07 pm
Dec 21, 2007 at 6:39 pm -
Hello All, I am seeing this issue and would like to understand if its a bug or I am missing something and doing the wrong way: (Note that I am doing all exception handling - but deleted the exception ...
Tushar B
Dec 19, 2007 at 10:10 am
Dec 20, 2007 at 12:57 pm -
Hi all, I am facing problem with the following multifield query: i_title:indoor* i_description:indoor* -i_published:false +i_topicsClasses.id:1_1_*_* The above query returns me even results which ...
Rakesh Shete
Dec 18, 2007 at 6:22 pm
Dec 19, 2007 at 12:34 pm -
Dear Fellow Java & Lucene developers: I am a Java developer learning lucene and I am currently going through the book Lucene in Action. At present, I am trying to run the sample code for indexing an ...
Syedfa
Dec 5, 2007 at 3:23 am
Dec 10, 2007 at 6:58 pm -
I'm not even sure if it can be considered Named Entity Recognition, but what the hell... so here's my problem... I was asked to retrieve a the named entities out of a collection of documents, and ...
Chris.b
Dec 12, 2007 at 9:45 am
Jan 9, 2008 at 4:35 pm -
Hello, I'm building a ticketing system for my company and am using Lucene for some of the more complicated queries. I'd say my application differs from the typical lucene application in that my ...
Bob Daha
Dec 10, 2007 at 7:57 pm
Dec 12, 2007 at 4:10 am -
Hi all, I am using Hibernate Search (http://www.hibernate.org/410.html) which is a wrapper around Lucene for performing search over info stored in the DB. I have questions related to Lucene boosting ...
Rakesh Shete
Dec 21, 2007 at 3:51 am
Dec 24, 2007 at 3:10 pm -
Hi, I am trying to retreive documents from an index. Each document has a date field and other fields. while making a seach i want to give some extra boost to the more recent items (as per the date ...
Prabin meitei
Dec 19, 2007 at 9:51 am
Dec 20, 2007 at 1:57 pm -
I have a few fields that use package names and class names and I've been looking for some suggestions for analyzing these fields. A few examples - Text (class name) - ...
Beyer,Nathan
Dec 15, 2007 at 11:17 pm
Dec 18, 2007 at 4:54 am -
Hi, since I need highlighting, I need to 'rewrite' a query. Query.rewrite takes an object of type IndexReader But what for? As I understand it, rewrite transforms a possibly complicated query into an ...
Helmut Jarausch
Dec 13, 2007 at 11:40 am
Dec 17, 2007 at 3:09 pm -
Hello, I'm looking for suggestions on how to deal with the following (simplified) scenario (Lucene 2.2.0): Documents in my index have some number of fields that are searched in various combinations ...
Tom Emerson
Dec 7, 2007 at 6:42 pm
Dec 16, 2007 at 3:49 pm -
Hi, It's been a while since I've written a custom TokenFilter, and I'm not having luck getting tokens out of the TokenStream using 2.3-dev. I'm hitting that default term buffer of the size 10 using ...
Otis Gospodnetic
Dec 8, 2007 at 10:44 pm
Dec 12, 2007 at 4:07 am -
I have developed a fuzzy search application over a database of books (titles, authors etc) and it works really well. (I use Lucene.Net but read the JavaDocs and forums for java Lucene) However I've ...
D33mb33
Dec 7, 2007 at 11:34 am
Dec 11, 2007 at 2:00 pm -
Happy festivus everyone, So I have my fancy new stemmed synonym based Lucene index. Let's say I have the following synonym defined: radiation - radiotherapy (and the reverse) The search results rank ...
Frank Schima
Dec 27, 2007 at 9:19 pm
Jan 3, 2008 at 9:56 pm -
Hi, What is the most efficient way to do pagination in Lucene? I have always done the following because this "flavor" of the search call allows me to specify the top N hits (e.g. 1000) and a Sort ...
Dragon Fly
Dec 22, 2007 at 3:20 pm
Dec 27, 2007 at 6:48 pm -
Hello, I would like to search documents by "CUSTOMER". So I search on the field "CUSTOMER" using a KeywordAnalyzer. The CUSTOMER field is indexed with those params: Field.Index.UN_TOKENIZED ...
Webspeak
Dec 27, 2007 at 2:34 pm
Dec 27, 2007 at 3:40 pm -
Do you guys have article links or other documents to describe the lucene database. Eg. what is it composed of? -- Berlin Brown http://botspiritcompany.com/botlist/spring/help/about.html ...
Berlin Brown
Dec 23, 2007 at 2:11 am
Dec 26, 2007 at 5:18 pm -
Hi guys, I met some trouble in optimizing the index. The index looks fine in Luke and I can carry out the search in the index. However, when I try to merge all these seperated files into a complete ...
Zhou Qi
Dec 22, 2007 at 5:22 am
Dec 26, 2007 at 1:17 pm -
Hello, I am using Lucene to build an index from roughly 10 million documents in number. The documents are about 4 TB in total. After some trial runs, indexing a subset of the documents I am trying to ...
V k
Dec 18, 2007 at 5:03 am
Dec 22, 2007 at 4:23 pm -
4
anyone
Is this at least a semi-active list? JamesHartrich, James CTR USTRANSCOM J6
Dec 19, 2007 at 7:01 pm
Dec 19, 2007 at 8:10 pm -
Hi All, I am parsing this query: "Auto* machine"~4. Will it work? If yes then right now it's not working. Can anyone help on this? Thanks & Regards Shakti Sareen DISCLAIMER: This email (including any ...
Shakti_Sareen
Dec 12, 2007 at 8:55 am
Dec 18, 2007 at 9:22 am -
I have an index that contains three sorts of documents: Car brand Tire brand Tire pressure (Please bear with me, the real index has nothing to do with cars. I just try to explain the problem in an ...
Karl Wettin
Dec 14, 2007 at 5:06 pm
Dec 17, 2007 at 1:32 am -
Hi, I know how to set DEFAULT_OPERATOR_AND for an individual QueryParser Objekt (after creation) Since I always want this to be set, is there a means to set a (global) option such that any ...
Helmut Jarausch
Dec 11, 2007 at 2:45 pm
Dec 12, 2007 at 6:35 am -
Hi I am trying to run a code from Lucene In Action, but it generate some errors.There is one one warning at compilation time and the errors generate at run time. Given below the code and errors. ...
Liaqat Ali
Dec 6, 2007 at 9:49 am
Dec 6, 2007 at 3:25 pm -
Does anyone know why JVM heap use almost doubles at the very end when indexing in memory? around 9 megs @ 1:03 min into indexing - around 18 megs @ 1:05 min when indexing is complete - heap use jumps ...
Tgospodinov
Dec 27, 2007 at 5:19 pm
Dec 27, 2007 at 6:14 pm -
Hello all, I'm trying to implement a synonym engine in Lucene 2.2 based on the code in the Lucene In Action book. However, I'm getting compile errors: My Synonym filter looks like this: import ...
Frank Schima
Dec 27, 2007 at 3:56 pm
Dec 27, 2007 at 5:44 pm -
I don't care about score, but I do care about the # of times a query was hit within a document? example: the quick brown fox jumped over the lazy dog the quick brown fox jumped over the lazy dog the ...
Jeff
Dec 20, 2007 at 7:54 pm
Dec 21, 2007 at 3:38 am -
Hi, please help I am totally puzzled. The same query, once with a direct call to FuzzyQuery succeeds while the same query with QueryParser fails. What am I missing? Sorry, I'm using pylucene (with ...
Helmut Jarausch
Dec 17, 2007 at 9:28 am
Dec 17, 2007 at 6:01 pm -
There's an interesting article on state-of-the-art setup with Mtron Solid State Drives at http://www.nextlevelhardware.com/storage/battleship/ The concise version is that Mtron flash drives puts all ...
Toke Eskildsen
Dec 14, 2007 at 10:59 am
Dec 16, 2007 at 6:50 pm -
Hello, I got a quick question. I am handling hughe CSV files. They start with a key in the first column and are followed by data. I need to retrieve randomly this data based on the key. So it is kind ...
Tobias Rothe
Dec 13, 2007 at 11:26 pm
Dec 15, 2007 at 5:08 am -
Hello, I am looking for some advice regarding which tools I might use to solve my problem. I apologize ahead of time for the long explanation. Problem Description: I would like to index a set of very ...
Jose Luna
Dec 11, 2007 at 6:30 pm
Dec 12, 2007 at 4:16 pm -
Hi all, I want to index an XML file,containing 200 Urdu language (Varient of Arabic and Persian) documents. This corpus is in CES format,consisting of information about author and many more, I just ...
Liaqat Ali
Dec 4, 2007 at 6:05 pm
Dec 12, 2007 at 6:31 am -
Here goes, I'm developing an application using lucene which will evaluate the representativeness of a list of keywords within a collection of documents. I'm doing this by indexing the documents and ...
Chris.b
Dec 10, 2007 at 10:58 am
Dec 10, 2007 at 11:43 am -
Hello all, I’ve been looking into using the nice power of the SpanNearQuery instead of PhraseQuery, mostly because of the simplification of the slop factors. However, I’m wondering if the ...
Arnone, Anthony
Dec 4, 2007 at 9:42 pm
Dec 5, 2007 at 12:31 pm -
Hello All, I want to calculate the Precision and Recall of the current system, based on Lucene. What should be the procedure and either there are some tools available for this purpose. Kindly guide ...
Liaqat Ali
Dec 29, 2007 at 10:45 am
Dec 30, 2007 at 7:46 am -
Hi all, I encounter a strange probelm. To improve performance, I open the indexreader at the start time and reuse it in later search. I have another process running to do online indexing. The search ...
Zhou Qi
Dec 27, 2007 at 1:59 pm
Dec 27, 2007 at 2:36 pm -
hello, I am try to make an index of 191 documents stored in 191 text files. I developed a program, which works well with files containing single line, but files with multiple lines posing a ...
Liaqat Ali
Dec 25, 2007 at 9:03 pm
Dec 26, 2007 at 6:14 am -
I am getting the following exception when I run our indexer: Unsupported MIME type (text/html;charset=US-ASCII) type so ignoring: http://zfin.org/... It appears if a page Http header does not specify ...
Christian Pich
Dec 21, 2007 at 6:42 pm
Dec 22, 2007 at 12:01 am -
Hi, according to the LiA book the FuzzyQuery distance is computed as 1- distance / min(textlen,targetlen) Given def addDoc(text, writer): doc = Document() doc.add(Field("field", text, ...
Helmut Jarausch
Dec 17, 2007 at 8:43 am
Dec 17, 2007 at 7:16 pm -
Hi, We've got a requirement that we need to give our users the ability to search on exact phrases within a field, or, if they prefer, they can match on plurals(either via stems, or another plural ...
Lucifer Hammer
Dec 12, 2007 at 6:26 pm
Dec 13, 2007 at 3:07 am -
My application batch adds documents to the index using IndexWriter.addDocument. Another thread handles searchers, creating new ones as needed, based on a policy. These searchers open a new ...
Antony Bowesman
Dec 9, 2007 at 11:01 am
Dec 9, 2007 at 11:33 am -
Hi, With Lucene 1.4.3, we had used this constructor for Field. What is its equivalent in Lucene 2.2.0? /** Constructs a String-valued Field that is tokenized and indexed, and is stored in the index, ...
Ts01
Dec 6, 2007 at 8:23 pm
Dec 7, 2007 at 6:28 pm -
I did some searching on the lucene site and wiki, but didn't quite find what I was looking for in regards to a basic approach to how and when to reload index data. I have a long running process that ...
Beyer,Nathan
Dec 6, 2007 at 5:44 pm
Dec 6, 2007 at 7:47 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 88 |
posts | 405 |
users | 103 |
website | lucene.apache.org |
103 users for December 2007
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)