Search Discussions
-
hello everyone, I am starting to understand lucene in java and I am having a hard time in implementing it. I am trying to develop a java application that can do indexing, searching and whatnot. and ...
Yakob
Jul 28, 2010 at 4:06 am
Aug 13, 2010 at 7:58 am -
Any hints on making something like an InverseWildcardQuery? We're trying to find all documents that have at least one field that doesn't match the wildcard query. Or is there a way to inverse any ...
Justin
Jul 30, 2010 at 2:30 pm
Jul 30, 2010 at 8:31 pm -
Hi, A customer has been indexing a very large collection of documents that has been running over many days using 2.9.0. During the optimisation stage, the following error occurred, and now the index ...
David Sitsky
Jul 26, 2010 at 4:22 am
Jul 29, 2010 at 10:33 am -
I'm having trouble with the IndexReader class as per below: (using lucene 2.9.1) RAMDirectory dir = new RAMDirectory(); createIndex(dir); IndexReader reader = IndexReader.open(dir); IndexReader ...
Gregory Tarr
Jul 30, 2010 at 8:16 am
Aug 21, 2010 at 12:56 am -
Hi! We are using lucene in our project to search through information objects which works fine. For indexing we use the StandardAnalyzer. Now, we have to support the Chinese language. I found out that ...
Kolhoff, Jacqueline - ENCOWAY
Jul 1, 2010 at 9:20 am
Jul 2, 2010 at 7:10 am -
We're getting up there in terms of corpus size for our Lucene indexing application: * 20 million documents * all fields need to be stored * 10 short fields / document * 1 long free text field / ...
Christopher Condit
Jul 13, 2010 at 9:54 pm
Jul 16, 2010 at 8:35 am -
Hi, Is there a way to get all the fields involved in a query? Thanks Anuj
Anuj Shah
Jul 31, 2010 at 1:42 pm
Jan 16, 2013 at 9:21 pm -
Hi, I just performed two queries which, in my opinion, should lead to the same document rankings. However, the document ranking differ between these two queries. For better understanding I prepared ...
Philippe
Jul 21, 2010 at 9:27 pm
Jul 27, 2010 at 3:31 pm -
the index file is ill-formated because disk full when feeding. Can I roll back to last version? Is there any method to avoid unexpected errors when indexing? attachments are my segment_N
Li Li
Jul 7, 2010 at 2:43 am
Jul 7, 2010 at 3:34 am -
We recently upgraded from lucene 2.4.0 to lucene 3.0.2. Our load testing revealed a serious performance drop specific to traversing the list of terms and their associated documents for a given ...
Nader, John P
Jul 28, 2010 at 6:40 pm
Aug 19, 2010 at 9:39 am -
Hi, I want to rank my results only on parts of my query. E.g my query is "TITLE:Lucene AND AUTHOR:Manning". After this query standard lucene ranking for both fields take place. However, is it ...
Philippe
Jul 31, 2010 at 8:05 am
Aug 1, 2010 at 7:44 am -
Hello, we are trying to implement a query type for Lucene (with eventual target being Solr) where the query string passed in needs to be "filtered" through a large list of document IDs per user. We ...
Martin J
Jul 21, 2010 at 12:38 pm
Jul 23, 2010 at 5:53 pm -
Hi all, Consider the following string: "the buffalo buffaloes" [1]. When passed through a stemming analyzer, the resulting token would be "buffalo buffalo" (assuming a good stemmer). To enable exact ...
Itamar Syn-Hershko
Jul 16, 2010 at 3:30 pm
Jul 22, 2010 at 9:45 pm -
Hi, I would like to continuously iterate over the documents in my lucene index as the index is updated. Kind of like a "stream" of documents. Is there a way I can achieve this? Would something like ...
Max Lynch
Jul 13, 2010 at 9:18 pm
Jul 15, 2010 at 3:40 pm -
Hello Friends; Recently, I have problem with lucene search - memory problem on the basis that indexed file is so big. (I have indexed some kinds of information and this indexed file's size is nearly ...
Ilkay polat
Jul 14, 2010 at 10:45 am
Jul 15, 2010 at 12:52 am -
i am trying to search for a value which begins with a '$' or even sometimes '$$'. '$' is not listed as a special character and no matter what i try, i can not get a search for $* to return anything. ...
Nathaniel Auvil
Jul 1, 2010 at 6:57 pm
Jul 9, 2010 at 6:57 pm -
Hello everybody, I am reading the file format paper and I check it against a created index. The documentation says: TermInfoIndex (.tii)-- TIVersion, IndexTermCount, IndexInterval, SkipInterval, ...
Alexander vom Berg
Jul 21, 2010 at 8:52 am
Jul 27, 2010 at 6:20 pm -
Hi all! hmmm, i need to get how important is the word in entire document collection that is indexed in the lucene index. I need to extract some "representable words", lets say concepts that are ...
Xaida
Jul 23, 2010 at 2:44 am
Jul 23, 2010 at 11:44 am -
Markus Roth
Jul 30, 2010 at 1:30 pm
Jul 30, 2010 at 2:02 pm -
Hi, I'm about to write an application that does very simple text analysis, namely dictionary based entity entraction. The alternative is to do in memory matching with substring: String text; // could ...
Geir Gullestad Pettersen
Jul 22, 2010 at 10:31 pm
Jul 28, 2010 at 8:54 am -
Hi, Normally, when I am building my index directory for indexed documents, I used to keep my indexed files simply in a directory called 'filesToIndex'. So in this case, I do not use any standar ...
Manjula wijewickrema
Jul 23, 2010 at 5:46 am
Jul 28, 2010 at 6:43 am -
Hi all, I have an interesting problem...instead of going from a query to a document collection, is it possible to come up with the best fit query for a given document collection (results)? "Best fit" ...
Skant
Jul 23, 2010 at 6:31 am
Jul 23, 2010 at 12:33 pm -
I am using lucene 2.9.3 (via Solr 1.4.1) on windows and am trying to understand ShingleFilter. I wrote the following code and find that if I provide more words than the actual phrase indexed in the ...
Ethan Collins
Jul 13, 2010 at 7:43 am
Jul 14, 2010 at 10:00 am -
Hi, I run a single programme to see the way of scoring by Lucene for single indexed document. The explain() method gave me the following results. ******************* Searching for 'metaphysics' ...
Manjula wijewickrema
Jul 9, 2010 at 7:22 am
Jul 12, 2010 at 7:49 am -
Hi, In my application, I input only one index file and enter only single term query to check the lucene score. I used explain method to see the way of obtaining results and system gave me the result ...
Manjula wijewickrema
Jul 8, 2010 at 3:45 am
Jul 9, 2010 at 10:31 am -
Hi, For Lucene 3.0.2, issue LUCENE-2421 ( https://issues.apache.org/jira/browse/LUCENE-2421) changed NativeFSLock.release to not raise an exception if a write.lock file could not be deleted since the ...
Ted McFadden
Jul 7, 2010 at 5:59 am
Jul 8, 2010 at 1:25 pm -
Hi, In my application, I input only single term query (at one time) and get back the corresponding scorings for those queries. But I am little struggling of understanding Lucene scoring. I have ...
Manjula wijewickrema
Jul 5, 2010 at 9:03 am
Jul 7, 2010 at 8:36 am -
Hi all, Is it possible to run a search over top 100,000 (for example) results of a prior search. So if the user first does the search, gets results, if pressing on the search button again, I would ...
Liat oren
Jul 6, 2010 at 8:33 am
Jul 6, 2010 at 9:06 am -
Hi All, I'm trying to use the patch for testing, provided in the issue. I downloaded the patch and the dependency *LUCENE-2453 <https://issues.apache.org/jira/browse/LUCENE-2453 *. I tested this ...
Utku Can Topçu
Jul 23, 2010 at 5:00 pm
Aug 17, 2010 at 7:26 pm -
Hi, I heard work is being done on re-writing MultiPassIndexSplitter so it will be a single pass and work quicker. I was wondering if this is already done or when is it due ? Thanks
Yatir Ben Shlomo
Jul 22, 2010 at 2:54 pm
Aug 5, 2010 at 5:15 pm -
Hi, for some queries I'm only interested in the number of matching documents. Is there a better/faster way to perform such a query, instead of retrieving all TopDocs and counting the number of ...
Philippe
Jul 26, 2010 at 1:19 pm
Jul 26, 2010 at 3:06 pm -
Hey All, I am using Apache Lucene (2.9.1) and its fast and it works great! I have a question in connection with Apache PDFBox. The following command creates a Lucent Document from a PDF file: ...
Joe Hansen
Jul 19, 2010 at 10:32 pm
Jul 20, 2010 at 12:08 am -
Hi, I'm trying to run ant task "generate-maven-artifacts" in lucene-solr build.xml file. But getting this error: /home/chardex/lucene/dev/lucene/common-build.xml:312: Error deploying artifact ...
Pavel Minchenkov
Jul 16, 2010 at 3:36 pm
Jul 19, 2010 at 4:43 pm -
Hello, I'm a newbie to Lucene and before starting playing with it I would like to know whether it fits to my application. I have a collection of XML documented demarcated with respect to a stable XML ...
Maciej
Jul 15, 2010 at 3:09 pm
Jul 16, 2010 at 10:09 am -
Hi, I have seen that, onece the field length of a document goes over a certain limit ( http://lucene.apache.org/java/2_9_3/api/all/org/apache/lucene/index/IndexWriter.html#DEFAULT_MAX_FIELD_LENGTH ...
Manjula wijewickrema
Jul 12, 2010 at 8:01 am
Jul 13, 2010 at 4:43 pm -
I am extremely impressed with Lucene and would like to thank Naveen and Otis for your kind help. I am not really a Java person, I am a perl and C++ guy and my website is done with mod_perl. So, my ...
Igor Chudov
Jul 9, 2010 at 5:18 am
Jul 9, 2010 at 7:37 am -
Hi, what would be the fastest way to get all terms for all documents matching a specific query? Sofar I: 1.) Query the index 2.) Retrieve all scoreDocs 3.) Iterate the scoreDocs and retrieve all ...
Philippe
Jul 27, 2010 at 12:51 pm
Jul 28, 2010 at 1:16 pm -
Consider the following two documents which I have added to my index: doc.add( new Field("text", "hello world", Field.Store.YES, Using the StandardQueryParser I can retrieve my document with either of ...
Geir Gullestad Pettersen
Jul 27, 2010 at 8:19 pm
Jul 27, 2010 at 9:10 pm -
Hi, is there a possibility to retrieve the lengthNorm for all (or a specific) fields in a specific document? Regards, Philippe --------------------------------------------------------------------- To ...
Philippe
Jul 19, 2010 at 1:54 pm
Jul 19, 2010 at 2:28 pm -
Hi there, I have been recently trying to build a lucene index out of ngrams and seem to have stumbled on to a number of issues. I first tried to use the NGramTokenizer, but that thing apparently only ...
Martin
Jul 17, 2010 at 8:30 pm
Jul 17, 2010 at 9:53 pm -
I'm examining the following search problem. Consider a document with two multi-value fields. Document doc = new Document(); doc.add(new Field("f1", "a1", Field.Store.YES, Field.Index.ANALYZED)); ...
Hans-Gunther Birken
Jul 9, 2010 at 12:44 pm
Jul 9, 2010 at 6:39 pm -
Hello, My name is Igor and I own a website algebra.com. I just joined. I have a database of answered algebra questions (208,000 and growing). A typical question is here (original spelling): ``who ...
Igor Chudov
Jul 8, 2010 at 10:14 pm
Jul 9, 2010 at 5:12 am -
I used to store full text into lucene index. But I found it's very slow when merging index because when merging 2 segments it copy the fdt files into a new one. So I want to only index full text. But ...
Li Li
Jul 7, 2010 at 6:09 am
Jul 7, 2010 at 6:30 am -
it is said that "At a few thousand ~160 characters long documents InstantiatedIndex outperforms RAMDirectory some 50x, 15x at 100 documents of 2000 characters length, and is linear to RAMDirectory at ...
Li Li
Jul 2, 2010 at 6:34 am
Jul 7, 2010 at 2:59 am -
Hello All, Can someone explain to me how fielded queries work with phrases? My first thought is that the phrase is broken down into terms and those terms are then fielded and separated with the AND ...
Thomas Nguyen
Jul 6, 2010 at 8:20 pm
Jul 7, 2010 at 2:46 am -
Hi all. I've been dealing with a small problem when searching and trying to sort and filter on a NumericField using Lucene 2.9.2; the result never comes back as expected. Here are some snippets of my ...
Eduardo Pierdant
Jul 6, 2010 at 5:49 pm
Jul 6, 2010 at 6:43 pm -
Hi, I am currently working on a Lucene module that makes use of controlled SKOS vocabularies (http://www.w3.org/TR/skos-primer/) during index and search time. It should work similar to Lucene's ...
Bernhard Haslhofer
Jul 6, 2010 at 1:03 pm
Jul 6, 2010 at 2:37 pm -
Working on the nightly build of solr and lucene - MultiPhraseQuery throws ArrayIndexOutOfBounds Exception for the words defined as synonyms SEVERE: java.lang.ArrayIndexOutOfBoundsException: 5 at ...
Jayendra patil
Jul 30, 2010 at 3:21 pm
Jul 30, 2010 at 6:11 pm -
Hi, I'm trying to implement a query for phrases without strict ordered and with missing words. At the moment, I'm trying the Spans infrastructure and this problem just arised. NearSpansOrdered's ...
Santiago M. Mola
Jul 29, 2010 at 9:24 am
Jul 29, 2010 at 10:49 am -
Hi, Can any one clarify me difference between lucene index and database index? I am just trying to understand how lucene stores index, like databases store index as b-tree's. Thank in advance, ...
Shravan
Jul 27, 2010 at 2:22 am
Jul 27, 2010 at 8:21 am
Group Overview
group | java-user |
categories | lucene |
discussions | 71 |
posts | 296 |
users | 97 |
website | lucene.apache.org |
97 users for July 2010
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)