Search Discussions
-
As devs of Lucene/Solr, due to the way ASF mirrors, etc. works, we really don't have a good sense of how people get Lucene and Solr for use in their application. Because of this, there has been some ...
Grant Ingersoll
Jan 18, 2011 at 9:04 pm
Jan 22, 2011 at 7:28 am -
What is the "best practice" to support multiple languages, i.e. Lucene-Documents that have multiple language content/fields? Should a) each language be indexed in a seperate index/directory or should ...
Clemens Wyss
Jan 18, 2011 at 5:54 pm
Jan 20, 2011 at 9:56 pm -
Hi, I'm just migrating our small search customization from Lucene version 2.3 to the current version (3.0.3) and wonder why, in contrast to the old version, we no longer get the Wildcard Queries ...
Wulf Berschin
Jan 25, 2011 at 5:41 pm
Jan 26, 2011 at 4:43 pm -
Hi, I need to parse the Java log files with Lucene 3.0.3. The StandardAnalyzer is OK, except it's handling of dots. E.g. it handles "java.lang.NullPointerException" as one word and searching for ...
Benzion G
Jan 1, 2011 at 5:36 pm
Jan 4, 2011 at 5:48 pm -
Dear Luceners, I'm using lucene-3.0.2 in our app. There is some testing code for switching index, however, when my code run a couple of times, I found the index file was locked, I can not delete the ...
张志田
Jan 12, 2011 at 10:40 am
Jan 13, 2011 at 11:01 am -
Dear All, When using lucene to search documents, the results have a score based on their relativity to the search term. Inside lucene, the score percentage is calculated as a percentage of the ...
Amr ElAdawy
Jan 3, 2011 at 7:16 am
Jan 16, 2011 at 7:32 am -
Hi, is OpenBitSet / SortedVIntList a compressed bit map index? Which one is better if memory usage is the primary concern ? Our filters are sparse. So is SortedVIntList better in that case? Are there ...
First Last
Jan 7, 2011 at 7:55 pm
May 25, 2011 at 2:28 pm -
Hi, Each night I optimize an index that contains 35 millions docs. Its takes about 1.5 hours. For maintenance reasons, it may happen that the machine gets rebooted. In that case, server gets a chance ...
V Sevel
Jan 21, 2011 at 1:31 pm
Jan 26, 2011 at 7:17 pm -
Hi, I am trying to implement a "progressive search" with Lucene. What I mean is that something like what Google does: you type a few letters and google searches for matches as you type. The more ...
L Duperval
Jan 5, 2011 at 4:39 pm
Jan 6, 2011 at 1:45 pm -
I'm building six different indexes in series, at the end of building an index I call optimize() and then close() the writer, then move onto the next one. I build them in series because they are ...
Paul Taylor
Jan 28, 2011 at 9:17 am
Jan 28, 2011 at 9:31 pm -
Hello list, has anyone built a log-analyzer based on Lucene? Our logs are so big that grep takes more hours to do what I want it to do. I'm sure Lucene would solve it. Thanks in advance paul ...
Paul Libbrecht
Jan 13, 2011 at 12:54 pm
Jan 14, 2011 at 9:39 am -
Hi, I've upgraded from 3.00 to 3.0.3 and am now hitting assertion errors from IndexWriter.ReaderPool.commit, at this line: // We invoke deleter.checkpoint below, so we must be Has anyone encountered ...
Anuj Shah
Jan 25, 2011 at 4:18 pm
Jan 31, 2011 at 5:21 pm -
Shouldn't these two queries be fine? (from TREC million query track). Should this be entered as a bug? Thanks, Andrew. Cannot parse 'statistics on child labor laws 1930 -': Encountered "<EOF " at ...
Andrew Kane
Jan 24, 2011 at 10:05 pm
Jan 25, 2011 at 2:42 am -
Hello all, Does anyone know if it is possible in Lucene to do a query based on the string length of the value of a field? For example, if I wanted all index matches where a specific field like ...
Camden Daily
Jan 21, 2011 at 3:16 pm
Jan 21, 2011 at 5:29 pm -
Hi, I am new to lucene. Recently I was assigned for some lucene related workitems. Now there is one problem. Before, we use StandardAnalyzer in our application, and our application has been online ...
黄靖宇
Jan 21, 2011 at 9:05 am
Jan 21, 2011 at 1:39 pm -
Trying to extend MappingCharFilter so that it only changes a token if the length of the token matches the length of singleMatch in NormalizeCharMap (currently the singleMatch just has to be found in ...
Paul Taylor
Jan 20, 2011 at 1:20 pm
Jan 29, 2011 at 11:09 am -
Hi all, I'm new to Lucene and have a question about indexing/highlighting of HTML files with Lucene. What I need to do is highlight the hits (terms) in the original HTML file (or get the positions of ...
Karolina Bernat
Jan 24, 2011 at 1:34 pm
Jan 26, 2011 at 9:53 am -
Hello everybody, I used a small indexing example from "Lucene in Action" and can run and compile the program under eclipse. If I want to compile and run it by console I get this error: ...
Alex vB
Jan 25, 2011 at 3:12 pm
Jan 25, 2011 at 5:31 pm -
Hello, I have a bunch of text documents formatted like so: keyword1 wt1 keyword2 wt2 keyword3 wt3 I would like to index the documents based on the keywords. When I retrieve (search) for a keyword, I ...
Chris Schilling
Jan 24, 2011 at 9:02 pm
Jan 25, 2011 at 2:42 am -
(thanks fort he many answers to my initial lucene question "Best practices for multiple languages?") We shall be confronted with the followong problem: due to the very dynamic access rules on our ...
Clemens Wyss
Jan 20, 2011 at 7:36 am
Jan 21, 2011 at 1:58 pm -
Hi all, I am trying to use *IndexSearcher<http://lucene.apache.org/java/3_0_1/api/core/org/apache/lucene/search/IndexSearcher.html#IndexSearcher%28org.apache.lucene.store.Directory%29 * to retrieve a ...
Yuhan Zhang
Jan 19, 2011 at 7:03 pm
Jan 20, 2011 at 1:36 am -
Hi, We're writing a web application, which naturally needs - "IndexSearcher" when users use our search screen - "IndexWriter" in a background process that periodically updates and optimizes our ...
Sol myr
Jan 13, 2011 at 3:12 pm
Jan 16, 2011 at 12:22 pm -
My index contains multivalued filed like and i use whitespaceAnalyzer DOC 1 : ITEMNAME: item 2 name ITEMNAME: movie tickets ITEMNAME: item 1 name so when search for (+ITEMNAME:item +ITEMNAME:movie), ...
Sailesh
Jan 11, 2011 at 7:58 am
Jan 14, 2011 at 7:16 pm -
Greetings, Is there an easy way to figure out the frequency of words in an index ? I'd like to get, say, the 1000 most often indexed words in order to create an auto-completion cache for my ...
Matthieu Huin
Jan 14, 2011 at 3:43 pm
Jan 14, 2011 at 4:32 pm -
Hi, I am happily using Lucene for several years to offer French lexical analysis tools to university researchers. Today, one of them decided to analyze the use of the French word "or" (meaning "gold" ...
Benoit Mercier
Jan 13, 2011 at 3:38 am
Jan 14, 2011 at 3:21 am -
Our business has a need to allow for multiple values for a single field. For example, we have an index of employers where an employer often has multiple ways people refer to it. For example, the ...
Ryan Aylward
Jan 8, 2011 at 12:33 am
Jan 10, 2011 at 10:16 pm -
Hi, I have an application that continously indexes 140 documents/s (we commit after each second) using lucene 2.9. at the beginning of the test the index is empty. during the test, I monitored this ...
V Sevel
Jan 19, 2011 at 7:32 am
Feb 22, 2011 at 11:50 am -
Hello all, Could you any one guide me what all the various ways we could scale out? 1. Index: Add data to the nodes in round-robin. Search: Query all the nodes and cluster the results using carrot2. ...
Ganesh
Jan 21, 2011 at 5:22 am
Feb 4, 2011 at 6:25 am -
Hi Under LUCENE-2720 the index format of both trunk and 3x has changed. You should re-index any indexes created with either of these code streams. Shai
Shai Erera
Jan 23, 2011 at 5:15 am
Jan 23, 2011 at 7:41 pm -
Hi, I have couple of questions on filtering result set while performing a search in lucene index : 1) I want to filter the document set returned when searching an index based on a match on a ...
Amg qas
Jan 22, 2011 at 7:32 pm
Jan 22, 2011 at 10:00 pm -
Hi all I've got an Index with a few 100k documents and I want to run a rather complex wildcard (incl. leading wildcards) query on it. The wildcard query takes about 2 seconds to complete. Now, I want ...
comparis.ch - Roman Baeriswyl
Jan 20, 2011 at 9:50 am
Jan 22, 2011 at 8:17 pm -
Dear All, I have two documents. The analyzed and the tokenized contents are mentioned below. *Document 1 :* *when*, null_1, *my*, null_1, money, fund, amount, payment, creditcard, credit, card, ...
Lahiru Samarakoon
Jan 18, 2011 at 12:12 pm
Jan 18, 2011 at 1:47 pm -
Hi, I'm maintaining some Lucene-based code, and we're trying to get control over result ordering (users aren't happy with the default). I know how to boost a Field or Document (very useful). But: 1) ...
Pelit Mamani
Jan 16, 2011 at 2:33 pm
Jan 17, 2011 at 8:46 am -
Hi, I'm new to Lucene (using 3.0.3), and just started to check out the behavior of the 'optimize()' method (which is quite important for our application). Could it be that 'optimize' cancels out the ...
Sol myr
Jan 10, 2011 at 5:57 pm
Jan 12, 2011 at 8:35 am -
I'm trying to: StandardQueryTreeBuilder b = …; b.setBuilder( "myfield", fieldSpecificBuilder); In the debugger I see that the builder is registered in the QueryTreeBuilder's fieldNameBuilders map. ...
Christopher St John
Jan 8, 2011 at 1:44 am
Jan 9, 2011 at 4:08 am -
Hi, I have a single IndexWriter object which I use to update the index. After each update, I'd like to query the index using IndexReader and IndexSearcher objects. When I try to do that I get ...
Andreas Harth
Jan 8, 2011 at 4:31 pm
Jan 8, 2011 at 5:39 pm -
Hello, What's a good source to get dictionaries (for spellcorrections) and/or thesaurus (for synonyms) that can be used with Lucene for non-English languages such as Fresh, Chinese, Korean etc? For ...
Pulkit Singhal
Jan 6, 2011 at 4:54 pm
Jan 7, 2011 at 9:36 pm -
Hi, we are calling updateDocument(term, document) method on IndexWriter and after that we are calling close() method of indexWriter. In Close() method i got the following IO exception. ...
Atul Prajapati
Jan 3, 2011 at 6:04 am
Jan 3, 2011 at 1:54 pm -
Lets' say I have documents with following. id text 1 User not found 2 User not found 3 Address not found 4 Fatal error 5 User not found 6 Address not found 7 User not found How can I get each text ...
Benzion G
Jan 1, 2011 at 9:32 pm
Jan 2, 2011 at 8:56 am -
I have been trying to parse & index different portions of an HTML page using Tika & Lucene. For eg. I would like to index text within <Title , <H1 , <H2 , <A tags of a HTML page separately and ...
Amg qas
Jan 11, 2011 at 1:55 am
Feb 25, 2011 at 10:11 pm -
Hi , I have started to use Lucene for searching in HTML files. Is it possible to get Hits per document, when we search for phrases like "Hello World" and wild card searches like "te?t"? I managed to ...
Sharma Kollaparthi
Jan 22, 2011 at 5:47 am
Jan 30, 2011 at 9:25 pm -
Hi! I would like to announce RankingAlgorithm. RankingAlgorithm is a new search algorithm that seems to enable Solr to returns results comparable to Google site search results, and much better than ...
Nagendra Nagarajayya
Jan 27, 2011 at 4:08 pm
Jan 28, 2011 at 2:49 am -
hi, i have been searching for getting the term enum for filtered documents... I have index containing fields "group_id" and "user"..i know that we can easily get unique Terms and their count for ...
Emmad
Jan 26, 2011 at 9:59 am
Jan 26, 2011 at 5:23 pm -
Hi! My index contains a few (really 7) fields and I need to search by all of them. I use BooleanQuery and seven TermQueries added to this one. Problem: result must to be sorted by max(field.boost), ...
Dmytro Barabash
Jan 24, 2011 at 8:39 am
Jan 24, 2011 at 9:32 am -
Hi, I have two question regarding phrase query : 1) How can I execute a phrase query over multiple fields ? I can only get PhraseQuery to work over a single field - For eg something like this : ...
Amg qas
Jan 20, 2011 at 3:14 am
Jan 22, 2011 at 6:56 pm -
Hi, We're trying to create a large index via solr for trends and notice that we have a large '.frq' file after doing the following: make all text fields index="true", stored="false", ...
Dan sutton
Jan 18, 2011 at 12:13 pm
Jan 18, 2011 at 4:11 pm -
Hi All, i have my own query parser which generates fuzzy/wildcard queries instances. It works fantastic, Lucene rocks ;-). But i have to make sure the words are not to far apart. I checked current ...
Livia Hauser
Jan 16, 2011 at 5:42 pm
Jan 17, 2011 at 7:51 am -
Hi all. I discovered there is a normalise filter now, using ICU's Normalizer2 (org.apache.lucene.analysis.icu.ICUNormalizer2Filter). However, as this is a filter, various problems can result if used ...
Trejkaz
Jan 17, 2011 at 12:37 am
Jan 17, 2011 at 1:54 am -
Hi Lucene Users, I work on a product with several thousand clients. We use Lucene to index various client data and make the functionality available as part of our product. Currently, each client has ...
Sean Joyce
Jan 13, 2011 at 5:58 pm
Jan 15, 2011 at 8:50 am -
As recommended, I use just one Index Searcher on my multithreaded GUI app using a singleton pattern If data is modified in the index I then close the reader and searcher, and they will be recreate on ...
Paul Taylor
Jan 13, 2011 at 8:22 pm
Jan 14, 2011 at 7:00 am
Group Overview
group | java-user |
categories | lucene |
discussions | 71 |
posts | 365 |
users | 133 |
website | lucene.apache.org |
133 users for January 2011
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)