Search Discussions
-
Hi, My apps need to read from and write to some big indexes frequently. So I use RAMDirectory instead of FSDirectory, and give JVM about 2GB memory size. I notice that the speed of reading and ...
Cheng
Jun 4, 2012 at 2:08 pm
Jun 30, 2012 at 10:25 am -
hi all I need to return certain fields of all matched documents quickly. I am now using Document.get(field), but the performance is not well enough. Originally I use HashMap to store these fields ...
Li Li
Jun 20, 2012 at 12:49 pm
Jun 25, 2012 at 5:07 pm -
hi I have strings like "drinks - water" and I've read in "Lucene in Action" that the StandardAnalyzer and other analyzers removes the "-" from the string but so far none of them worked... All of them ...
Listas
Jun 25, 2012 at 1:41 am
Jun 26, 2012 at 3:26 am -
Hello, I have checked out lucene 3.6 and I am trying to run the ant jflex. It is throwing a Stackoverflow error when it is trying to execute the target: jflex-UAX29URLEmailTokenizer. Any idea why ...
Bin01
Jun 14, 2012 at 12:59 am
Jun 15, 2012 at 12:32 am -
Hi, I'm trying to brush up on some of the *cough* newer APIs (we've been using 2.9.2 up until now). Anyway, I have the test below which a modified version of one of the tests in Lucene In Action, but ...
Brendan Grainger
Jun 28, 2012 at 11:06 pm
Jun 29, 2012 at 3:28 am -
Hi, I have one query of lucene about sort. I have 10000 documents in my index which having fields A,B,C,D. i want first 100 results in my query but they must be sort by field A. Suppose I have query ...
Yogesh patel
Jun 26, 2012 at 3:06 am
Jun 28, 2012 at 11:02 pm -
Hi there, I have to index chinese content and I don't get the expected results when searching. It seems that the WildcardQuery does not work properly with the chinese characters. See attached sample ...
Paco Avila
Jun 27, 2012 at 10:20 am
Jun 28, 2012 at 7:38 am -
Hi everybody I'm using Lucene3.6 to index Wikipedia documents which is over 3 million article, the data is on a mysql database and it is taking more than 24 hours so far.Do you know any tips that can ...
Elshaimaa Ali
Jun 19, 2012 at 7:08 pm
Jun 20, 2012 at 12:43 am -
http://lucene.apache.org/core/3_6_0/fileformats.html#Frequencies The .frq file contains the lists of documents which contain each term, along with the frequency of the term in that document (except ...
Wangjing
Jun 27, 2012 at 9:40 am
Jun 28, 2012 at 5:00 am -
Hi, We have been using lucene 2.3.2 for years well (yes, we should upgrade). Recently we encountered data corruption error when commiting IndexWriter: /// background merge hit exception: _14b:c61262 ...
Zhang, Lisheng
Jun 30, 2012 at 8:47 pm
Jun 30, 2012 at 10:21 pm -
I'm quite new to Lucene and recently, I ran into a problem. I have a lucene document that looks like this: --- type --- gene --- id --- xla:379474 --- alt_id --- emb:BC054227 gb:BC054227 ...
Secevalliv
Jun 25, 2012 at 1:02 pm
Jun 26, 2012 at 11:02 am -
Hello all, I am tying to write a simple autosuggest functionality. I was looking at some auto suggest code, and came over this post ...
Mansour Al Akeel
Jun 22, 2012 at 10:26 pm
Jun 24, 2012 at 7:42 am -
hi, does anyone knows how to extract meaningful words from Lucene index?
齐保元
Jun 26, 2012 at 9:40 am
Jun 27, 2012 at 10:25 pm -
Our Hit highlighting (Using the older Highlighter) is wired with a "too huge" limit, so we could skip the multi-million character files, not just for highlighter.setMaxDocCharsToAnalyze, but if a ...
Paul Hill
Jun 22, 2012 at 7:24 pm
Jun 25, 2012 at 5:17 pm -
Hi, I am getting the following OOM consistently whenever the index is opened . Is it because now the index is holding too many terms ? Our application ( that has Lucene 2.9.3 ) already has reached ...
Nishesh Gupta
Jun 1, 2012 at 11:52 pm
Jun 6, 2012 at 3:58 am -
Dear, I am using Lucene for my log search tool. Is there a way I can automatically perform a commit operation on my IndexWriter when a particular set of docs is flushed from memory to the disk. My ...
Ramprakash Ramamoorthy
Jun 27, 2012 at 5:55 am
Jun 28, 2012 at 6:54 am -
<EOM --------------------------------------------------------------------- To unsubscribe, e-mail: <span class="m_body_email_addr" title="356411d2f4b6e34d06eca5aa0a7230d4" ...
Deshpande, Vikas
Jun 25, 2012 at 2:00 pm
Jun 25, 2012 at 4:32 pm -
Hello everyone, I am having a problem with a lucene store. When starting an IndexWriter on it, it throws the following exception: Caused by: java.io.IOException: read past EOF ...
Chris Gioran
Jun 19, 2012 at 2:51 pm
Jun 19, 2012 at 11:45 pm -
I've found the class WordnetSynonymParser in org.apache.lucene.analysis.synonym but there aren't examples of its usage neither in the API nor in google. Does any one have experience with it? Thank ...
Kits89
Jun 15, 2012 at 3:10 pm
Jun 18, 2012 at 3:53 pm -
As others have previously proposed on this list, I am interesting in inserting a second token at some positions in my index. I'll call this Limited Index Expansion. I want to retain the original ...
Paul Hill
Jun 12, 2012 at 7:07 pm
Jun 12, 2012 at 11:57 pm -
Hi all, This is driving me crazy. In my data if I search "state" AND "GA" I get hits. If I search "state" AND "OR" or "state" AND "IN" I get no hits even though I can see examples of state AND IN in ...
Bob Rhodes
Jun 7, 2012 at 5:50 pm
Jun 7, 2012 at 11:55 pm -
.fdx file contains, for each document, a pointer to its field data. BUT fdx is contains pointer to WHAT? it's a pointer of field data offset in the fdt file? my app is File file = new File(path) ...
Wangjing
Jun 25, 2012 at 3:28 am
Jun 26, 2012 at 2:43 am -
I imagine this is a question that comes up from time to time, but I haven't been able to find a definitive answer anywhere, so... I'm wondering whether there is some type of Lucene query that filters ...
Mike Sokolov
Jun 16, 2012 at 6:34 pm
Jun 18, 2012 at 2:58 am -
Hi, I'm currently reading "Lucene in action (2nd edition)". At page 105 - section 3.5.4, I'm reading the following paragraph: --- QueryParser won’t create a NumericRangeQuery for you. This is because ...
Jochen Hebbrecht
Jun 12, 2012 at 7:30 am
Jun 12, 2012 at 11:26 am -
Hi, Is there a safe way to forcefully close an IndexWriter that is unable to flush to disk? We're seeing occasional issues where an IndexWriter encounters an IOException on close and does not release ...
Geoff Cooney
Jun 4, 2012 at 2:00 pm
Jun 4, 2012 at 6:51 pm -
Based on this link http://www2002.org/CDROM/refereed/643/node6.html , I'm calculating Okapi similarity between the query document and another document as below using Lucene: I have indexed the ...
Kasun Perera
Jun 19, 2012 at 9:57 am
Jul 17, 2012 at 3:14 am -
Hi, Suppose we have a query "balcony table". I want results to be returned by exact match (first priority) and by single words matching as well (for "balcony" or for "table"). So currently my ...
Sxam
Jun 30, 2012 at 8:55 pm
Jun 30, 2012 at 9:46 pm -
All, I have a question about join support across multiple document types in Solr/Lucene. Let me lay out the use case. Suppose I have 3 tables: * Table A has 3 columns, id, a1, a2. * Table B has 4 ...
Frank DeRose
Jun 29, 2012 at 7:14 pm
Jun 29, 2012 at 8:47 pm -
I'm a fresh man, and courious with inverted index. and who can show a sample with dataset to show work of Lucene. Thanks.
DEW¤
Jun 20, 2012 at 4:11 am
Jun 20, 2012 at 4:16 am -
I want to calculate average document length for document collection which each document having 3 different fields(filed1, field2,field3) This is the program to calculate average length when only one ...
Kasun Perera
Jun 18, 2012 at 3:19 am
Jun 19, 2012 at 9:40 am -
Hi all, I'm searching for a way to reuse a Lucene search. For example, I'm searching for the word "acci". But too many ScoreDocs are returned, and I provide: "accide". Can it reuse the existing ...
Jochen Hebbrecht
Jun 14, 2012 at 12:19 pm
Jun 14, 2012 at 3:36 pm -
I got the OutOfMemoryError when I tried to open an Lucene index. it's very weird since this is only seen when I run this inside an Apache PIG LATIN script on a particular hadoop cluster of ours, and ...
Yang
Jun 13, 2012 at 9:16 pm
Jun 13, 2012 at 11:20 pm -
Hello, I've read the documentation about the TiredMergePolicy class. But I just can't get behind what this sentence is trying to state: [..] For normal merging, this policy first computes a "budget" ...
Thomas
Jun 12, 2012 at 8:44 am
Jun 12, 2012 at 4:01 pm -
I noticed today that my code calls IndexSearcher.search (Query query, Filter filter, Collector collector) But also noticed that the DOCs says "Applications should only use this if they need all of ...
Paul Hill
Jun 8, 2012 at 5:33 pm
Jun 8, 2012 at 5:41 pm -
Hey guys, I'm trying to index nested documents in lucene 3.6. I have the parent document having a 'type' and 'typename' fields and the children having 'value' and 'author' fields. The below snippet ...
Ananth V
Jun 8, 2012 at 10:04 am
Jun 8, 2012 at 10:42 am -
I was looking at the Lucene API for IndexCommit and noticed that the JavaDoc states that *'Decision that a commit-point should be deleted is taken by the ...
Colin Goodheart-Smithe
Jun 6, 2012 at 11:16 am
Jun 6, 2012 at 11:38 am -
Apologies for the short notice guys, we're meeting up at The Plough in Bloomsbury on Wednesday 6th June. As usual the format is open and there's a healthy mix of experience and backgrounds. Please ...
Richard Marr
Jun 2, 2012 at 11:30 pm
Jun 5, 2012 at 3:42 pm -
Did you find any solution for this. I am looking for similar solution, please let me know if you found any useful info regarding fuzzy phrase search inlucene. Thanks & Regards, Harish B.N. Lead ...
Harish Bn
Jun 1, 2012 at 2:49 pm
Jun 1, 2012 at 4:05 pm -
Hi kjysmu, I moved the discussion to java-user@lucene instead of dev@lucene since your question is not related to Lucene development. http://people.apache.org/~hossman/#java-user To understand how to ...
Adrien Grand
Jun 26, 2012 at 4:25 pm
Jun 26, 2012 at 4:25 pm -
CommonGrams provides a neat trick for optimizing slow phrase queries that contain common words. (E.g. Hathi Trust has some ...
Chris Harris
Jun 22, 2012 at 12:08 am
Jun 22, 2012 at 12:08 am -
I am trying to use this class and add my synonym list in synonyms.properties file. File Content : car auto car machine car automobile But results obtained are only for last synonym specified , i.e ...
Blunderboy
Jun 14, 2012 at 5:41 pm
Jun 14, 2012 at 5:41 pm -
Hi, In CarmelTopKTermPruningPolicy class, the threshold is calculated as follows: *float threshold = docs[k - 1].score - scoreDelta;* docs[k - 1].score corresponds to z_t in the original paper ...
Zeynep P.
Jun 12, 2012 at 2:57 pm
Jun 12, 2012 at 2:57 pm -
Lets suppose that we make a query with multiple terms. Lucene creates a topScoreDocsCollector with an Inorder traversal of posting lists. Lets suppose we are in a specific segment, since we use a ...
Apostolis Xekoukoulotakis
Jun 12, 2012 at 10:43 am
Jun 12, 2012 at 10:43 am -
Hi, Best Buy is building new Search Platform/Eco-System powered by Lucene/Solr. We are hiring multiple Lucene/Solr engineers, tech leads, and architects, both full-time and consulting based in ...
SV
Jun 7, 2012 at 12:11 pm
Jun 7, 2012 at 12:11 pm -
you can use aggregation for that. dump a collection of prices as a field with multiple values into a document //pseudo-code def doc = new Document(...) doc.add new Field( 'id', id ) doc.add new ...
Konstantyn Smirnov
Jun 6, 2012 at 8:07 am
Jun 6, 2012 at 8:07 am -
Hi, We are hiring multiple Lucene/Solr engineers, tech leads, architects based in Minneapolis - both full time and consulting for developing new search platform. Please reach out to me - ...
SV
Jun 6, 2012 at 4:52 am
Jun 6, 2012 at 4:52 am -
Hi, We are hiring multiple Lucene/Solr engineers, tech leads, architects based in Minneapolis - both full time and consulting for developing new search platform. Please reach out to me - ...
SV
Jun 1, 2012 at 9:07 pm
Jun 1, 2012 at 9:07 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 48 |
posts | 203 |
users | 67 |
website | lucene.apache.org |
67 users for June 2012
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)