Search Discussions
-
Hello! I need to perform an experiment of loading the entire index in RAM and seeing how the search performance changes. My index has TermVectors with payload and position info, StoredFields, and ...
Igor Shalyminov
Oct 9, 2013 at 10:18 pm
Oct 25, 2013 at 1:59 pm -
Hello, I've observed that when using PostingsHighlighter in Lucene 4.4 that some of the responsive documents in TopDocs will have zero matches in the associated array of Passage objects. I.e., in the ...
Jon Stewart
Oct 14, 2013 at 9:12 pm
Oct 15, 2013 at 3:05 pm -
Hi, I am running solr 4.4 & one of my collections seems to have a corrupted index... I tried doing - java -cp lucene-core-4.4.0.jar -ea:org.apache.lucene... org.apache.lucene.index.CheckIndex ...
Chris
Oct 23, 2013 at 1:01 pm
Oct 24, 2013 at 2:44 am -
I have created strings like the below &&searchtext +sampletext and when I try to search the following using *&&** or *+** it does not give any result. I am using QueryParser.escape(String s) method ...
Saisantoshi
Oct 20, 2013 at 4:47 pm
Oct 21, 2013 at 1:00 am -
I've been running some tests comparing storing large fields (documents, say 100K .. 10M) as files vs. storing them in Lucene as stored fields. Initial results seem to indicate storing them externally ...
Michael Sokolov
Oct 11, 2013 at 5:03 pm
Oct 18, 2013 at 12:01 pm -
Consider a Lucene index consisting of 10m documents with a total disk footprint of 3G. Consider an application that treats this index as read-only, and runs very complex queries over it. Queries with ...
Benson Margulies
Oct 8, 2013 at 9:46 pm
Oct 10, 2013 at 6:52 pm -
UNOFFICIAL Hi everyone, I have a question about how to retrieve the values in a NumericDocValuesField. I understand how to do this in situations where you have an AtomicReaderContext available ...
Stephen GRAY
Oct 23, 2013 at 7:30 am
Oct 24, 2013 at 11:21 pm -
Hi there, I'm looking for pointers, suggestions on how to approach this in Lucene 4.5. Say I am creating an index using a sequence of addDocument() calls and end up with segments that each contain ...
Arvind Kalyan
Oct 23, 2013 at 5:16 pm
Oct 24, 2013 at 9:09 am -
You have to call updateDocument with the unique key of the document to update. The unique key must be a separate, indexed, not necessarily stored key. addDocument just adds a new instance of the ...
Uwe Schindler
Oct 1, 2013 at 7:11 am
Oct 2, 2013 at 2:36 pm -
Hi, I rewrote lemmatizer project LemmaGen (http://lemmatise.ijs.si/) to java. Originally it's written in C#. Lemmagen project uses rules to lemmatize word. Algorithm is described here ...
Michal Hlavac
Oct 23, 2013 at 3:18 pm
Dec 2, 2013 at 2:34 pm -
Hi, I have problem with doing wild card search on file path fields. i have a field "filePath" where i store complete path of files. i have used StringField to store the field ("i assume by default ...
Nischal reddy
Oct 14, 2013 at 8:13 am
Oct 14, 2013 at 7:41 pm -
Hi, I have implemented Lucene to search for a single keyword across multiple fields and it works great. I did this by concatenating all the fields into a "contents" field and searching against this ...
Raghavendra K Rao
Oct 10, 2013 at 3:55 pm
Oct 13, 2013 at 10:34 pm -
Hi, How to implement synonym Search for All languages.. As far as i know,Wordnet has only English Support..Is there any other we can use to get support for all languages. -- Thanks and Regards ...
VIGNESH S
Oct 7, 2013 at 9:34 am
Oct 9, 2013 at 2:07 pm -
Hello, We would like to index some documents. Each field of a document may have multiple values. And for each (field,value) pair there are some associated values. These associated values are just for ...
Alice Wong
Oct 2, 2013 at 11:12 pm
Oct 4, 2013 at 3:09 pm -
Hi, I am using the regular QueryParser to form a PhraseQuery. It works fine, but when it consists of a hyphen, it gets removed, hence resulting in unexpected results. Note: I am NOT using the ...
Raghavendra K Rao
Oct 14, 2013 at 10:15 pm
Oct 17, 2013 at 10:08 pm -
Hi! I'm trying to look at index contents by setting SimpleTextCodec into IndexWriterConfig. The problem is, after indexing there are only .cfs, .cfe, .si, segments.gen and segments_h files, and the ...
Igor Shalyminov
Oct 15, 2013 at 1:45 pm
Oct 15, 2013 at 4:11 pm -
In our search application, queries like test && usage do not return correct results but test AND usage works fine. So queries with "&&" doesn't work but "AND" does. We are using default queryparser ...
Devi pulaparti
Oct 10, 2013 at 8:28 am
Oct 10, 2013 at 10:47 am -
Dear, I have indexing integer field like this ----- Document doc = new Document(); FieldType fieldType = new FieldType(); fieldType.setIndexed(true); fieldType.setStored(true) ...
장용석
Oct 7, 2013 at 3:24 pm
Oct 7, 2013 at 9:22 pm -
Hi, I am Trying to do Multiphrase Query in Lucene 4.3. It is working Perfect for all scenarios except the below scenario. When I try to Search for a phrase which is preceded by any punctuation,it is ...
VIGNESH S
Oct 3, 2013 at 2:08 pm
Oct 3, 2013 at 4:07 pm -
Are there any best practices for constructing Filters to search efficiently? From my non-exhaustive experiments I cannot intuit how to construct my filters to achieve best performance. I have an ...
James Clarke
Oct 10, 2013 at 6:02 pm
Oct 17, 2013 at 9:06 am -
Hi, I have a search application using Lucene 4.4.0 with various BinaryDocValues and SortedSetDocValues. We use MMapDirectory to help keep the Java heap small / GC pause times short and instead rely ...
Steven Schlansker
Oct 2, 2013 at 6:12 pm
Oct 2, 2013 at 10:42 pm -
Hi Benson, forceMerge[1], formerly known as optimize, takes a parameter to configure how many segments should remain in the index. Regarding multi-core usage, if your query load is high enough to use ...
Adrien Grand
Oct 1, 2013 at 11:10 am
Oct 1, 2013 at 8:07 pm -
Hi there, If I put Lucene segments on tmpfs and use MMapDirectory to access them, would the kernel be so dumb to load the files from tmpfs to another copy of file system cache before map it to the ...
Reg
Oct 22, 2013 at 6:27 am
Oct 22, 2013 at 7:28 am -
Hello, I am quite confused about the Lucene NRT feature. And there are not many examples out there. My understanding is we can create an DirectoryReader from a IndexWriter. Whenever IndexWriter ...
Alice Wong
Oct 17, 2013 at 5:12 am
Oct 17, 2013 at 6:01 pm -
Hi All, I'm trying to create an index with payloads using Lucene 4.5 but the payloads don't seem to be getting into the index. I looked into the code and it looks like storePayloads argument to ...
Kyle Judson
Oct 15, 2013 at 12:14 pm
Oct 15, 2013 at 1:37 pm -
Team, We have our app using lucene 4.1. Docs keep getting indexed and we close the index by 00.00 hrs every day and open a new one for the next calendar day, however in case of an abrupt ...
Ramprakash Ramamoorthy
Oct 3, 2013 at 12:40 pm
Oct 3, 2013 at 1:31 pm -
Hello, I am trying to weigh some ideas for implementing paged search functionality in our system, which has these basic requirements: - Using Solr is not an option (at the moment). - Any Lucene 4.x ...
Vitaly Funstein
Oct 11, 2013 at 2:01 am
Nov 23, 2013 at 12:10 am -
Hello, I'm new to lucene and I am having some trouble figuring out the right way to use a SearcherTaxonomyManager for NRT faceted search. Assuming I set up the STM with a reopen thread: // Index ...
Joe Eckard
Oct 26, 2013 at 3:45 pm
Oct 26, 2013 at 5:36 pm -
Hi, In my current implementation of Lucene 4.3 where there are millions of indexed records, I do a regular search() and get the topDocs.totalHits as the count of results. As part of this, I store all ...
Raghavendra K Rao
Oct 17, 2013 at 10:17 pm
Oct 18, 2013 at 12:49 am -
Hello, Have a question about default PhraseQuery boost processing. The Query.setBoost() <http://lucene.apache.org/core/4_4_0/core/org/apache/lucene/search/Query.html#setBoost(float) says: / Sets the ...
Denis.zhdanov
Oct 16, 2013 at 6:04 am
Oct 17, 2013 at 11:28 am -
Hi Team, I have one requirement where i have to display sentences of valid document if the keyword(input string) is found in that document. I am thinking if parent-child relation will work? DocBean ...
Avni Sompura
Oct 17, 2013 at 7:40 am
Oct 17, 2013 at 9:02 am -
Hi all, I'd like to know a bit more in detail about what is happening behind the scenes in case of prepareCommit vs Commit. Also, I read that if a indexing request comes while commit is requested, it ...
Phani Chaitanya
Oct 16, 2013 at 4:27 pm
Oct 16, 2013 at 6:18 pm -
Can anyone tell me if a search based on a ConstantScoreQuery should return the results in the order that the documents were added to the index? I am not sorting the results and this works for one ...
Darren Hoffman
Oct 15, 2013 at 1:43 am
Oct 15, 2013 at 1:49 am -
UNOFFICIAL Hi everyone, I'd appreciate some help with a problem I'm having. I have a collection of documents in my index. Each doc contains an IntField with a value in it. What I want is to find out ...
Stephen GRAY
Oct 14, 2013 at 1:04 am
Oct 14, 2013 at 11:12 am -
Using Lucene 3. I know Lucene supports multiple occurrences of a field, and if one searches on that field, all fields are checked for hits. One question I have is if there is a performance difference ...
Earl Hood
Oct 8, 2013 at 2:04 am
Oct 11, 2013 at 11:47 am -
Is there some advice around about when it's appropriate to create an Analyzer class, as opposed to just Tokenizer and TokenFilter classes? The advantage of the constituent elements is that they allow ...
Benson Margulies
Oct 8, 2013 at 2:31 pm
Oct 8, 2013 at 4:05 pm -
Is there an optimal way to access many document TermVectors (in the same chunk) consecutively when using the LZ4 termvector compression? I'm curious to know whether all TermVectors in a single ...
Rose, Stuart J
Oct 7, 2013 at 7:31 pm
Oct 8, 2013 at 7:51 am -
I'm not aware of a lucene rather than Solr or whatever tutorial. A search for something like "lucene sharding" will get hits. Why don't you want to use Solr or Katta or similar? They've already done ...
Ian Lea
Oct 1, 2013 at 10:16 am
Oct 1, 2013 at 11:34 am -
Hi, Currently we merge 2 indexes using iw.addIndexes(idxReaders), where the same call will be made in batches of 10 readers Our requirement is to make this addIndex call consistent. That is, during ...
Ravikumar Govindarajan
Oct 25, 2013 at 5:05 pm
Oct 25, 2013 at 5:05 pm -
October 2013, Apache Lucene™ 4.5.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.5.1 Apache Lucene is a high-performance, full-featured text search engine library ...
Mark Miller
Oct 24, 2013 at 4:15 pm
Oct 24, 2013 at 4:15 pm -
Hi folks, If anyone reading this is interested in how to use the spatial module in Lucene, you might be interested in a recent two-part blog post by Steven Citron-Pousty on the OpenShift blog ...
Smiley, David W.
Oct 23, 2013 at 5:33 pm
Oct 23, 2013 at 5:33 pm -
Hello, My lucene index contains 46 segments with a total of 4M docs. Lately, while running queries I started getting seldom exceptions from this index: java.lang.ArrayIndexOutOfBoundsException at ...
Manuel Le Normand
Oct 22, 2013 at 11:37 am
Oct 22, 2013 at 11:37 am -
Posted something similar some time ago, but didn't get any responses, so I thought I'd try again with more details. We allow end-user queries that have our own proprietary query language, which we ...
Marcos Juarez Lopez
Oct 16, 2013 at 5:11 pm
Oct 16, 2013 at 5:11 pm -
Hello! I'm trying to realize how I can improve search performance for my task. The index is as follows: - 29 segments, each of about 600 MB; - in the complete setup, there's a thread for each segment ...
Igor Shalyminov
Oct 14, 2013 at 4:16 pm
Oct 14, 2013 at 4:16 pm -
Hi, If some one removes some of the segments in my Lucene Index from file system,How to find out which documents are deleted. -- Thanks and Regards Vignesh Srinivasan 9739135640
VIGNESH S
Oct 11, 2013 at 11:20 am
Oct 11, 2013 at 11:20 am -
October 2013, Apache Lucene™ 4.5 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.5 Apache Lucene is a high-performance, full-featured text search engine library written ...
Adrien Grand
Oct 5, 2013 at 9:46 am
Oct 5, 2013 at 9:46 am -
Hello! I need to access token position and payload info during the search result page building. I need to do this for 10 documents max, so retrieving TermVectors is totally OK for me. Say, I retrieve ...
Igor Shalyminov
Oct 4, 2013 at 6:14 pm
Oct 4, 2013 at 6:14 pm -
Hi folks, Here's a small tool that I'm releasing as open source. I hope some of you might find it useful: https://github.com/joelb-git/lqt Lucene Query Tool (lqt) is a small utility for executing and ...
Joel Barry
Oct 3, 2013 at 6:36 pm
Oct 3, 2013 at 6:36 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 48 |
posts | 220 |
users | 53 |
website | lucene.apache.org |
53 users for October 2013
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)