Search Discussions
-
Hi, I use lucene 3.4.0 in a search project,but encounter a problem and i don't know how to resolve. I index and it run well,but one week or two(it appear two times,first run one week,second two),it ...
Superruiye
Jan 7, 2012 at 4:50 pm
Feb 17, 2012 at 1:43 pm -
Hi! I have a Solr-constructed index, which I read with this code: Directory directory = FSDirectory.open(file); IndexReader reader = IndexReader.open(directory, true); IndexSearcher searcher = new ...
Michael Kazekin
Jan 27, 2012 at 3:39 pm
Feb 1, 2012 at 7:43 am -
Hi folks, I have a query result problem I do not understand. The documentation for Lucene 3.2 query syntax says the following about boolean OR queries: "The OR operator links two terms and finds a ...
1983-01-06
Jan 3, 2012 at 3:41 pm
Jan 3, 2012 at 9:26 pm -
I have read a lot about IndexWriter and multi-threading over the Internet. It seems to me that the normal practice is: 1) use a same indexwriter instance for multiple threads; 2) create an individual ...
Cheng
Jan 11, 2012 at 5:19 pm
Jan 11, 2012 at 8:54 pm -
Hello, we use lucene as search engine in an online shop. The products in this shop often contain product keys like CRXUSB2.0-16GB. We would like our customers to be able to find products by entering ...
Christoph Kaser
Jan 3, 2012 at 8:45 am
Jan 3, 2012 at 3:06 pm -
Hi friends, Any one meet ArrayIndexOutOfBoundsException: -65536 described in https://issues.apache.org/jira/browse/LUCENE-1995 after it declared being fixed? My lucene version is 3.0.3 and ...
Duke DAI
Jan 16, 2012 at 12:22 am
Oct 15, 2014 at 9:55 pm -
Hi, I new a RAMDirectory based upon a FSDirectory. After a few modifications, I would like to synchronize the two. Some on the mailing list provided a solution that uses addIndex() function. However, ...
Cheng
Jan 9, 2012 at 4:05 am
Jan 13, 2012 at 12:28 am -
I have a collection of 50 million documents and I hit the SIGSEGV error. For every 10000 documents I perform commit. The logs and the question has been posted to SO here: http://bit.ly/xyZUEG where I ...
Frank Moss
Jan 11, 2012 at 8:29 am
Jan 11, 2012 at 9:53 am -
Hi, I recently switched an experimental project from Lucene 3.5 to 4.0 from 6th Dec 2011 and my indexing time increased by nearly 20% on my local machine*. It seems to me that two simple ...
Peter K
Jan 3, 2012 at 4:57 pm
Jan 7, 2012 at 1:39 pm -
I have a requirement where reads and writes are quite high ( @ 100-500 per-sec ). A document has the following fields : timestamp, unique-docid, content-text, keyword. Average content-text length is ...
Prasenjit mukherjee
Jan 4, 2012 at 5:18 am
Jan 6, 2012 at 1:41 am -
Hi how can we assign custom score for each token/word. For Ex I have document 1 pqrst uvwx abcd 2 abcd pqrst uvwx 3 pqrst uvwx lmn 4 pqrst uvwx lmn abcd 5 pqrst abcd uvwx lmn *Now i m searching data ...
A Z
Jan 24, 2012 at 5:11 pm
Feb 6, 2012 at 11:13 am -
All things being equal does a fuzzy match give the same score as an exact match. i.e if I do a search for farmin and it matches two docs one on term farmin, the other on term farming, will it score ...
Paul Taylor
Jan 28, 2012 at 9:33 am
Feb 1, 2012 at 12:47 pm -
Hi, I don't want to filter certain stop words within the StandardAnalyzer? Can I do so? Ideally, I would like to have a customized StandardAnalyzer. Thanks.
Cheng
Jan 28, 2012 at 4:41 am
Jan 30, 2012 at 10:23 am -
Just reading Apache Solr Enterprise Search Server and was interested in pages 152, 153 dismax and DisjunctionMaxQuery and automatic Phrase Boosting. I would like to incorporate this into a standard ...
Paul Taylor
Jan 6, 2012 at 10:53 pm
Jan 26, 2012 at 10:51 am -
I'm hoping to upgrade Lucene on a local code base from 3.0.3 to 3.5.0; is there a good guide out there for particular pitfalls that I should worry about? I've skimmed the ChangeLogs; the mention of ...
David Carlton
Jan 19, 2012 at 7:02 pm
Jan 20, 2012 at 5:15 pm -
HI, Could you please help me with a quick question - Is there a way to restrict lucene/solr fuzzy search to only analyze words that have more than 5 characters and to ignore words with less than that ...
Lance
Jan 19, 2012 at 8:09 pm
Jan 25, 2012 at 10:30 pm -
I am currently using the following statement at the end of each index writing, although I don't know if the writing modifies the indexes or not: is = new IndexSearcher(IndexReader.openIfChanged(ir)); ...
Cheng
Jan 11, 2012 at 10:52 pm
Jan 15, 2012 at 6:08 am -
Hi, I use a same instance of writer for multiple threads. It turns out that the time to finish jobs is more than to create a new writer instance in each thread. What would be the possible reasons? ...
Cheng
Jan 11, 2012 at 1:33 am
Jan 13, 2012 at 5:13 pm -
Hi all, Looking at some older Lucene examples, I noticed for older versions of lucene that IndexReader came with a handy terms() method that would return a listing of all the terms in the index and ...
Stephen Howe
Jan 24, 2012 at 9:10 pm
Nov 16, 2012 at 9:19 am -
In Lucene, 3.4 I recently implemented "Translating PhraseQuery to SpanNearQuery" (see Lucene in Action, page 220) because I wanted _order_ to matter. Here is my exact code called from getFieldsQuery ...
Paul Allan Hill
Jan 31, 2012 at 8:48 pm
Feb 1, 2012 at 9:45 pm -
Hi Everyone I have a problem where I need to compare two indexed fields as part of a query. For instance: modified_date[1970 to 2012] AND NOT deleted_date modified_date how would one implement this ...
Jamie
Jan 23, 2012 at 10:28 am
Jan 23, 2012 at 10:23 pm -
I am trying to perform a "translation" of sorts of a stream of text. More specifically, I need to tokenize the input stream, look up every term in a specialized dictionary and output the ...
Ilya Zavorin
Jan 13, 2012 at 4:45 pm
Jan 16, 2012 at 10:09 pm -
Just curious about that. Any thoughts? Thanks
Cheng
Jan 13, 2012 at 12:50 am
Jan 16, 2012 at 2:12 pm -
Hi, my name is Reyna Melara I'm a PhD student form Mexico, and I have a set of 11,051,447 files with txt extension but the content of each file is in fact in wiki format, I want and I need them to be ...
Reyna Melara
Jan 11, 2012 at 7:13 pm
Jan 12, 2012 at 3:43 am -
Happy new year! I'm working on a way to simple geocode documents as they are indexed. I'm hoping to use existing Lucene infrastructure to do this as much as possible. My plan is to build an index of ...
Ryan McKinley
Jan 3, 2012 at 9:30 pm
Jan 4, 2012 at 9:18 am -
I'm working on providing advanced searching for annotated Medical Documents (using UIMA). In the context of an annotated document, I identify relevant medical terms, as well as the negation of ...
Ktjex
Jan 30, 2012 at 10:25 pm
Feb 7, 2012 at 10:54 am -
Hi, I’m using lucene on Hebrew MySql tables. I used ngram (1-15 gram sizes) in my name analyzer and the only thing that doesn’t work for me is when I try to use ‘%’ in my parsing string (didn’t find ...
Gal Mainzer
Jan 31, 2012 at 5:32 pm
Feb 1, 2012 at 9:30 am -
Hello, I’m having a bit of trouble Googling this, so I’m hoping someone can point me in the right direction. We have a system which generates blocks of text which need to be searched as they come in. ...
Dave Seltzer
Jan 31, 2012 at 3:50 pm
Jan 31, 2012 at 10:06 pm -
Is there any difference, from a performance standpoint (or any other standpoint whatsoever), between instantiating a query using QueryParser and BooleanQuery? Is either of them preferable to use? Eg: ...
Felipe Carvalho
Jan 30, 2012 at 9:55 pm
Jan 30, 2012 at 11:23 pm -
After reading all about the renaming of optimize() and updating my Lucene libraries to 3.4, I was surprised and confused by what I found. I have a 1 segment index (all files are named _1*.*) that had ...
Paul Allan Hill
Jan 27, 2012 at 11:19 pm
Jan 28, 2012 at 9:13 am -
My analyser strips out accents as often these are not entered correctly, so assume there are two documents in the database with default field containing República Republica a search for ...
Paul Taylor
Jan 10, 2012 at 9:13 am
Jan 27, 2012 at 4:25 pm -
Hi all, After much code and forum searching, I've hit a frustrating point that should be more obvious. I've trolled through a ton of postings and messaging on keyword counting and it seems like all ...
David Olson
Jan 25, 2012 at 11:36 pm
Jan 26, 2012 at 2:31 pm -
It seems that it is not possible to have multiple document types defined in a single solr schema.xml file. If, in fact, this is not possible, then, what is the recommended app server deployment ...
Frank DeRose
Jan 25, 2012 at 9:49 pm
Jan 26, 2012 at 7:59 am -
I'm having a set of issues in trying to use Lucene that are all connected to the difficulty of retrieving offsets. I need some advice on how best to proceed, or a pointer if this has been answered ...
Nishad Prakash
Jan 14, 2012 at 2:33 am
Jan 20, 2012 at 3:40 am -
I saw the link, https://builds.apache.org/job/Lucene-3.x/javadoc/contrib-misc/org/apache/lucene/index/NRTManagerReopenThread.html, which talks about how to use the NRTManagerReopenThread. I am ...
Cheng
Jan 15, 2012 at 6:18 pm
Jan 16, 2012 at 5:40 am -
Hi list, We have two different document types with different fields each. My problem is given one document (Doc) from type1, find similar ones of type2. Initially I thought two strategies to do it: - ...
Pedro Lacerda
Jan 26, 2012 at 4:35 pm
Feb 1, 2012 at 10:05 am -
hi all, short of it: i want "queen bohemian rhapsody" to return that song named "Bohemian Rhapsody" by the artist named "Queen", rather than songs with titles like "Bohemian Rhapsody (Queen Cover)". ...
Johnny Marnell
Jan 15, 2012 at 6:20 am
Jan 31, 2012 at 11:13 pm -
Goofing off with my index, I ran across this example http://www.lucidimagination.com/blog/2009/05/26/accessing-words-around-a-positional-match-in-lucene/ for using span queries to see what else is ...
Stephen Howe
Jan 24, 2012 at 11:38 pm
Jan 25, 2012 at 11:29 pm -
Hi, I am using multiple writer instances in a web service. Some instances are busy all the time, while some aren't. I wonder how to configure the writer to dissolve itself after a certain time of ...
Cheng
Jan 25, 2012 at 10:02 pm
Jan 25, 2012 at 10:21 pm -
Hi, I'm trying to select city names in a way that goes easy on the spelling mistakes with the most accurate match first. My index for the city name field is tokenized. Let's say I'm looking for Rio ...
Mschipperheyn
Jan 21, 2012 at 6:29 pm
Jan 21, 2012 at 9:30 pm -
Hi, can any of you provide a working code example that utilizes the NRTManager, NRTManagerReopenThread and ExecutorServices instances? The limited availability of information regarding these classes ...
Cheng
Jan 18, 2012 at 5:46 pm
Jan 20, 2012 at 6:04 pm -
Hello, I am having problems opening a lucene index. The index has been created on the same machine. The size of index is 44G. Its a 64bit machine running OpenSuse. I have tried starting the java ...
Frank Moss
Jan 17, 2012 at 11:06 am
Jan 17, 2012 at 11:27 am -
Hi The "Documentation" link on http://lucene.apache.org/java/docs/index.html expands to list Release 3.4.0, 3.3.0, etc. but not 3.5.0. http://lucene.apache.org/java/3_5_0/ exists and works. -- Ian. ...
Ian Lea
Jan 9, 2012 at 10:56 am
Jan 16, 2012 at 12:33 am -
I have 10MM entities, for each of which I will index 10-20 fields. Also, I will have to index 100MM related information of the entities, and each piece of the information will have to go through some ...
Cheng
Jan 13, 2012 at 12:48 am
Jan 13, 2012 at 2:42 pm -
Hi, my servlet application is running a large index of 20G. I don't think it can be loaded to RAM at one time. What are the general strategies to improve the search and write performance? Thanks
Cheng
Jan 8, 2012 at 5:33 am
Jan 8, 2012 at 4:57 pm -
hi, i'm writing a normal web-search application with lucene 3.5.0. in version 3.5.0 lucene provides SearcherManager to manage multithreaded searching. but i don't know how to use this class. should i ...
Chenhan
Jan 7, 2012 at 4:49 pm
Jan 7, 2012 at 5:59 pm -
Hi, I'm using Lucene 2.0 and was wondering how to flush/commit index data to disk. It doesn't look like there is a flush() or commit() method in the 2.0 IndexWriter. Is there a way to flush the data ...
Dragon Fly
Jan 3, 2012 at 1:36 pm
Jan 5, 2012 at 7:45 pm -
Hi, I am experimenting with the Lucene trunk (aka 4.0), especially with the new IndexDocValues feature. I am trying to store some query-independent statistics such as PageRank, etc. One stat that I ...
Hany Azzam
Jan 4, 2012 at 12:15 pm
Jan 4, 2012 at 2:59 pm -
Consider a people index, containing People documents with the following names: Doc 1 [name: "Marcus"] Doc 2 [name: "Markus"] Doc 3 [name: "Mharcus"] Suppose I use an analyzer so that all 3 names have ...
Felipe Carvalho
Jan 30, 2012 at 10:36 pm
Jan 31, 2012 at 9:51 am -
Hi All, I am working on a project to find similar documents for the one being processed by a job. These documents talk about the functional issues so sometimes the description given for the document ...
Saurabh Gokhale
Jan 26, 2012 at 11:42 pm
Jan 26, 2012 at 11:54 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 75 |
posts | 302 |
users | 83 |
website | lucene.apache.org |
83 users for January 2012
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)