Search Discussions
-
Hi, If I want to improve performance, which of the following is better and why? 1. Buy a machine with a lot of RAM and use a RAMDirectory for the index. 2. Put the index on a solid state drive. By ...
Dragon Fly
Jul 18, 2012 at 3:51 pm
Jul 27, 2012 at 10:52 pm -
Is it possible to delete by docId in lucene 4? I can delete by docid in lucene 3 using IndexReader.deleteDocument(int docId), but that method is gone in lucene 4, and IndexWriter only allows deleting ...
Sean Bridges
Jul 12, 2012 at 1:10 am
Jul 13, 2012 at 4:40 am -
I created an index using Lucene 3.6.0 in which I specified that a certain text field in each document should be indexed, stored, analyzed with no norms, with term vectors, offsets and positions ...
Mike O'Leary
Jul 19, 2012 at 11:17 pm
Aug 24, 2012 at 6:04 pm -
Hi, I have the following situation: I have two pretty large indices. One consists of about 1 billion documents (takes ~6GB on disk) and the other has about 2 billion documents (~10GB on disk). The ...
Doron Yaacoby
Jul 15, 2012 at 8:41 am
Jul 19, 2012 at 6:25 am -
Dear list, I'm considering to use Lucene for indexing sequences of part-of-speech (POS) tags instead of words; for those who don't know, POS tags are linguistically motivated labels that are assigned ...
Carsten Schnober
Jul 30, 2012 at 1:07 pm
Aug 7, 2012 at 11:17 am -
Hi, I have following dataset indexed in Lucene. 2010-04-21 02:24:01 GET /blank 200 120 2010-04-21 02:24:01 GET /US/registrationFrame 200 605 2010-04-21 02:24:02 GET /US/kids/boys 200 785 2010-04-21 ...
Deepak Shakya
Jul 22, 2012 at 3:18 pm
Jul 26, 2012 at 1:37 pm -
I see some behavior at the moment when I'm flushing and would like to know if I can change that. One main thread is inserting, when it flushes, it blocks. During that time my main thread is ...
Simon McDuff
Jul 19, 2012 at 4:55 pm
Jul 21, 2012 at 7:33 am -
I want store some data like 00:00:00.123,it's string,i want print it as date .because i want sort it as date's increase. -- View this message in context ...
Sam
Jul 11, 2012 at 8:32 am
Jul 16, 2012 at 8:42 am -
I have used standardAnalyser to save the ANALYZED data in index. Data is as below:- 1. foo bag test 2. foo bar test 3. bar india foo I used When i search using--------------- foo ba I get all results ...
Hiren Shah
Jul 4, 2012 at 6:11 am
Jul 4, 2012 at 10:05 pm -
Hi all in my app (Lucene 3.5.0 powered) I index the documents (not too many, say up to 100k) using the RAMDirectory. Then I need to send the segment over the network to be merged with the existing ...
Konstantyn Smirnov
Jul 11, 2012 at 10:18 am
May 21, 2013 at 1:17 pm -
Hello, I'm using ElisionFilter to index french text. The filter works but ignore the d letter followed by an apostrophe (example: d'une). Is-it an expected behaviour or is it an issue ? Regards, ...
Yamo93
Jul 25, 2012 at 1:02 pm
Jul 26, 2012 at 1:56 pm -
Hi, I am using 3.4.0 and just discovered a weird issue. I have a set of simple English one-word queries and two target files that I want to search. One has all these queries in one line, i.e ...
Ilya Zavorin
Jul 13, 2012 at 4:28 pm
Jul 13, 2012 at 6:43 pm -
I want seach a txt ,which is store like <2012-07-06 11:11:43 some message at one line .How to seach that,I need get the time and content. -- View this message in context ...
Sam
Jul 6, 2012 at 3:16 am
Jul 10, 2012 at 9:12 am -
Hi, I've an index to analyze (manually). Unfortunately, I cannot rebuild the index. Some of the fields are 'unstored'. I was wondering whether there's any way to get the terms from an unstored field ...
Phanindra R
Jul 26, 2012 at 4:57 pm
Jul 27, 2012 at 10:15 pm -
Hello Everyone, We have a legacy system which uses lucene 2.4.1. We have ported a small hack to lucene source code back then, so that the underlying lucene segment merger code wouldn't reuse deleted ...
Snehal.chennuru
Jul 22, 2012 at 11:17 pm
Jul 25, 2012 at 10:27 pm -
Hi, My log files are showing the below exceptions almost at twice a minute frequency. what is causing it and how can i fix it? I am not using lucene directly but instead using elasticsearch (0.18.7 ...
T Vinod Gupta
Jul 8, 2012 at 7:58 pm
Jul 10, 2012 at 10:11 am -
Hi all, I have an application which holds a list of documents. These documents are indexed using Lucene. I can search on keywords of the documents. I loop the TopDocs and get the ID field (of each ...
Jochen Hebbrecht
Jul 3, 2012 at 7:56 am
Jul 3, 2012 at 10:35 am -
If I want to set up a database that is totally flat with no joins, is there any reason not to use lucene. The reasons I would be curious about are things like insert performance and whether there are ...
Hank Williams
Jul 26, 2012 at 5:17 pm
Aug 1, 2012 at 11:13 am -
Hi, I'm trying to create a phrase query with wildcard, from the forums it seems that the solution is not trivial. I'm trying to create the following queries: "this is a phrase*" OR "*This is a ...
Levin, Ilya
Jul 22, 2012 at 8:30 am
Jul 23, 2012 at 1:21 pm -
I also posted this to StackOverflow, apologies if you see this twice. I have a data set whereby documents are associated to a user id. Say that the documents represent books, and each book can have ...
Uncle
Jul 22, 2012 at 3:07 pm
Jul 23, 2012 at 12:01 pm -
Dear developers, while upgrading from 3.6.x to 4.x I have to rewrite some of my code and search for the new methods and/or classes. In 3.6.x and older versions the API Javadoc interface had an ...
Bernd Fehling
Jul 18, 2012 at 11:48 am
Jul 19, 2012 at 7:21 am -
Hello all, I have more than one record having same time stamp. When i sort by date time in decending order, the set of records which have same time stamp are displayed in the order of insertion ...
Ganesh
Jul 17, 2012 at 10:54 am
Jul 18, 2012 at 5:12 pm -
We have an application that has to do "real time" indexing of a number of documents. What it does is wake up about every 20 seconds and updates the index with any changes that have been queued since ...
Scott Smith
Jul 16, 2012 at 8:30 pm
Jul 18, 2012 at 4:55 pm -
Hi Folks, I need your help in determining whether we upgrade to 3.6 or wait till 4.0 becomes stable. We are currently using 2.x version of lucene and would like to upgrade to 3.6(and 4.0 ...
Nilesh Vijaywargiay
Jul 17, 2012 at 9:14 pm
Jul 18, 2012 at 9:30 am -
Hi all, Imagine you have the following books which are indexed using Lucene book1 - title: "foo bar test" book2 - title: "foo barrr test" book3 - title: "foo bar bar" I want to find book1 and book2 ...
Jochen Hebbrecht
Jul 3, 2012 at 3:59 pm
Jul 5, 2012 at 1:03 pm -
Given that the Alpha is out, are there any more or less definitive guidelines on when 4.0 will be officially released? Hopefully, someone probably has a ballpark estimate on this, give or take a few ...
Vitaly Funstein
Jul 31, 2012 at 3:11 am
Sep 14, 2012 at 11:14 pm -
I have one more question to pose to the group today: I have several thousand searches being performed against MemoryIndexes on a regular basis. I'd like the ability for each search to choose it's own ...
Dave Seltzer
Jul 12, 2012 at 8:14 pm
Aug 17, 2012 at 9:39 pm -
Hi All I created a Lucene index for over 3 million document, and I used term vectors to create the index.now for an external document I need to use Lucene to find the most similar documents from the ...
Elshaimaa Ali
Jul 30, 2012 at 2:00 pm
Jul 30, 2012 at 7:09 pm -
i have requirement that have to search multiple words in paragraph and in line depending upon the situation. how can i do this in lucene. im using Lucene2.9. -- View this message in context ...
Neerajshah84
Jul 25, 2012 at 5:21 am
Jul 27, 2012 at 4:48 am -
how can i put multiplue proximity search in lucene?? example :- i want to search like "fear,dark" in two words and dark adjucent to "tight" and tight,free in 3 words. Query :- fear2dark tight3free ...
Neerajshah84
Jul 25, 2012 at 5:14 am
Jul 26, 2012 at 2:37 pm -
This probably belongs in the JIRA, and is related to https://issues.apache.org/jira/browse/LUCENE-4025, but java.util.Lock.lock() doesn't throw anything. I believe the author of the change originally ...
Vitaly Funstein
Jul 21, 2012 at 3:27 am
Jul 22, 2012 at 1:41 am -
HI buddy, In gmail,there are many accounts,how google manage to search individual email without the risk of search other accounts email?If there are *huge* account,small index may knock down the ...
齐保元
Jul 18, 2012 at 8:57 am
Jul 18, 2012 at 4:55 pm -
Hello All! I'm trying to run a FilteredQuery over a BooleanQuery full of term and range queries. In my custom filter, in the getDocSet method, the indexReader contains the entire index segment. My ...
James Nolan
Jul 13, 2012 at 5:26 pm
Jul 17, 2012 at 4:36 pm -
Dear list, I am working on a search application that depends on retrieving offsets for each match. Currently (in Lucene 3.6), this seems to be overly costly, at least in my solution that looks like ...
Carsten Schnober
Jul 13, 2012 at 12:30 pm
Jul 17, 2012 at 1:37 pm -
Hello, I have recently run into the situation when there was not a sufficient amount of direct memory available for IndexWriter to work. This was essentially caused by the embedding application ...
Vitaly Funstein
Jul 10, 2012 at 3:00 am
Jul 13, 2012 at 7:57 pm -
Hello all, I am currently using v3.0.3 and planning to upgrade to v3.6. Shall i go ahead with the upgrade OR wait for 4.0? Regards Ganesh ...
Ganesh
Jul 10, 2012 at 4:29 am
Jul 10, 2012 at 7:37 am -
Hello, we have a small internet shop which uses lucene for product search. With increasing traffic we have continuos problem with literaly hundreds of threads being BLOCKED in lucene code: here is an ...
Leon Rosenberg
Jul 6, 2012 at 7:29 pm
Jul 9, 2012 at 7:23 pm -
Thanks Mike for the wonderful work on ToParentBlockJoinQuery. We had a use case for Relational data search and are working with ToParentBlockJoinQuery which works perfectly as mentioned @ ...
Jayendra Patil
Jul 25, 2012 at 11:38 am
Aug 3, 2012 at 12:26 pm -
Hi Team, I would like to know if Lucene 4.0 will support facetted search. Thanks in advance. Best regards, Ranjith... [cid:image001.gif@01CD6B26.B4C4D460]Ranjith Ratna Kumar S / Capgemini India / ...
Subramanian, Ranjith
Jul 26, 2012 at 4:21 pm
Jul 27, 2012 at 10:43 pm -
hi, I have two collections:the first collection has documents like 'docID,content', and the other collection has documents like 'docID,ratings'.Is there any fast algorithm to get the intersection ...
齐保元
Jul 20, 2012 at 9:59 am
Jul 21, 2012 at 3:29 am -
I'm using Lucene 4.0. I'm inserting around 300 000 documents / seconds. We do not have any store fields. But we noticed that .fdt get populated even so. .fdx contains useless informations. .fdt ...
Simon McDuff
Jul 19, 2012 at 12:27 pm
Jul 19, 2012 at 3:37 pm -
Hi, I'm using Lucene 4.0. I would like to index String, but since my system required High volume I need to reuse always the same memory. No question to use String. My process receives bytes and I can ...
Simon McDuff
Jul 18, 2012 at 2:35 pm
Jul 18, 2012 at 4:10 pm -
Is there a way to know which sub-clause of a boolean query matched in the result document ? Currently I am using searcher.explain() on each of the sub-clause of the boolean query (on each of the ...
Ashish Jaen
Jul 18, 2012 at 11:18 am
Jul 18, 2012 at 2:23 pm -
Hi,buddy: I have a problem concerning index readers:there are many small index/searcher instances in my application which are hold by a map.when new index request or seach request comes,I process ...
齐保元
Jul 16, 2012 at 6:58 am
Jul 16, 2012 at 9:32 am -
Hi all, I'd like to know if I can get the list of indexed terms in a document from its document ID in constant time (say, in a time independent of the size of the index). The reason why I ask might ...
Giovanni Gherdovich
Jul 15, 2012 at 3:57 pm
Jul 15, 2012 at 4:08 pm -
Hello, I have a search project which uses the Lucene PatternAnalyzer for its text/query analysis. At the moment it's configured like so: analyzer = new PatternAnalyzer(Version.LUCENE_35, ...
Dave Seltzer
Jul 12, 2012 at 6:21 pm
Jul 13, 2012 at 1:56 pm -
Hi all, I create a method above que generate my boolean query based in many parameters. The query's on not analyzed fields works perfect in debug. When start a search using any analyzed field with ...
Marcelo Neves
Jul 11, 2012 at 3:33 pm
Jul 12, 2012 at 6:39 pm -
Hi Everyone I am busying attempting to integrate Lucene 4.0 Alpha into my code base. I have a custom QueryParser that extends QueryParser and overrides newRangeQuery and newTermQuery to support typed ...
Jamie
Jul 11, 2012 at 1:32 pm
Jul 11, 2012 at 1:53 pm -
I'm running 8 index searchers java processes on a 8-core node. They all read from the same lucene index on local hard drive. the index contains about 20million docs, each doc is a small record with ...
Yang
Jul 11, 2012 at 3:04 am
Jul 11, 2012 at 1:25 pm -
I have a question about the API for storing and indexing lucene documents (in 3.x). If I want to index a document by providing a TokenStream, I can do that by calling document.add (field) where field ...
Michael Sokolov
Jul 11, 2012 at 12:54 am
Jul 11, 2012 at 10:38 am
Group Overview
group | java-user |
categories | lucene |
discussions | 72 |
posts | 302 |
users | 94 |
website | lucene.apache.org |
94 users for July 2012
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)