Search Discussions
-
Mike, we are definitely hitting something with this one! we had report from our QA chaps that our servers got stuck (limit is on 180 Seconds Request)... We are on average 14 Requsts per second.... ...
Eks dev
Jul 14, 2009 at 10:25 pm
Jul 17, 2009 at 12:16 am -
Hi, We have a very large lucene index that we're developing that has a field of email addresses. (Actually mulitple fields with multiple emails addresses, but I'll simplify here) Each document will ...
Phil Whelan
Jul 30, 2009 at 6:12 pm
Aug 11, 2009 at 11:21 pm -
Hi, I'm trying to use Lucene to query a domain that has the following structure Student 1-------* Attendance *---------1 Course The data in the domain is summarised below Course.name ...
Donal Murtagh
Jul 29, 2009 at 8:42 pm
Jul 30, 2009 at 10:15 pm -
Hi All! Let say I have a filter that produces new tokens based on the original ones. How bad will it be if my filter sets the start of each token to 0 and end to the length of a token? An example ...
OBender
Jul 20, 2009 at 2:41 pm
Jul 20, 2009 at 9:07 pm -
I was trying to do a regex search with the lucene and JavaUtilRegexCapabilities. The code used is : RegexQuery query = new RegexQuery(new Term("contents","(?m)hello.*(\r[^#]*)This is to be ...
Ba3
Jul 26, 2009 at 1:52 pm
Jul 29, 2009 at 5:14 pm -
Hi, I'm just starting to work with Lucene, and I guess that I learn best by working with code, so I've started with the demos in the Lucene distribution. I got the IndexFiles.java and IndexHTML.java ...
Ohaya
Jul 28, 2009 at 12:55 pm
Jul 28, 2009 at 6:13 pm -
hello all We've got 100GB of data which has doc,txt,pdf,ppt,etc.., we've separate parser for each file format, so we're going to index those data by lucene. (since we scared of Nutch setup , thats ...
M.harig
Jul 22, 2009 at 6:07 am
Jul 23, 2009 at 8:26 am -
I want to store in the index a data structure and load it back at search time. Is it safe to serialize the java object store it and load it back later ? Presumably I need to store it binary, right ? ...
MilleBii
Jul 3, 2009 at 7:32 pm
Jul 4, 2009 at 8:52 am -
If I understand lucene correctly, when doing multiple simultaneous searches on the same IndexSearcher, they will basically all do their own index scans and collect results independently. If that's ...
Tsuraan
Jul 22, 2009 at 4:59 pm
Jul 23, 2009 at 1:01 pm -
Lukas Michelbacher
Jul 4, 2009 at 4:37 pm
Jul 14, 2009 at 4:08 pm -
Hi, new here. I recently started using lucene and had encounter a problem.I crawl and index a number of documents. When i perform a search, lets say "tall fat", by right the results that matches all ...
Bourne71
Jul 31, 2009 at 10:51 am
Aug 4, 2009 at 1:36 am -
Is Lucene capable of handling UCS4 data natively? Thanks, Mike --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org ...
Michael Thomsen
Jul 31, 2009 at 2:08 pm
Jul 31, 2009 at 7:25 pm -
Hi! I'm with a error follow: java.lang.NoSuchMethodError: org.apache.lucene.store.FSDirectory.getDirectory(Ljava/io/File;)Lorg/apache/lucene/store/FSDirectory; In the code: File dirIndex = new ...
Fabrício Raphael
Jul 29, 2009 at 2:36 pm
Jul 29, 2009 at 4:36 pm -
Hi, I have an application in which documents are added upon receiving a user request and a background thread is needed to remove old documents. I have an IndexWriter opened on a Directory that adds ...
Chengdu Huang
Jul 28, 2009 at 4:26 am
Jul 29, 2009 at 9:49 am -
Hi, In the documents which contain the volunteer information : Doc1 : volunteer krish volunteer john volunteer Raj ... Doc2 : volunteer krish volunteer Raj volunteer Ganesh Doc3 : volunteer krish ...
Ba3
Jul 22, 2009 at 6:49 am
Jul 24, 2009 at 6:21 am -
Hello all I am sorting on datetime with minute resolution. It easily reaches the maximum heap size. I am having almost 100M records and it is using 1.5 GB. I am now in a situitation to stop sorting ...
Ganesh
Jul 21, 2009 at 8:26 am
Jul 23, 2009 at 9:46 am -
I think this is a common problem, but don't know the correct solution. Users were doing queries on a numeric field such as qdur:[73 TO 117] and expecting to find all the values within but this fails ...
Paul Taylor
Jul 29, 2009 at 10:04 am
Aug 4, 2009 at 6:46 pm -
http://issues.apache.org/jira/browse/LUCENE-1406 http://issues.apache.org/jira/browse/LUCENE-153 based on this, there are two options: 1- using the aramorph library 2- moving the code from trunk to ...
Walid
Jul 23, 2009 at 11:33 am
Aug 3, 2009 at 4:40 pm -
Hi, I tried searching: "Apache Jakarta"~10 Nothing was returned. What might be wrong? Regards, Prashant.
Prashant ullegaddi
Jul 16, 2009 at 12:34 pm
Jul 17, 2009 at 1:52 pm -
Hello, I'm trying to search for the terms like c++ but the parser is stripping off the ++. I tried escaping the ++ with slashes but it's still stripping it off. I could replace + with "plus", is that ...
Chris Salem
Jul 15, 2009 at 4:20 pm
Jul 16, 2009 at 5:08 pm -
Hi This question has probably been asked before so apologies for asking it again. Just to confirm that it is ok to use a single index writer in a web application and only close that single instance ...
Amin Mohammed-Coleman
Jul 1, 2009 at 7:57 am
Jul 2, 2009 at 8:05 am -
Hi, I am currently creating a search engine and will need to generate a query like the following: title:(+chemistry +"national curriculum") its mention that it can be done using the QueryParser but ...
Blazingwolf7
Jul 28, 2009 at 8:29 am
Jul 31, 2009 at 3:56 am -
Hi, I have a question regarding RAMDirectory. I have a 5 GB index on disk and it is opened like the following: searcher = new IndexSearcher (new RAMDirectory (indexDirectory)); Approximately how much ...
Dragon Fly
Jul 23, 2009 at 1:37 pm
Jul 24, 2009 at 8:21 am -
Hi, I am doing a search on my index for a query like this: query = "\"Term 1\" \"Term 2\" \"Term 3\"" Where I want to find Term 1, Term 2 and Term 3 in the index. However, I only want to search for ...
Max Lynch
Jul 23, 2009 at 6:29 pm
Jul 23, 2009 at 8:00 pm -
We have code (using Lucene 2.4.1) that will build a query that looks like: fielda:"ruz an"~2 OR fieldb:"ruz an"~2 OR fieldc:"ruz an"~2 When passed to a MultiFieldQueryParser and parsed it comes back ...
Philip Puffinburger
Jul 21, 2009 at 6:46 pm
Aug 7, 2009 at 9:26 am -
Hi, I was wonder if there is a list of special characters for the standard analyzer? What I mean by "special" is characters that the analyzer considers break characters. For example, if I have ...
Ohaya
Jul 31, 2009 at 2:12 am
Jul 31, 2009 at 3:13 pm -
Hi All, I am using Lucene 2.2.0 and have created an Index with some text including values in $, £ and euros too! But I can't search for text that includes $, £ or euro. I checked the index with Luke ...
Cbowditch
Jul 29, 2009 at 9:56 am
Jul 30, 2009 at 8:38 pm -
Hi there, I have a question… we have two querys which only different is the fact that Query_1 includes phrase queries where Query_2 has the phrase query but converted into a Boolean query. When each ...
Naranjo, Pedro
Jul 23, 2009 at 9:44 pm
Jul 28, 2009 at 4:28 pm -
Hi, I'm relatively new to Lucene. I have the following case: I have indexed a bunch of documents. I then, query the index using IndexSearcher and retrieve the documents using Hits (I do know this is ...
Anuj Bhatt
Jul 23, 2009 at 2:57 am
Jul 28, 2009 at 11:30 am -
Hi, We are writing to an index using IndexWriter with setMergeFactor() set to 10. After inserting several documents, a merge occurs, but our observation is that each segment only has a single ...
Venkat Rangan
Jul 26, 2009 at 5:24 am
Jul 26, 2009 at 3:04 pm -
Hello, I would like to use a stemming analyser similar to KStem or PorterStem to provide access to a wider search scope for our users. However, at the same time I also want to provide the ability for ...
Robert Corbett
Jul 22, 2009 at 1:08 pm
Jul 23, 2009 at 4:08 am -
hello there i like to re-rank lucene TopDoc result set. where shall i start thanks
Henok sahilu
Jul 22, 2009 at 1:40 pm
Jul 22, 2009 at 2:11 pm -
I am doing sorting on DateTime with minute resolution. I am having 90 million of records and sorting is consuming nearly 500 MB. 30% records are not part of primary result set and they don't have ...
Ganesh
Jul 17, 2009 at 9:12 am
Jul 21, 2009 at 5:43 pm -
Hi All, I need to make ? and ! characters to be a separate token e.g. to split [how are you?] in to 4 tokens [how], [are], [you] and [?] what would be the best way to do this? Thanks
OBender
Jul 17, 2009 at 6:29 pm
Jul 17, 2009 at 10:19 pm -
Hello, Are there any suggestions / best practices for using Lucene for searching non-linguistic text? What I mean by non-linguistic is that it's not English or any other language, but rather product ...
JesL
Jul 16, 2009 at 1:04 pm
Jul 16, 2009 at 6:04 pm -
Hello, Over in Solr land, I'm facing a problem while upgrading the lucene version to trunk. Solr has a QueryElevationComponent which is used to boost certain documents to the top. It pre-processes ...
Shalin Shekhar Mangar
Jul 15, 2009 at 7:56 am
Jul 15, 2009 at 11:55 am -
Hi, I have an issue when optimizing a large index (40GB). Basically the optimize falls over with an IOException stating there are not enough system resources, within a RandomAccessFile.readBytes call ...
Carl Austin
Jul 3, 2009 at 10:12 am
Jul 3, 2009 at 4:33 pm -
Hello again! Yes, I know some of us are still recovering from OSCON. It's time for another delicious meetup to chat about Hadoop, HBase, Solr, Lucene, and more! UW is quite a pain for us to access ...
Bradford Stephens
Jul 27, 2009 at 7:16 pm
Aug 5, 2009 at 5:38 pm -
Is there any difference between using QueryParser and MultiFieldQueryParser when have single default search field ? Depending on how many default search fields on an searching an index I select which ...
Paul Taylor
Jul 31, 2009 at 12:37 pm
Jul 31, 2009 at 3:08 pm -
How to get the number of times a term occurs in the Lucene index? Regards, Prashant.
Prashant ullegaddi
Jul 30, 2009 at 6:42 pm
Jul 31, 2009 at 10:51 am -
Hi, I am trying to index information in some proprietary-formatted files. In particular, these files contain some IP addresses in dotted notation, e.g., aa.bb.cc.dd. For my initial test, I have a ...
Ohaya
Jul 30, 2009 at 1:36 pm
Jul 30, 2009 at 2:38 pm -
Hi folks! After deleting documents from the index it can happen that fields become unused (i.e. no document has this field anymore). And IndexReader.getFieldNames() still returns these unused fields, ...
Khin, Gerald
Jul 9, 2009 at 1:40 pm
Jul 25, 2009 at 3:23 am -
Hi folks, I just upgrading Hibernate Search library of my app and so I had to upgrade Lucene too and pass from 2.2 to 2.4 version. In Lucene 2.4 the ISOLatin1AccentFilter class has changed and I ...
Luther blisset
Jul 24, 2009 at 9:41 am
Jul 24, 2009 at 11:39 am -
Hi, We have some 50M pages, and we also have computed PageRanks of those pages. What's the best way to combine lucene's score with PageRank? Regards, Prashant.
Prashant ullegaddi
Jul 19, 2009 at 11:54 am
Jul 23, 2009 at 4:52 pm -
Hello all, I am using Lucene v2.4.1. I am getting following exception while doing search on 4 indexes using ParallelMultiSearcher. ...
Ganesh
Jul 13, 2009 at 6:06 am
Jul 13, 2009 at 2:09 pm -
Hi All, I'm using the default setup of lucene (no custom analyzers configured) and came across the following issue: In Hindi if there is a letter with a diacritic in a phrase lucene will find the ...
OBender
Jul 10, 2009 at 7:10 pm
Jul 13, 2009 at 10:37 am -
Hi, I am trying to solve the following problem: In my index I have a "url" field added as Field.Store.YES, Field.Index.NOT_ANALYZED and I must use this field as a "key" to identify a document. The ...
Raf
Jul 3, 2009 at 4:27 pm
Jul 4, 2009 at 4:02 pm -
Hi, I still am new to Lucene, but I think I have an initial indexer app (based on the demo IndexFiles app) working, and also have a web app, based on the demo luceneweb web app working. I'm still ...
Ohaya
Jul 31, 2009 at 4:43 pm
Jul 31, 2009 at 8:33 pm -
Hi All, We have a system with a lucene index with 100GB and growing fast. I wonder whether there is an efficient way to backup it taking into account only the changes among old and new version of the ...
Alexandre Leopoldo Gonçalves
Jul 22, 2009 at 1:40 pm
Jul 22, 2009 at 2:43 pm -
Hi, I have a situation, where stored, un-indexed fields can contain potentially large amounts of data. Is it possibly to read the contents of a field incrementally? That is, do not load the complete ...
Günter Ladwig
Jul 14, 2009 at 3:12 pm
Jul 15, 2009 at 1:02 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 105 |
posts | 582 |
users | 117 |
website | lucene.apache.org |
117 users for July 2009
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)