Search Discussions
-
Hi, sorry I've already asked few days ago, but I got no reply and I really need some help on this.. I'm running several queries against a doc collection. The queries are documents of the collection ...
Patrick Diviacco
Mar 28, 2011 at 7:44 am
Mar 29, 2011 at 8:58 am -
I am trying to index content withing certain HTML tags, how do I index it ? Which is the best parser/tokenizer available to do this ? -- View this message in context: ...
Shrinath.m
Mar 11, 2011 at 11:03 am
Mar 15, 2011 at 4:56 am -
I'm using the following code because I want to see the entire collection in my query results: //adding wildcards-term to see all results rest = new TermQuery(new Term("*","*")); ...
Patrick Diviacco
Mar 22, 2011 at 8:23 am
Mar 23, 2011 at 8:18 am -
Hi, I would like to announce Crawl Anywhere. Crawl-Anywhere is a Java Web Crawler. It includes : * a crawler * a document processing pipeline * a solr indexer The crawler has a web administration in ...
Dominique Bejean
Mar 2, 2011 at 12:26 am
Jul 16, 2013 at 4:04 am -
Catchall field has its own disadvantages of increasing index size. MultiFieldQueryParser has to know the field names before hand. How do I do a multi field search - without knowing fields in the ...
Shrinath.m
Mar 15, 2011 at 10:46 am
Mar 17, 2011 at 2:22 pm -
Is there a way one could detect duplicates (say by using some unique hash of certain fields) and marking a document as a duplicate but not remove it. Here is an example: Doc 1) This is my test Doc 2) ...
Mark
Mar 5, 2011 at 6:50 am
Mar 10, 2011 at 4:12 pm -
Hello all, Is there any issue with ParallelMultiSearcher in Lucene 3.0.2? If we search more frequently then OutOfMemoryError is triggered or It is throwing "Not able to create native thread".. It is ...
Ganesh
Mar 17, 2011 at 11:48 am
Mar 23, 2011 at 3:08 am -
When I run my Lucene app and a parse a xml file I get the following error due to some fonts such as "é" written in the text file. If I save the text file as UTF-8 with my text editor I don't have ...
Patrick Diviacco
Mar 28, 2011 at 7:04 am
Apr 1, 2011 at 4:09 pm -
Hi all. I'm trying to parallelise writing documents into an index. Let's set aside the fact that 3.1 is much better at this than 3.0.x... but I'm using 3.0.3. One of the things I need to know is the ...
Trejkaz
Mar 29, 2011 at 12:32 am
Mar 31, 2011 at 7:10 pm -
I'm new to Lucene. If I use description = new TermQuery(new Term("description", "my string")); I ask Lucene to consider "my string" as unique word, right ? I actually need to consider each word, ...
Patrick Diviacco
Mar 21, 2011 at 4:01 pm
Mar 23, 2011 at 7:34 am -
I am asking for partial update in Lucene, where I want to update only a selected field of all fields in the document. Does Lucene provide any way to do this ? How to approach this ? -- View this ...
Shrinath.m
Mar 22, 2011 at 7:00 am
Mar 22, 2011 at 10:39 am -
Hi, I am facing the problem The line in the loop is going very slow giving me a performance hit for (int i = 0; i < hits.length; ++i) { int docId = hits[i].doc; Document d = searcher.doc(docId); ...
Suman.holani
Mar 10, 2011 at 9:36 am
Mar 11, 2011 at 10:46 am -
hi I have a code that work fine with lucene 3.2 where i used TermDocs to find the corpusTF here is the code public void calculateCorpusTF(IndexReader reader) throws IOException { // TODO ...
Nitinhardeniya
Mar 22, 2011 at 7:44 pm
Mar 24, 2011 at 5:09 pm -
I've downloaded Lucene nightly build because I need to customize the similarity *per field*. However I don't see the field parameter passed to the methods to compute the score such as "tf" and ...
Patrick Diviacco
Mar 3, 2011 at 3:26 pm
Mar 5, 2011 at 9:47 am -
Is there a Filter to get a limited number of random collection docs from the index which DO NOT contain a specific term ? i.e. term="pizza" I want to run the query against 10 random documents of the ...
Patrick Diviacco
Mar 29, 2011 at 6:40 pm
Mar 30, 2011 at 9:45 am -
I've downloaded the nightly build of Lucene (TRUNK) and I'm referring to the following documentation: https://hudson.apache.org/hudson/view/G-L/view/Lucene/job/Lucene-trunk/javadoc/all/index.html But ...
Patrick Diviacco
Mar 29, 2011 at 11:21 am
Mar 29, 2011 at 7:19 pm -
Hello everybody, I have an enquiry about StandardAnalyzer. Can I use it for other languages except from English? I give the right list of stop words at initialization. Is there anything else inside ...
Vasiliki Gkouta
Mar 13, 2011 at 11:24 pm
Mar 14, 2011 at 10:33 pm -
I've downloaded Lucene nightly build and I've seen that WhitespaceAnalyzer.java is not anymore there. Has this analyzer been removed from the library ? What should I use instead ? thanks
Patrick Diviacco
Mar 2, 2011 at 10:33 pm
Mar 4, 2011 at 3:14 pm -
Hi, What are my options for distributing an application that uses Lucene? Our current application works against a database of INVENTORY. We schedule hourly checks for modified items ...
Sol myr
Mar 22, 2011 at 8:31 am
Mar 31, 2011 at 9:20 am -
Hi, Can someone help me with this problem please? I got these when running my program: java.io.FileNotFoundException: /Users/vonhutuan/Documents/workspace/InformationExtractor/index_wordlist/_i82.frq ...
Vo Nhu Tuan
Mar 23, 2011 at 9:49 am
Mar 23, 2011 at 11:04 am -
Hi, I would like to create an index with Lucene to a document collections of text files. The index should be created in such a way, that for the search I can enforce that query term A and query term ...
Michael Wiegand
Mar 4, 2011 at 7:06 am
Mar 11, 2011 at 10:04 am -
Dear Lucene/Solr user, It is possible you may not know of an Apache project called ManifoldCF, whose purpose is to provide content to Solr for index. If you have interest in this project, this is to ...
Karl Wright
Mar 2, 2011 at 7:21 am
Mar 10, 2011 at 4:35 pm -
hi, I performing multiple queries (stored in a 100MB XML file) against a collection (indexed with lucene, and it was stored before in a 100MB XML file). The process seems pretty long on my machine ...
Patrick Diviacco
Mar 29, 2011 at 9:22 am
Mar 29, 2011 at 10:00 am -
Hi, I would like to build a search system where a search for "Dan" would also search for "Daniel" and a search for "Will", "William" . Any ideas on how to go about implementing that? I can think of ...
Deepak Konidena
Mar 24, 2011 at 6:32 pm
Mar 25, 2011 at 2:16 pm -
Hello, I would index the same document with 2 different Analyzer. So I have to create 2 different index. How can I do that ? thank you for your help, Amel.
Amel Fraisse
Mar 25, 2011 at 12:00 pm
Mar 25, 2011 at 1:48 pm -
Hi, I need to search a Catalog. Most users search *this* year's catalog, but on rare occasions they may ask for old products (from previous years). I'm trying to select between 2 options: 1) Keep ...
Sol myr
Mar 24, 2011 at 2:01 pm
Mar 24, 2011 at 2:45 pm -
Is there a way to display Lucene scores per field instead of the global one ? Both my query and my docs have 3 fields. I would like to see the scores for each field in the results. Can I ? Or should ...
Patrick Diviacco
Mar 22, 2011 at 8:35 am
Mar 23, 2011 at 5:29 am -
I am trying to index in Lucene a field that could have label of concepts in different languages. Most of the approaches I have seen so far are: - Use a single index, where each document has a field ...
Stephane Fellah
Mar 11, 2011 at 3:30 am
Mar 14, 2011 at 1:50 pm -
What's the best way to replace WhitespaceAnalyzer in this line in Lucene nightly build 4.0 ? Is there a generic analyzer I can use ? writer = new IndexWriter(FSDirectory.open(INDEX_DIR), new ...
Patrick Diviacco
Mar 4, 2011 at 2:21 pm
Mar 13, 2011 at 1:45 pm -
hi it seems my mail is judged as spam. Technical details of permanent failure: Google tried to deliver your message, but it was rejected by the recipient domain. We recommend contacting the other ...
Li Li
Mar 11, 2011 at 9:35 am
Mar 11, 2011 at 1:44 pm -
Hi, I am developing a pdf search engine, just use in local computer to search massive pdf documents. I used pdfbox+lucene to index and search, and then I have to display the context to the user in ...
Cescy
Mar 6, 2011 at 1:31 pm
Mar 7, 2011 at 9:28 pm -
Hello! I am curious to know if the Lucene Project or other associated entity accepts paid technical support/subscriptions for high-priority technical or bug resolution type of support. Thanks! -David.
Jarrin, David
Mar 3, 2011 at 8:14 pm
Mar 3, 2011 at 10:34 pm -
Hi all, Is there a way to find the length of a field of a lucene index document? Thanks, Lahiru
Lahiru Samarakoon
Mar 1, 2011 at 5:35 am
Mar 1, 2011 at 9:06 am -
Hi, I'm trying to sort results by a NumericField but the results do not sort (still appear in default score order). The NumericField was indexed using the code below: NumericField field = new ...
Azhar Jassal
Mar 25, 2011 at 2:23 pm
Mar 26, 2011 at 11:19 am -
Hi Folks, Before I run off and reinvent the wheel here - has anyone done any form of result grouping with lucene? My use case looks something like this: Newspaper pages are stored as documents in the ...
Dawn Zoë Raison
Mar 22, 2011 at 10:44 am
Mar 25, 2011 at 10:31 am -
I've some issues to open my index with Luke. I get the following error message: Unknown format version: -12 I build the index using the following code: http://codepad.org/OxGRGTRb The index type is ...
Patrick Diviacco
Mar 23, 2011 at 7:57 am
Mar 23, 2011 at 8:07 am -
Hi, My highlight code is shown as following: QueryScorer scorer = new QueryScorer(query); Highlighter highlighter = new Highlighter(simpleHTMLFormatter, scorer); highlighter.setTextFragmenter(new ...
Cescy
Mar 15, 2011 at 8:48 pm
Mar 17, 2011 at 8:26 am -
Hi, I have two web applications that uses lucene 2.3.2. Both share the same index and can write or read. Writing is synchronized based on file system to allow only one IndexWriter to work at the ...
Bart_212
Mar 9, 2011 at 7:45 pm
Mar 10, 2011 at 8:37 pm -
We are developing a large 4-tier multi-server app that will accept Questions and related Comments supplied by users. There will be 100K's of users that live in Shards. Also, ideally there would be no ...
BrightMinds Dev
Mar 4, 2011 at 6:00 pm
Mar 9, 2011 at 7:17 pm -
Hello list, Does this look correct? I am told it is not functioning, in that new entries to the index are not being picked-up? Thanks Lee try { if (! reader.isCurrent()){ IndexReader newReader = ...
Lee Goddard
Mar 4, 2011 at 1:20 pm
Mar 4, 2011 at 5:32 pm -
Hello all, Could any one guide me how to backup or do replication with Lucene. Regards Ganesh Send free SMS to your Friends on Mobile from your Yahoo! Messenger. Download Now! ...
Ganesh
Mar 1, 2011 at 6:36 am
Mar 3, 2011 at 5:38 pm -
Hi, OK so I will not bother using TieredMergePolicy for now. I will do some more tests with the contrib balanced merge policy, playing with the optimize(maxNumSegments) to try decreasing the optimize ...
V Sevel
Mar 1, 2011 at 8:18 am
Mar 2, 2011 at 11:01 am -
I need to define different similarity scores per document field. For example for field A I want to use Lucene tf.idf score, for the numerical field B I want to use a different metric (difference ...
Patrick Diviacco
Mar 1, 2011 at 7:42 pm
Mar 1, 2011 at 10:47 pm -
Is there a minimum string length requirement for proximity search? For example, would "a~" or "an~" trigger proximity search? The result would be horrible if there is no such requirement. Thanks, ...
Andy Yang
Mar 31, 2011 at 1:54 am
Mar 31, 2011 at 2:01 am -
Hello All Recently, I am trying to develop an automatic definition extraction system for Amharic Language - using machine learning technique (Version Space learning). Can anyone suggest me some java ...
Henok sahilu
Mar 29, 2011 at 7:12 am
Mar 29, 2011 at 1:36 pm -
Hi, I am using MultiFieldQueryParser with a custom analyzer for parsing search text. Now, when I say MultiFieldQueryParser qp = new MultiFieldQueryParser(Version, new String[] {"field1", "field2", ...
Deepak Konidena
Mar 24, 2011 at 5:49 pm
Mar 25, 2011 at 2:19 pm -
Is there some sort of default limit imposed on the Lucene indexes? I try to index 50k or 60k documents but when I use Luke to go inside the index and check the total # of entries indexed, it shows ...
Pulkit Singhal
Mar 24, 2011 at 10:07 pm
Mar 25, 2011 at 9:56 am -
Hi Luceners, this is my 1st experience with ARQ, LARQ & Lucene; everyth. went smooth so far, however the slope seems to be getting steeper suddenly. The initial problem was to develop a Java app to ...
Fr Jurain
Mar 22, 2011 at 3:16 pm
Mar 24, 2011 at 9:47 am -
I'm new to Lucene and I would like to know what's the difference (if there is any) between PhraseQuery.add(Term1) PhraseQuery.add(Term2) PhraseQuery.add(Term3) and term1 = new TermQuery(new ...
Patrick Diviacco
Mar 21, 2011 at 5:43 pm
Mar 22, 2011 at 1:38 pm -
I'm having a problem with the performance of lazily-loaded fields with lucene. The basic structure of the code is that I get a set of documents back from a query, then iterate through them, reading ...
Brian Hurt
Mar 21, 2011 at 6:16 pm
Mar 22, 2011 at 1:23 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 111 |
posts | 422 |
users | 114 |
website | lucene.apache.org |
114 users for March 2011
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)