Search Discussions
-
Team, A NumericDocValueField("TIME_SORT",unixTimeStamp value) is indexed along with other fields. During search time, I've used NumericDocValuesSorter to get Sorter.DocMap so that I can access the ...
Ramprakash Ramamoorthy
Mar 6, 2014 at 10:21 am
Mar 10, 2014 at 9:54 am -
Hi everyone When I query a lucene index, I get back a list of document ids. This index search is fast. Now for all documents matching the result I need a unique String field called "id" which is ...
Rohit Banga
Mar 21, 2014 at 8:36 pm
Mar 28, 2014 at 4:39 am -
Hi to all, I'm trying to use the Replicator (http://lucene.apache.org/core/4_7_0/replicator/index.html?org/apache/lucene/replicator/package-summary.html) to backup my indexes while indexing. We have ...
Roberto Franchini
Mar 13, 2014 at 4:22 pm
Mar 24, 2014 at 4:34 pm -
Dear Team, we are using lucene in our product , it well searching for high speed and performance but Japaneese, chinese and korean language not searching properly we had use QueryParser QueryParser ...
Kalaik
Mar 21, 2014 at 1:07 pm
Mar 26, 2014 at 7:50 am -
I am trying to analyze some japanese web pages for presence of slang/adult phrases in them using lucene-analyzers-kuromoji-4.6.0.jar. While the tokenizer breaks up the word into proper words, I am ...
Rahul Ratnakar
Mar 10, 2014 at 5:58 pm
Mar 11, 2014 at 1:25 am -
Greetings We have a problem whereby Lucene 4.7 occasionally does not apply a filter query during searching. The problem is intermittent. One in thirty or so searches will return what appears to be an ...
Jamie
Mar 28, 2014 at 11:00 am
Mar 28, 2014 at 3:48 pm -
Hi all, I am using lucene to index a large corpus of text, with every word being a separate document (this is something I cannot change), and I am hitting a limitation of the CompositeReader only ...
Artem Gayardo-Matrosov
Mar 21, 2014 at 4:29 pm
Mar 21, 2014 at 9:09 pm -
Hi We're planning to upgrade lucene-analyzers-commons 4.3.0 to 4.6.1 . While running our unit test with 4.6.1 it fails at org.apache.lucene.analysis.Tokenizer on line 88 (setReader method). There it ...
Joe Wong
Mar 20, 2014 at 7:58 pm
Mar 20, 2014 at 11:03 pm -
Hi All, Is there any way I could construct a query that would not automatically exclude negation terms (such as "no", "not", etc)? For example, I need to find strings like "not happy", "no idea", ...
Natalia Connolly
Mar 17, 2014 at 7:03 pm
Mar 18, 2014 at 3:05 pm -
Hi :) My index (Lucene 3.5) contains a field called title. Its value is indexed (analyzed and stored) with the WhitespaceAnalyzer and can contains html entities such as ’ or ° My problem is ...
G.Long
Mar 3, 2014 at 5:10 pm
Mar 4, 2014 at 11:43 am -
Dear all, I am trying to compute the cosine similarity between several documents. I have an indexed directory A made using 10000 files and another indexed directory B made using 20000 files. All the ...
Stefy D.
Mar 20, 2014 at 9:05 pm
Mar 21, 2014 at 3:04 pm -
Hi folks, We are currently using Lucene 4.5 and we are hitting some bottlenecks and appreciate some input from the community. This particular index (the disk size for which is about 10GB) is ...
Arvind Kalyan
Mar 5, 2014 at 9:18 am
Mar 5, 2014 at 9:53 pm -
Hello, I have a doubt about index size, I am testing a program using Lucene to index some dataset. At the final the result of index size is varying a little, since i haven't finished the tests at ...
Jose Carlos Canova
Mar 25, 2014 at 5:37 am
Mar 25, 2014 at 12:42 pm -
Hi there I am searching for a way to store very large documents in a Lucene 4.7 index and keep them ready to use the PostingsHighlighter for search result highlighting. I do not want to read the ...
Mirko Sertic
Mar 24, 2014 at 2:03 pm
Mar 25, 2014 at 12:47 am -
Hi, I have few questions related to updatable DocValues API... It would be great if I can get help. 1. Is it possible to provide updateNumericDocValue(Term term, Map<String,Long ), incase I wish to ...
Ravikumar Govindarajan
Mar 14, 2014 at 10:15 am
Mar 15, 2014 at 7:18 am -
Hello, I just posted an article on Comparing Document Classification Functions of Lucene and Mahout. http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html ...
Koji Sekiguchi
Mar 7, 2014 at 2:31 am
Mar 9, 2014 at 7:57 am -
Hi, Is there any data type in lucene can support functions like SortedDocValues for any numeric(int, long, float, double) type. SortedDocValues only supports bytes, I want some data type can get ...
Yonghui Zhao
Mar 5, 2014 at 12:01 pm
Mar 5, 2014 at 12:58 pm -
Hello, does Lucene provide a zero or one character wildcard (like ? in Perl RegEx)? Example of what I mean: "house%" finds "house" and "houses" As far as I know in Lucene the ? wildcard is for ...
Sven Teichmann
Mar 25, 2014 at 1:57 pm
Mar 25, 2014 at 4:37 pm -
I have a usage pattern where I need to package up and store away all files from an index referenced by multiple commit points. To that end, I basically call IndexWriter.commit(), followed by ...
Vitaly Funstein
Mar 21, 2014 at 12:35 am
Mar 21, 2014 at 6:01 pm -
Hi I started migrating my lucene search application from 2.9 version to 4.7.0 . Please suggest me the best way and best practices for this. There are many files to rewrite. Thank you, Narasimha.
NarasimhaRao DPNV
Mar 20, 2014 at 6:39 pm
Mar 20, 2014 at 11:10 pm -
Hi I'm trying to submit a lucene query string to my index to return a data based on a numeric range. I'm using the syntax provided in the Query Parser Syntax document but the results I get indicate ...
Matthew Petersen
Mar 20, 2014 at 8:44 pm
Mar 20, 2014 at 9:46 pm -
Hi, I posted this question on the Solr mailing list but it has more to do with Lucene. I have a performance and scoring problem for phrase queries 1. Performance - phrase queries involving frequent ...
Manuel Le Normand
Mar 12, 2014 at 9:28 am
Mar 12, 2014 at 2:13 pm -
I'm working on a system that uses Lucene 4.6.0, and I have a couple of use cases for documents that modify themselves as they're being indexed. For example, we have text classifiers that we would ...
Stephen Green
Mar 11, 2014 at 6:34 pm
Mar 11, 2014 at 8:40 pm -
Hi, Phrases with stop words in them are not getting searched whereas a phrase without it gets searched using ComplexPhraseQueryParser/SpanQueryParser. SpanQueryParser reference ...
Modassar Ather
Mar 5, 2014 at 7:40 am
Mar 7, 2014 at 6:04 am -
Hello, I would like to use Apache *Lucene 4*.x and count words in the string, for example: "I loved cats, but now I really love dogs" - count "love" word in the String - result should be 2. I would ...
Hollow Quincy
Mar 28, 2014 at 2:35 pm
Mar 28, 2014 at 6:43 pm -
Hi, I would like seek assistance regarding the following issue I'm encountering. I'm running Tomcat and have deployed Jena (2.6.2) and Lucene Core (2.9.0), whereby Jena invokes the Lucene classes ...
Turri, Albert (ELS-NYC)
Mar 25, 2014 at 5:29 pm
Mar 25, 2014 at 5:34 pm -
I was just trying to implement a StoredFieldsWriter[4.6.1] and found that finishDocument() method has an empty impl. Any reason for not declaring it abstract? We could easily miss over-riding it -- ...
Ravikumar Govindarajan
Mar 13, 2014 at 12:22 pm
Mar 13, 2014 at 3:07 pm -
I have a small set of document numbers as a query result collected with some non-scoring collector. Now, I want to send high-performant successive queries only in this document number scope, as part ...
Christian Reuschling
Mar 12, 2014 at 5:41 pm
Mar 12, 2014 at 8:14 pm -
Hello, what is the best method to score documents similar to default similarity, but the document frequency should be calculated per query against the matching result document set, not statically ...
Christian Reuschling
Mar 6, 2014 at 6:34 pm
Mar 8, 2014 at 1:28 am -
Hi; Tf-Idf is explanation says that: *idf(t)* appears for *t* in both the query and the document, hence it is squared in the equation. DefaultSimilarity does not square it. What it the explanation of ...
Furkan KAMACI
Mar 6, 2014 at 11:29 pm
Mar 7, 2014 at 2:06 pm -
Hi, is there a way I can search in file to match my synomyms, instead of building a SynonymMap ? My synonym list is going to be very large and I don;t want to keep it in memory. Regards Geet
Geet Gangwar
Mar 3, 2014 at 2:02 pm
Mar 3, 2014 at 5:10 pm -
Sir i am PG student, my research topic is to optimize the indexing file [reduce index file size, RAM usage, CPU utilization, and create index with payload to improve searching speed]. Currently ...
Mrugendra
Mar 2, 2014 at 2:07 pm
Mar 2, 2014 at 4:57 pm -
Dear open source projectors: How to implement and search? I can't do it. ------------------ From Lmc in China
上而求索
Mar 1, 2014 at 12:52 pm
Mar 1, 2014 at 5:30 pm -
Hi , Please tell me how can I implement HitCollector in lucene 4.7.0, migrating from previous versions . I didn't find HitCollector in 4.7. Thanks & Regards, Narasimha.
NarasimhaRao DPNV
Mar 26, 2014 at 3:48 pm
Mar 26, 2014 at 3:48 pm -
We are using Lucene 3.6 to perform incremental indexing. We use an algorithm we found on the web to perform the incremental indexing. 1. For each file that we indexed, we create a UID field to ...
Yuan
Mar 24, 2014 at 5:02 pm
Mar 24, 2014 at 5:02 pm -
Hi all, Does anybody know of a way of getting a breakdown of the disk space a particular field takes up in a lucene index? I'm experimenting with different query-time and index-time field ...
Alan Woodward
Mar 21, 2014 at 10:34 am
Mar 21, 2014 at 10:34 am -
Hi , Want to Know Big Data / Hadoop ? If yes , join us for Webinar by industry experts at below link. *Webinar on Hadoop, Hosted by : Manoj , Research Director* *Join us for a webinar on Mar 19, 2014 ...
Vivek Kumar
Mar 19, 2014 at 12:21 pm
Mar 19, 2014 at 12:21 pm -
For example, given a new big department merged from three departments. A few employees worked for two or three departments before merging. That means, the attributes of one person might be listed ...
Mobius ReX
Mar 17, 2014 at 6:03 pm
Mar 17, 2014 at 6:03 pm -
Hi, the Apache Lucene/Solr committers decided with a large majority on the vote to require Java 7 for the next minor release of Apache Lucene and Apache Solr (version 4.8)! Support for Java 6 by ...
Uwe Schindler
Mar 12, 2014 at 2:07 pm
Mar 12, 2014 at 2:07 pm -
Hello, If I want to print out all strings within distance 2 of a given string, how do I do it? I'm totally new to Lucene, so below are some parts that I'm starting to assemble. Any tips are welcome ...
Robert Servensen
Mar 5, 2014 at 12:47 am
Mar 5, 2014 at 12:47 am
Group Overview
group | java-user |
categories | lucene |
discussions | 40 |
posts | 152 |
users | 60 |
website | lucene.apache.org |
60 users for March 2014
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)