61 discussions - 232 posts

  • Hi, I am thinking of making my Lucene indexing multi-threaded. Can someone shed some light on the best approach for achieving this? I will give a short gist of what I am trying to do, ...
    Nischal reddy
    Sep 2, 2013 at 2:13 pm
    Oct 28, 2014 at 2:16 pm
  • This useful-looking item is in the test-framework jar. Is there some subtle reason that it isn't in the common analyzer jar? Some reason why I'd regret using it?
    Benson Margulies
    Sep 5, 2013 at 10:41 pm
    Sep 7, 2013 at 9:03 pm
  • Hi, I'm developing a web application that contains a REST service in Tomcat, which receives several requests per second. The REST requests search a Lucene index; to do this I use the ...
    David Miranda
    Sep 5, 2013 at 1:16 am
    Sep 7, 2013 at 1:09 am
  • Hello, Over the last few weeks I've been working on upgrading an application from Lucene 3.x to Lucene 4.x in hopes of improving performance. Unfortunately, after going through the full migration ...
    Sep 18, 2013 at 7:28 pm
    Oct 3, 2013 at 3:40 am
  • Hi, In the Example of Multiphrase Query it is mentioned "To use this class, to search for the phrase "Microsoft app*" first use add(Term) on the term "Microsoft", then find all terms that have "app" ...
    Sep 25, 2013 at 2:04 pm
    Oct 3, 2013 at 11:42 am
  • I'm creating multiple instances of a field, some with Field.Store.YES and some with Field.Store.NO, with Lucene 4.4. If Field.Store.YES is set then I see multiple instances of the field in the ...
    Alan Burlison
    Sep 16, 2013 at 10:33 am
    Sep 17, 2013 at 2:33 pm
  • Hi all, I am getting strange performance measurements on Lucene 4.4.0; maybe someone can explain this: The following syntax leads to pretty slow queries on my machine (16ms for every execution) ...
    Mirko Sertic
    Sep 7, 2013 at 2:59 pm
    Sep 10, 2013 at 7:01 am
  • Hi, I am trying to store all the Field values using CompressionTool, but when I search for any content, it does not find any results. Can you help me with how to create the Field with CompressionTool to ...
    Jebarlin Robertson
    Sep 13, 2013 at 11:38 am
    Sep 19, 2013 at 3:45 am
  • We've recently upgraded to Lucene 4.4.0 and mergeSegments now causes an OOM error. As background, our index contains about 14 million documents (growing slowly) and we process about 1 million updates ...
    Michael van Rooyen
    Sep 25, 2013 at 12:22 pm
    Oct 8, 2013 at 11:11 am
  • Hello. I am faced with a trivial issue: Every time my Query is being fired as a Boolean Query... Providing Input : <param name="user_name" value="USER_NAME_MENTIONED"/ \* This input is provided. Since ...
    Ankit Murarka
    Sep 12, 2013 at 2:20 pm
    Sep 12, 2013 at 4:11 pm
  • I'm confused by the comment about compound components here. If a single token fissions into multiple tokens, then what belongs in the PositionLengthAttribute? I want to store a fraction in here! ...
    Benson Margulies
    Sep 7, 2013 at 12:03 am
    Sep 7, 2013 at 12:48 pm
  • Hello. I am stuck on a problem and have been continuously getting exceptions of /*Stream Closed and LockObtainFailedException..*/ I am trying to read the complete document line by line and once I have ...
    Ankit Murarka
    Sep 1, 2013 at 11:29 am
    Sep 2, 2013 at 2:32 pm
  • Is there any way to check the similarity of texts with Lucene? I have the DBpedia indexed and wanted to get the texts more similar between the abstract and DBpedia another text. If I do a search in ...
    David Miranda
    Sep 3, 2013 at 5:33 pm
    Sep 4, 2013 at 6:42 pm
  • Hi, I'm trying to optimize an index we have, and one thing that has come up recently is that we're not really using term frequencies, and we don't need any scoring. We noticed that the term ...
    Marcos Juarez Lopez
    Sep 19, 2013 at 11:18 pm
    Sep 26, 2013 at 4:43 pm
  • Hi, I am learning Lucene. I am developing an application to do a search in log files in multi-environment boxes. I have googled for a deeper understanding, but all examples were just referring for ...
    Sep 19, 2013 at 5:45 pm
    Sep 21, 2013 at 10:41 am
  • Hello, We're using lucene 4.1. We have the word "block-major-5" indexed. Using the classic analyzer, we get the following tokens : block and major-5. However, -- With Thanks and Regards, Ramprakash ...
    Ramprakash Ramamoorthy
    Sep 20, 2013 at 12:14 pm
    Sep 20, 2013 at 1:08 pm
  • Firstly, some context. I'm indexing a large set of mbox files which contain multiple email messages, each mbox file being related to a single issue. I'm therefore indexing each mbox as a single ...
    Alan Burlison
    Sep 15, 2013 at 9:15 am
    Sep 15, 2013 at 12:05 pm
  • I am using the SmartChineseAnalyzer in v3.6 but accessing or instantiating it for the first time takes 10 to 15 seconds before it does anything. I do not see this huge delay with StandardAnalyzer. Is ...
    Darren Hoffman
    Sep 6, 2013 at 7:42 pm
    Sep 6, 2013 at 10:12 pm
  • I have a use case where some of my documents have duplicate terms in various fields or within the same field. For example, I may have a million documents with just the term "foo" in field A, and ...
    Kristofer Karlsson
    Sep 5, 2013 at 7:29 am
    Sep 5, 2013 at 2:08 pm
  • Hi, I am learning Lucene, and have created indexes using LuceneWriter (which worked fine), but when I try to query it with LuceneReader it does not work; I need help with this. Following is the code ...
    Sep 21, 2013 at 6:48 am
    Sep 23, 2013 at 11:58 am
  • Suppose I have a string like "ab@cd%d". My analyzer will turn this into "ab cd d". Can I pass it "ab\@cd\%d" and force it to treat it as a single word? I want to use the Query parser, but I don't ...
    Scott Smith
    Sep 17, 2013 at 8:27 pm
    Sep 18, 2013 at 9:52 pm
  • In lucene 4.3.0 there is no IndexFileNameFilter. And I find in org.apache.lucene.index.IndexFileNames the index file extensions have only 3 types. public static final String INDEX_EXTENSIONS[] = new ...
    Yonghui Zhao
    Sep 18, 2013 at 11:04 am
    Sep 18, 2013 at 12:28 pm
  • Can anyone shed light as to why this is a token filter and not a char filter? I'm wishing for one of these _upstream_ of a tokenizer, so that the tokenizer's lookups in its dictionaries are seeing ...
    Benson Margulies
    Sep 16, 2013 at 4:25 pm
    Sep 16, 2013 at 5:45 pm
  • All, Apologies if I missed this in the documentation, but should: FuzzyQuery q = new FuzzyQuery(new Term("field", "ab"), 2) retrieve a document that contains: abcd and vice versa. Same question for ...
    Allison, Timothy B.
    Sep 12, 2013 at 1:43 am
    Sep 13, 2013 at 3:28 am
  • Hello all, Looking at the 10% slowest queries, I get very bad performance (~60 sec per query). These queries have lots of conditions on my main field (more than a hundred), including phrase queries ...
    Manuel Le Normand
    Sep 8, 2013 at 11:04 am
    Sep 8, 2013 at 2:13 pm
  • Hi again, In order to delete part of my index I run a delete by query that intends to erase 15% of the docs. I added these params to the solrconfig.xml <mergePolicy ...
    Manuel Le Normand
    Sep 8, 2013 at 11:27 am
    Sep 8, 2013 at 1:58 pm
  • Hello! I have really long document field values. Tokens of these fields are of the form: word|payload|position_increment. (I need to control position increments and payload manually.) I collect these ...
    Igor Shalyminov
    Sep 27, 2013 at 2:12 pm
    Oct 4, 2013 at 1:21 pm
  • Hi, Please let me know if you are aware of a book or reference guide on Lucene 4 and above. I came across 'Lucene in Action, 2nd Edition', but it covers Lucene 3.x. Thanks in Advance, Ashwin
    Ashwin Tandel
    Sep 24, 2013 at 5:13 pm
    Sep 25, 2013 at 10:19 pm
  • hi all, I am quite new to Lucene. I have downloaded an example from a tutorial, adapted it for version 3.6 (which is the one I have installed) and run it several times. The script indexes an array of ...
    Ruud Dozijn
    Sep 24, 2013 at 11:11 am
    Sep 24, 2013 at 1:00 pm
  • Hi, folks, I built a Lucene index using lucene-4.3. However, I found that for a field, some terms are searchable while searching for the others throws the following exception: java.io.EOFException: seek past ...
    Hao yan
    Sep 18, 2013 at 4:39 am
    Sep 19, 2013 at 10:53 pm
  • Hello, I was going to use the TotalHitCountCollector in cases where I'm interested just in the number of results. Obviously I was hoping to gain in performance compared to a "scored" query. search ...
    Nicola Buso
    Sep 18, 2013 at 3:21 pm
    Sep 19, 2013 at 9:47 am
  • Hi, I wrote a simple code to update a lucene document with new values. Code Snippet: Term term = new Term("PRODUCT_CODE", productCode); TermQuery query = new TermQuery(term); TopDocs productDoc = ...
    Sanket Paranjape
    Sep 18, 2013 at 12:51 pm
    Sep 19, 2013 at 5:56 am
  • Here it fails because -verbose is not set: $ java -cp ./lucene-core-4.4-SNAPSHOT.jar org.apache.lucene.index.IndexUpgrader ./INDEX Exception in thread "main" java.lang.IllegalArgumentException ...
    Bruce Karsh
    Sep 17, 2013 at 1:28 am
    Sep 17, 2013 at 8:56 pm
  • Hi, This is probably a very basic question, but I am unable to find the Lucene contrib jar in Apache's Maven repository. E.g., I looked here ...
    Abhinav M Kulkarni
    Sep 11, 2013 at 4:40 am
    Sep 11, 2013 at 6:48 pm
  • Hello All, I would like to know the basic difference between providing a phrase suggestion based on String[] suggestionMatcher=getSuggestion(suggestedPhrase,phraseRecommender); and providing phrase ...
    Ankit Murarka
    Sep 5, 2013 at 7:38 am
    Sep 6, 2013 at 11:11 am
  • Hi All, Can you please let me know if there is an equivalent of LatLongDistanceFilter in Lucene 4.4 API. This API was present in Lucene 3.6 API. I have to mainly compute whether a point(lat,lang) is ...
    James bond
    Sep 24, 2013 at 4:59 pm
    Oct 8, 2013 at 4:02 pm
  • Hi there, I am trying to dive deep into the inner LucenePostingFormat to check what I might do to improve query performance. I'm curious about the termBlock stats that I get from checkIndex -verbose ...
    Manuel Le Normand
    Sep 22, 2013 at 1:36 pm
    Sep 23, 2013 at 11:57 am
  • Hi, I'm trying to do something very simple with the parent/child blockjoinquery. I have several child docs and a parent doc added to index in the same order. There are 3 fields + filter field for ...
    Krithika r
    Sep 20, 2013 at 12:12 am
    Sep 20, 2013 at 10:57 am
  • Hi, While trying to play with the CompoundWordTokenFilterBase I noticed that the behavior is to include the original token together with the new sub-tokens. I assume this is expected (haven't found ...
    Alex Parvulescu
    Sep 18, 2013 at 2:28 pm
    Sep 18, 2013 at 6:24 pm
  • Hi all, is there any good documentation of how to change and modify the index of Lucene version 4 other than what is already on the website? Blogs, papers, reports etc. or just a report on experience ...
    Ralf Bierig
    Sep 17, 2013 at 9:32 pm
    Sep 18, 2013 at 11:15 am
  • Most of my terms return the correct position that they are in, but there is a percentage of them that return really bad values. For example, I have a field that contains 5 terms, when I ask for term ...
    Ross Woolf
    Sep 17, 2013 at 5:06 pm
    Sep 18, 2013 at 11:08 am
  • Hi, I want to do a kind of 'facet search': an initial search in one field of all documents in the Lucene index, and a second search in another field of the documents returned by the first search ...
    David Miranda
    Sep 17, 2013 at 10:55 am
    Sep 17, 2013 at 12:33 pm
  • Hi, I am getting an exception while indexing files. I tried debugging but couldn't figure out the problem. I have a custom analyzer which creates the token stream; I am indexing around 15k files, ...
    Nischal reddy
    Sep 16, 2013 at 7:12 pm
    Sep 17, 2013 at 12:19 pm
  • I want to be sure I understand this correctly. Suppose I have a search that I'm going to run through the query parser that looks like: body:"some phrase" AND keyword:"my-keyword" clearly "body" and ...
    Scott Smith
    Sep 16, 2013 at 4:37 pm
    Sep 16, 2013 at 5:14 pm
  • Has anyone experienced a latency increase between the above versions? Mainly in conjunction queries. Thanks -John
    John Wang
    Sep 14, 2013 at 3:09 am
    Sep 16, 2013 at 8:46 am
  • First let me start by saying: I'm sorry! I know this question has probably been asked and answered already, but I am new to this project and just trying to get up to speed. I do have a very simple ...
    Wasikowski, Brian [ JRDUS]
    Sep 13, 2013 at 5:03 pm
    Sep 13, 2013 at 6:00 pm
  • Hi, I am a bit confused about the Lucene attributes; can someone please help me out with this? Can we store all the attributes of a term in the index? I have set the following attributes for a term, ...
    Nischal reddy
    Sep 13, 2013 at 11:32 am
    Sep 13, 2013 at 4:03 pm
  • Hi, I have written a custom Tokenizer which will split my input text into tokens. I have overridden the incrementToken method and am setting chartermAttribute, offsetAttribute, typeAttribute (Please ...
    Nischal reddy
    Sep 11, 2013 at 1:25 pm
    Sep 13, 2013 at 4:02 pm
  • I want to hijack the SpellChecker class as a general spell checking and word suggestion tool. The idea of using this class was to avoid creating my own. At first it seems to fit the bill ...
    Johnny Jenkins
    Sep 11, 2013 at 2:25 am
    Sep 11, 2013 at 11:21 pm
  • Hello Have a peculiar problem to deal with and I am sure there must be some way to handle it. 1. Indexes exist on the server for existing files. 2. Generating indexing is automated so files when ...
    Ankit Murarka
    Sep 11, 2013 at 12:54 pm
    Sep 11, 2013 at 1:27 pm
Group: java-user
Period: Sep 2013

67 users for September 2013

  • Michael McCandless: 23 posts
  • Benson Margulies: 13 posts
  • Ankit Murarka: 12 posts
  • Ian Lea: 12 posts
  • Alan Burlison: 10 posts
  • Erick Erickson: 10 posts
  • Adrien Grand: 9 posts
  • David Miranda: 8 posts
  • Nischal reddy: 8 posts
  • Jack Krupansky: 7 posts
  • Allison, Timothy B.: 6 posts
  • Uwe Schindler: 6 posts
  • Jebarlin Robertson: 5 posts
  • Manuel Le Normand: 5 posts
  • Mirko Sertic: 5 posts
  • Robert Muir: 5 posts
  • Darren Hoffman: 4 posts
  • Marcos Juarez Lopez: 4 posts
  • VIGNESH S: 4 posts
  • Desidero: 3 posts