FAQ

Search Discussions

62 discussions - 246 posts

  • Hi, I am looking for a token filter that can combine 2 terms into 1? E.g. the input has been tokenized by white space: t1 t2 t2a t3 I want a filter that output: t1 t2t2a t3 I know it is a very ...
    Xi ShenXi Shen
    Dec 21, 2012 at 7:50 am
    Dec 29, 2012 at 3:29 am
  • Hi, I need a guide to implement my own TokenFilter. I checked the wiki, but I could not find any useful guide :( -- Regards, David Shen http://about.me/davidshen https://twitter.com/#!/davidshen84
    Xi ShenXi Shen
    Dec 22, 2012 at 2:26 pm
    Dec 26, 2012 at 11:29 pm
  • Hello list, I am entirely new to Lucene and was trying yo get myself familiar with it with the help of tutorial presented here : http://www.lucenetutorial.com/lucene-in-5-minutes.html I was trying to ...
    Mohammad TariqMohammad Tariq
    Dec 4, 2012 at 3:07 pm
    Dec 5, 2012 at 9:45 am
  • Hello, I have been getting the following lock error when attempting to open an index writer to add new documents to an index. org.apache.lucene.store.LockObtainFailedException Lock obtain timed out ...
    Bowden WiseBowden Wise
    Dec 18, 2012 at 6:59 pm
    Dec 19, 2012 at 12:24 pm
  • Greetings, We are using lucene in our log analysis tool. We get data around 35Gb a day and we have this practice of zipping week old indices and then unzip when need arises. Though the compression ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Dec 7, 2012 at 7:33 am
    Dec 11, 2012 at 1:51 pm
  • I’ve been trying to do semi-structured queries & query parsing. In other words, you could have XML snippets mixed in with plain terms, e.g. a query like: christmas tree <store loc=”abc” ...
    Wu, Stephen T., Ph.D.Wu, Stephen T., Ph.D.
    Dec 7, 2012 at 9:47 pm
    Dec 14, 2012 at 2:41 pm
  • Hello, I'm trying to merge 12 indexed into one big index using the Lucene IndexMergeTool (command line used appended below). The merge seemed to finish successfully, but when I ran CheckIndex on the ...
    Tom Burton-WestTom Burton-West
    Dec 5, 2012 at 9:31 pm
    Dec 13, 2012 at 10:22 pm
  • Hi, I am looking to get a bit more information back from SOLR/Lucene about the query/document pair scores. This would include field level scores, overall text relevance score, Boost value, BF value ...
    Vishwas GoelVishwas Goel
    Dec 26, 2012 at 4:31 am
    Dec 26, 2012 at 2:22 pm
  • Hi, I'm following Grant's advice on how to combine BooleanQuery and SpanQuery ...
    Carsten SchnoberCarsten Schnober
    Dec 13, 2012 at 3:49 pm
    Dec 19, 2012 at 4:32 pm
  • I have just downloaded and set up Lucene 4.0.0 to implement a search facility for a web app I'm developing. Creating the index seems to be successful - the files created contain the text that I'm ...
    Ramon CashaRamon Casha
    Dec 18, 2012 at 2:15 pm
    Dec 18, 2012 at 3:06 pm
  • Hi all, I was wanting to use the 4.1 version to access some of the latest improvements, I was hoping to just connect to the maven snapshot repository but it seems that they are not being updated as ...
    Neil IresonNeil Ireson
    Dec 11, 2012 at 1:49 pm
    Dec 11, 2012 at 9:29 pm
  • Hi all, We start to evaluate Lucene 4.0 for using in the production environment. This means that we need to index millions of document with TeraBytes of content and search in it. For now we want to ...
    Vitaly_artemovVitaly_artemov
    Dec 23, 2012 at 11:12 am
    Dec 24, 2012 at 1:30 pm
  • Hi, I would like to be able to display up to multiple millions of lat/lng points on a map, to make this possible my intention is to plot less than 1000 clusters of points by dividing the world into a ...
    Neil IresonNeil Ireson
    Dec 13, 2012 at 7:43 pm
    Dec 19, 2012 at 2:14 am
  • Hi all, I run my code on a cluster where I have to preset resource limits and therefore the processes have limited virtual memory causing OOME when using MMapDirectory on large indexes. This means I ...
    Neil IresonNeil Ireson
    Dec 12, 2012 at 10:32 am
    Dec 12, 2012 at 3:14 pm
  • Hi! I'm using lucene.net, but I'm sure this question is not platform specific. :) I've created an index for a website which uses a central database server and three front-end servers. For now I've ...
    Lars-Erik AabechLars-Erik Aabech
    Dec 10, 2012 at 10:33 am
    Dec 10, 2012 at 12:22 pm
  • I am (also) running lucene unit tests. In the teardown-method(@After) I (try to) delete the complete directory-folder. Unfortunately this does not always work. If not, the file _0_nrm.cfs (or _0.fdx) ...
    Clemens Wyss DEVClemens Wyss DEV
    Dec 9, 2012 at 4:46 pm
    Dec 9, 2012 at 8:57 pm
  • In order to provide suggestions our query also includes a "WildcardQuery with a leading *", which, of course, has a HUGE performance impact :-( E.g. Say we have indexed "vacancyplan", then if a user ...
    Clemens Wyss DEVClemens Wyss DEV
    Dec 7, 2012 at 9:16 am
    Dec 7, 2012 at 12:14 pm
  • I need to send a class containing Lucene elements such as `Query` over the network using EJB and of course this class need to be serialized. I marked my class as `Serializable` but it does not seems ...
    BIAGINI NathanBIAGINI Nathan
    Dec 4, 2012 at 9:34 am
    Dec 4, 2012 at 1:27 pm
  • Hello, I'm still trying to figure out some of the nuances of Lucene and I have run into a small issue. I have created my own custom analyzer which uses the WhitespaceTokenizer and chains together the ...
    Jeremy LongJeremy Long
    Dec 26, 2012 at 2:09 pm
    Dec 27, 2012 at 12:13 am
  • Hello, I am looking for Russian stemmer. Do Lucene have one as well as documentation on how to use it? Please let me know where can I find the Russian stemmer. Thanks! Dima
    DokondrDokondr
    Dec 17, 2012 at 11:19 pm
    Dec 18, 2012 at 12:03 am
  • I'm using trunk to try out DocValues. Directory directory = new RAMDirectory(); IndexWriterConfig iwConfig = new IndexWriterConfig( Version.LUCENE_41, new StandardAnalyzer(Version.LUCENE_41)) ...
    Varun ThackerVarun Thacker
    Dec 12, 2012 at 5:52 pm
    Dec 12, 2012 at 6:19 pm
  • Hi all, I am new with Lucene. I try to understand how can I find the term position. I use following code to index documents: ... IndexWriter writer = new IndexWriter(mIndexDir, mIwc); FileInputStream ...
    Vitaly_artemovVitaly_artemov
    Dec 6, 2012 at 10:30 am
    Dec 7, 2012 at 4:34 pm
  • Hello, I'm having an issue creating a custom analyzer utilizing the WordDelimiterFilter. I'm attempting to create an index of information gleaned from JAR manifest files. So if I have ...
    Jeremy LongJeremy Long
    Dec 23, 2012 at 4:57 pm
    Dec 26, 2012 at 2:15 pm
  • Hello, Please, help. I am lost in TokenStream / Token / Analyzer API. I am trying to figure out how to get _token_itself_ or token text while looking at "Invoking the Analyzer" example (see example ...
    DokondrDokondr
    Dec 25, 2012 at 6:18 pm
    Dec 25, 2012 at 8:17 pm
  • Now, i have a index library with 100 segments. Using forcemerge function can merge all segments into a segment. But I also want the newly generated index library which is written in another disk ...
    Hu JingHu Jing
    Dec 20, 2012 at 1:24 am
    Dec 21, 2012 at 12:52 am
  • Hi. I am trying to make a NGramPhrase query that could tolerate terms missing, so even if one of the NGrams doesn't match it still gets picked up by search. I know I could use the combination of ...
    김한규김한규
    Dec 19, 2012 at 7:36 am
    Dec 20, 2012 at 10:01 am
  • Hello, I am looking for an example of using Tokenizer + Analyzer (in particular org.apache.lucene.analysis.ru.RussianAnalyzer) for standalone stemming. Can't find such an example here ...
    DokondrDokondr
    Dec 18, 2012 at 6:16 pm
    Dec 19, 2012 at 2:22 pm
  • Hello all, I beginning with an application and nobody knows with Lucene-analyzer 3.3.0.jar and Lucene snowball 3.0.1.jar are both included Its do same thing ? I how can I be sure that excluding ...
    Adrien RUFFIEAdrien RUFFIE
    Dec 17, 2012 at 5:41 pm
    Dec 18, 2012 at 7:01 am
  • in lucene 3.0,i can get term doc by using indexreader termdocs. how to implement this in lucene 4.0
    Hu JingHu Jing
    Dec 18, 2012 at 12:19 am
    Dec 18, 2012 at 5:16 am
  • Hi, guys: Does queryplugin implementation impacts caching? I have implemented a new query parser which just take the input query string and return my own query object. But the problem is, when i ...
    LukaiLukai
    Dec 17, 2012 at 5:59 am
    Dec 17, 2012 at 7:06 am
  • If I specify a precisionStep of 26 for a TrieDate field, what rough impact should this have on both performance and index size? The input data has time in it, but the milliseconds per day is not ...
    Jack KrupanskyJack Krupansky
    Dec 14, 2012 at 10:48 pm
    Dec 14, 2012 at 11:11 pm
  • Hi all I'm currently benchmarking Lucene to get an understanding of what optimisations are available for long queries, and wanted to check what the recommended approach is. Unsurprisingly a naive ...
    Matthew WillsonMatthew Willson
    Dec 11, 2012 at 2:20 pm
    Dec 11, 2012 at 6:14 pm
  • Hello there, my colleague and I ran into an example which didn't return the result size which we were expecting. We discovered that there is a mismatch in handling terms while indexing and ...
    Bayer DennisBayer Dennis
    Dec 11, 2012 at 9:52 am
    Dec 11, 2012 at 10:55 am
  • Hi Is it possible to delete a set of documents where they match certain conditions? I would like to delete a set of articles that belong to a given user within a category. Thanks, ----- -- View this ...
    RajashekarRajashekar
    Dec 6, 2012 at 8:36 am
    Dec 6, 2012 at 9:29 am
  • Lucene version: 3.0.3 Does IndexWriter use the analyzer when adding indexes via addIndexesNoOptimize()? What about for optimize()? I am examining some existing code and trying to determine what ...
    Earl HoodEarl Hood
    Dec 5, 2012 at 7:33 am
    Dec 5, 2012 at 10:58 pm
  • Can one replace the basic scoring algorithm (TF/IDF) for a specific field, to use a different one? I need to compute similarity for NAME field. The regular TF/IDF is not good enough, and I want to ...
    Eyal Ben MeirEyal Ben Meir
    Dec 1, 2012 at 7:16 pm
    Dec 2, 2012 at 4:36 pm
  • I just found out about the blocktree implementation and how it is used to increase the speed of prefix search. Have you tried to use it for spatial search? I will explain to you how i think it will ...
    Apostolis XekoukoulotakisApostolis Xekoukoulotakis
    Dec 22, 2012 at 3:56 pm
    Jan 18, 2013 at 3:46 pm
  • We have a section of code that does the text highlight, it was running fine under lucene 1.9.1, but we are getting the following error after upgrade to 3.6.1. It does not seem to be anything that we ...
    Bin LanBin Lan
    Dec 27, 2012 at 8:40 pm
    Dec 27, 2012 at 9:25 pm
  • Dear all, We are moving from Lucene 2.3 to 4.1. For the migration, we use the IndexUpgrader class in org.apache.lucene.index of lucene 3.6. And then migrate it to 4.0 using the 4.0 IndexUpgrader ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Dec 26, 2012 at 2:52 pm
    Dec 26, 2012 at 2:56 pm
  • I am trying to index some documents in 3.6.1 using text_ja field and copying it to string field so that I can do an exact match on copy field, but it seems that all the documents are not getting ...
    Khem_99Khem_99
    Dec 23, 2012 at 2:46 am
    Dec 26, 2012 at 2:11 pm
  • Hi, I want to be able to count the number of times a certain character appears in the page, and then add that number as a Field to the document itself. I've been unable to fund a way to do this ...
    Itai PelegItai Peleg
    Dec 22, 2012 at 11:55 am
    Dec 22, 2012 at 11:55 am
  • Hello, I have been getting the following lock error when attempting to open an index writer to add new documents to an index. org.apache.lucene.store.LockObtainFailedException Lock obtain timed out ...
    Bowden WiseBowden Wise
    Dec 19, 2012 at 8:15 pm
    Dec 19, 2012 at 8:49 pm
  • Hi, Found some typos. In package description: http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/analysis/package-summary.html?is-external=true#package_description In the example code bellow ...
    DokondrDokondr
    Dec 18, 2012 at 7:32 pm
    Dec 18, 2012 at 10:40 pm
  • Hi all, I use Lucene 4.0. I try to find offsets for phrase queries. My code works then I search for one word but then I call it for some phrase I didn't get offsets. termsEnum.seekExact returns false ...
    Vitaly_artemovVitaly_artemov
    Dec 17, 2012 at 4:48 pm
    Dec 18, 2012 at 3:15 pm
  • Hi all, I don't know that how to update one field which is not stored of an document in lucene 4.0. Can anybody tell me? Thanks! Cheers, --- Bob ...
    Bo ZhangBo Zhang
    Dec 18, 2012 at 9:15 am
    Dec 18, 2012 at 10:06 am
  • java-user-subscribe
    DokondrDokondr
    Dec 17, 2012 at 10:48 pm
    Dec 17, 2012 at 10:50 pm
  • Hello, Any 'tentative' release date for 4.1 would help. I know it is difficult pointing a date, but still couldn't resist asking, for we could plan accordingly. Thanks in advance. -- With Thanks and ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Dec 12, 2012 at 11:51 am
    Dec 12, 2012 at 5:22 pm
  • Hi, I wonder if there is a way to use a SpanQuery to find documents with fields that end with a certain term. Kind of the oppoisite of SpanFirstQuery, i.e. "SpanLastQuery", if you want. What I would ...
    Hasenberger, JosefHasenberger, Josef
    Dec 11, 2012 at 3:21 pm
    Dec 11, 2012 at 4:05 pm
  • I have a CustomAnalyzer which overrides "public final TokenStream tokenStream ( String fieldName, Reader reader )": @Override public final TokenStream tokenStream ( String fieldName, Reader reader ) ...
    Clemens Wyss DEVClemens Wyss DEV
    Dec 9, 2012 at 1:16 pm
    Dec 10, 2012 at 10:03 am
  • Hi all, I'm implementing an approach of mixture of language models in Lucene 4.0.0. Here is a little math to be precise: The ranking score for query q with t terms: p(q | \theta) = \prod_{t \in q} ...
    Nikita ZhiltsovNikita Zhiltsov
    Dec 10, 2012 at 5:36 am
    Dec 10, 2012 at 5:51 am
Group Navigation
period‹ prev | Dec 2012 | next ›
Group Overview
groupjava-user @
categorieslucene
discussions62
posts246
users78
websitelucene.apache.org

78 users for December 2012

Jack Krupansky: 13 posts Ian Lea: 12 posts Carsten Schnober: 10 posts Dokondr: 9 posts Michael McCandless: 9 posts Neil Ireson: 9 posts Steve Rowe: 9 posts Uwe Schindler: 9 posts Ramprakash Ramamoorthy: 8 posts Xi Shen: 8 posts Vitaly_artemov: 7 posts Bowden Wise: 6 posts Clemens Wyss DEV: 6 posts Lukai: 6 posts Robert Muir: 6 posts Lance Norskog: 5 posts Mohammad Tariq: 5 posts BIAGINI Nathan: 4 posts Erick Erickson: 4 posts Glen Newton: 4 posts
show more
Archives