48 discussions - 203 posts

  • Hi, My apps need to read from and write to some big indexes frequently. So I use RAMDirectory instead of FSDirectory, and give JVM about 2GB memory size. I notice that the speed of reading and ...
    Cheng
    Jun 4, 2012 at 2:08 pm
    Jun 30, 2012 at 10:25 am
  • hi all I need to return certain fields of all matched documents quickly. I am now using Document.get(field), but the performance is not good enough. Originally I used a HashMap to store these fields ...
    Li Li
    Jun 20, 2012 at 12:49 pm
    Jun 25, 2012 at 5:07 pm
  • hi I have strings like "drinks - water" and I've read in "Lucene in Action" that the StandardAnalyzer and other analyzers remove the "-" from the string, but so far none of them worked... All of them ...
    Listas
    Jun 25, 2012 at 1:41 am
    Jun 26, 2012 at 3:26 am
  • Hello, I have checked out lucene 3.6 and I am trying to run the ant jflex. It is throwing a Stackoverflow error when it is trying to execute the target: jflex-UAX29URLEmailTokenizer. Any idea why ...
    Bin01
    Jun 14, 2012 at 12:59 am
    Jun 15, 2012 at 12:32 am
  • Hi, I'm trying to brush up on some of the *cough* newer APIs (we've been using 2.9.2 up until now). Anyway, I have the test below, which is a modified version of one of the tests in Lucene In Action, but ...
    Brendan Grainger
    Jun 28, 2012 at 11:06 pm
    Jun 29, 2012 at 3:28 am
  • Hi, I have a question about sorting in Lucene. I have 10000 documents in my index, each with fields A, B, C, D. I want the first 100 results of my query, but they must be sorted by field A. Suppose I have query ...
    Yogesh patel
    Jun 26, 2012 at 3:06 am
    Jun 28, 2012 at 11:02 pm
  • Hi there, I have to index Chinese content and I don't get the expected results when searching. It seems that the WildcardQuery does not work properly with Chinese characters. See attached sample ...
    Paco Avila
    Jun 27, 2012 at 10:20 am
    Jun 28, 2012 at 7:38 am
  • Hi everybody, I'm using Lucene 3.6 to index Wikipedia documents, which is over 3 million articles. The data is in a MySQL database and it is taking more than 24 hours so far. Do you know any tips that can ...
    Elshaimaa Ali
    Jun 19, 2012 at 7:08 pm
    Jun 20, 2012 at 12:43 am
  • http://lucene.apache.org/core/3_6_0/fileformats.html#Frequencies The .frq file contains the lists of documents which contain each term, along with the frequency of the term in that document (except ...
    Wangjing
    Jun 27, 2012 at 9:40 am
    Jun 28, 2012 at 5:00 am
  • Hi, We have used lucene 2.3.2 successfully for years (yes, we should upgrade). Recently we encountered a data corruption error when committing the IndexWriter: /// background merge hit exception: _14b:c61262 ...
    Zhang, Lisheng
    Jun 30, 2012 at 8:47 pm
    Jun 30, 2012 at 10:21 pm
  • I'm quite new to Lucene and recently, I ran into a problem. I have a lucene document that looks like this: --- type --- gene --- id --- xla:379474 --- alt_id --- emb:BC054227 gb:BC054227 ...
    Secevalliv
    Jun 25, 2012 at 1:02 pm
    Jun 26, 2012 at 11:02 am
  • Hello all, I am trying to write a simple autosuggest feature. I was looking at some auto suggest code, and came across this post ...
    Mansour Al Akeel
    Jun 22, 2012 at 10:26 pm
    Jun 24, 2012 at 7:42 am
  • hi, does anyone know how to extract meaningful words from a Lucene index?
    齐保元
    Jun 26, 2012 at 9:40 am
    Jun 27, 2012 at 10:25 pm
  • Our Hit highlighting (Using the older Highlighter) is wired with a "too huge" limit, so we could skip the multi-million character files, not just for highlighter.setMaxDocCharsToAnalyze, but if a ...
    Paul Hill
    Jun 22, 2012 at 7:24 pm
    Jun 25, 2012 at 5:17 pm
  • Hi, I am getting the following OOM consistently whenever the index is opened. Is it because the index is now holding too many terms? Our application (that has Lucene 2.9.3) already has reached ...
    Nishesh Gupta
    Jun 1, 2012 at 11:52 pm
    Jun 6, 2012 at 3:58 am
  • Dear, I am using Lucene for my log search tool. Is there a way I can automatically perform a commit operation on my IndexWriter when a particular set of docs is flushed from memory to the disk? My ...
    Ramprakash Ramamoorthy
    Jun 27, 2012 at 5:55 am
    Jun 28, 2012 at 6:54 am
  • <EOM --- To unsubscribe, e-mail: ...
    Deshpande, Vikas
    Jun 25, 2012 at 2:00 pm
    Jun 25, 2012 at 4:32 pm
  • Hello everyone, I am having a problem with a lucene store. When starting an IndexWriter on it, it throws the following exception: Caused by: java.io.IOException: read past EOF ...
    Chris Gioran
    Jun 19, 2012 at 2:51 pm
    Jun 19, 2012 at 11:45 pm
  • I've found the class WordnetSynonymParser in org.apache.lucene.analysis.synonym, but there are no examples of its usage either in the API docs or on Google. Does anyone have experience with it? Thank ...
    Kits89
    Jun 15, 2012 at 3:10 pm
    Jun 18, 2012 at 3:53 pm
  • As others have previously proposed on this list, I am interested in inserting a second token at some positions in my index. I'll call this Limited Index Expansion. I want to retain the original ...
    Paul Hill
    Jun 12, 2012 at 7:07 pm
    Jun 12, 2012 at 11:57 pm
  • Hi all, This is driving me crazy. In my data if I search "state" AND "GA" I get hits. If I search "state" AND "OR" or "state" AND "IN" I get no hits even though I can see examples of state AND IN in ...
    Bob Rhodes
    Jun 7, 2012 at 5:50 pm
    Jun 7, 2012 at 11:55 pm
  • The .fdx file contains, for each document, a pointer to its field data. BUT the fdx contains a pointer to WHAT? Is it a pointer to the field data offset in the .fdt file? my app is File file = new File(path) ...
    Wangjing
    Jun 25, 2012 at 3:28 am
    Jun 26, 2012 at 2:43 am
  • I imagine this is a question that comes up from time to time, but I haven't been able to find a definitive answer anywhere, so... I'm wondering whether there is some type of Lucene query that filters ...
    Mike Sokolov
    Jun 16, 2012 at 6:34 pm
    Jun 18, 2012 at 2:58 am
  • Hi, I'm currently reading "Lucene in Action (2nd edition)". On page 105, section 3.5.4, I'm reading the following paragraph: --- QueryParser won’t create a NumericRangeQuery for you. This is because ...
    Jochen Hebbrecht
    Jun 12, 2012 at 7:30 am
    Jun 12, 2012 at 11:26 am
  • Hi, Is there a safe way to forcefully close an IndexWriter that is unable to flush to disk? We're seeing occasional issues where an IndexWriter encounters an IOException on close and does not release ...
    Geoff Cooney
    Jun 4, 2012 at 2:00 pm
    Jun 4, 2012 at 6:51 pm
  • Based on this link http://www2002.org/CDROM/refereed/643/node6.html , I'm calculating Okapi similarity between the query document and another document as below using Lucene: I have indexed the ...
    Kasun Perera
    Jun 19, 2012 at 9:57 am
    Jul 17, 2012 at 3:14 am
  • Hi, Suppose we have a query "balcony table". I want results to be returned by exact match (first priority) and by single words matching as well (for "balcony" or for "table"). So currently my ...
    Sxam
    Jun 30, 2012 at 8:55 pm
    Jun 30, 2012 at 9:46 pm
  • All, I have a question about join support across multiple document types in Solr/Lucene. Let me lay out the use case. Suppose I have 3 tables: * Table A has 3 columns, id, a1, a2. * Table B has 4 ...
    Frank DeRose
    Jun 29, 2012 at 7:14 pm
    Jun 29, 2012 at 8:47 pm
  • I'm a beginner with Lucene. I have one big txt file containing image ids with their tag lists (some containing foreign characters; anyway, only tags written in English will be used for my application) ...
    Kjysmu
    Jun 27, 2012 at 5:06 am
    Jun 27, 2012 at 9:19 am
  • I'm a newcomer and curious about the inverted index. Can anyone show a sample with a dataset that shows how Lucene works? Thanks.
    DEW¤
    Jun 20, 2012 at 4:11 am
    Jun 20, 2012 at 4:16 am
  • I want to calculate the average document length for a document collection in which each document has 3 different fields (field1, field2, field3). This is the program to calculate average length when only one ...
    Kasun Perera
    Jun 18, 2012 at 3:19 am
    Jun 19, 2012 at 9:40 am
  • Hi all, I'm searching for a way to reuse a Lucene search. For example, I'm searching for the word "acci". But too many ScoreDocs are returned, and I provide: "accide". Can it reuse the existing ...
    Jochen Hebbrecht
    Jun 14, 2012 at 12:19 pm
    Jun 14, 2012 at 3:36 pm
  • I got an OutOfMemoryError when I tried to open a Lucene index. It's very weird, since this is only seen when I run this inside an Apache PIG LATIN script on a particular hadoop cluster of ours, and ...
    Yang
    Jun 13, 2012 at 9:16 pm
    Jun 13, 2012 at 11:20 pm
  • Hello, I've read the documentation about the TieredMergePolicy class. But I just can't figure out what this sentence is trying to state: [..] For normal merging, this policy first computes a "budget" ...
    Thomas
    Jun 12, 2012 at 8:44 am
    Jun 12, 2012 at 4:01 pm
  • I noticed today that my code calls IndexSearcher.search(Query query, Filter filter, Collector collector). But I also noticed that the docs say "Applications should only use this if they need all of ...
    Paul Hill
    Jun 8, 2012 at 5:33 pm
    Jun 8, 2012 at 5:41 pm
  • Hey guys, I'm trying to index nested documents in lucene 3.6. I have the parent document having a 'type' and 'typename' fields and the children having 'value' and 'author' fields. The below snippet ...
    Ananth V
    Jun 8, 2012 at 10:04 am
    Jun 8, 2012 at 10:42 am
  • I was looking at the Lucene API for IndexCommit and noticed that the JavaDoc states that *'Decision that a commit-point should be deleted is taken by the ...
    Colin Goodheart-Smithe
    Jun 6, 2012 at 11:16 am
    Jun 6, 2012 at 11:38 am
  • Apologies for the short notice guys, we're meeting up at The Plough in Bloomsbury on Wednesday 6th June. As usual the format is open and there's a healthy mix of experience and backgrounds. Please ...
    Richard Marr
    Jun 2, 2012 at 11:30 pm
    Jun 5, 2012 at 3:42 pm
  • Did you find any solution for this? I am looking for a similar solution; please let me know if you found any useful info regarding fuzzy phrase search in Lucene. Thanks & Regards, Harish B.N. Lead ...
    Harish Bn
    Jun 1, 2012 at 2:49 pm
    Jun 1, 2012 at 4:05 pm
  • Hi kjysmu, I moved the discussion to java-user@lucene instead of dev@lucene since your question is not related to Lucene development. http://people.apache.org/~hossman/#java-user To understand how to ...
    Adrien Grand
    Jun 26, 2012 at 4:25 pm
    Jun 26, 2012 at 4:25 pm
  • CommonGrams provides a neat trick for optimizing slow phrase queries that contain common words. (E.g. Hathi Trust has some ...
    Chris Harris
    Jun 22, 2012 at 12:08 am
    Jun 22, 2012 at 12:08 am
  • I am trying to use this class and add my synonym list in a synonyms.properties file. File content: car auto car machine car automobile. But results obtained are only for the last synonym specified, i.e ...
    Blunderboy
    Jun 14, 2012 at 5:41 pm
    Jun 14, 2012 at 5:41 pm
  • Hi, In CarmelTopKTermPruningPolicy class, the threshold is calculated as follows: *float threshold = docs[k - 1].score - scoreDelta;* docs[k - 1].score corresponds to z_t in the original paper ...
    Zeynep P.
    Jun 12, 2012 at 2:57 pm
    Jun 12, 2012 at 2:57 pm
  • Let's suppose that we make a query with multiple terms. Lucene creates a topScoreDocsCollector with an in-order traversal of posting lists. Let's suppose we are in a specific segment, since we use a ...
    Apostolis Xekoukoulotakis
    Jun 12, 2012 at 10:43 am
    Jun 12, 2012 at 10:43 am
  • Hi, Best Buy is building new Search Platform/Eco-System powered by Lucene/Solr. We are hiring multiple Lucene/Solr engineers, tech leads, and architects, both full-time and consulting based in ...
    SV
    Jun 7, 2012 at 12:11 pm
    Jun 7, 2012 at 12:11 pm
  • you can use aggregation for that. dump a collection of prices as a field with multiple values into a document //pseudo-code def doc = new Document(...) doc.add new Field( 'id', id ) doc.add new ...
    Konstantyn Smirnov
    Jun 6, 2012 at 8:07 am
    Jun 6, 2012 at 8:07 am
  • Hi, We are hiring multiple Lucene/Solr engineers, tech leads, architects based in Minneapolis - both full time and consulting for developing new search platform. Please reach out to me - ...
    SV
    Jun 6, 2012 at 4:52 am
    Jun 6, 2012 at 4:52 am
  • Hi, We are hiring multiple Lucene/Solr engineers, tech leads, architects based in Minneapolis - both full time and consulting for developing new search platform. Please reach out to me - ...
    SV
    Jun 1, 2012 at 9:07 pm
    Jun 1, 2012 at 9:07 pm
Group Overview
group: java-user @
categories: lucene
discussions: 48
posts: 203
users: 67
website: lucene.apache.org

67 users for June 2012

  • Jack Krupansky: 16 posts
  • Michael McCandless: 14 posts
  • Li Li: 13 posts
  • Paul Hill: 11 posts
  • Ian Lea: 10 posts
  • Uwe Schindler: 9 posts
  • Cheng: 7 posts
  • Wangjing: 6 posts
  • Listas: 5 posts
  • Brendan Grainger: 5 posts
  • Mike Sokolov: 5 posts
  • Ilya Zavorin: 4 posts
  • Robert Muir: 4 posts
  • Zhang, Lisheng: 4 posts
  • Secevalliv: 3 posts
  • Apostolis Xekoukoulotakis: 3 posts
  • Elshaimaa Ali: 3 posts
  • Jochen Hebbrecht: 3 posts
  • Kasun Perera: 3 posts
  • Mansour Al Akeel: 3 posts