FAQ

Search Discussions

96 discussions - 465 posts

  • Hi, I'm working with Lucene 2.4.0 and the JVM (JDK 1.6.0_07). I'm consistently receiving "OutOfMemoryError: Java heap space", when trying to index large text files. Example 1: Indexing a 5 MB text ...
    Paul_murdochPaul_murdoch
    Aug 31, 2009 at 2:29 pm
    Jan 29, 2010 at 12:28 pm
  • hello all thanks for lucene............ this is my doubt , am searching for a keyword "about us" from my lucene index , am not getting the results what i want , since the urls are formed like the ...
    M.harigM.harig
    Aug 4, 2009 at 5:32 am
    Aug 5, 2009 at 4:19 am
  • Hi all, I am trying to tune Lucene to respect such tokens like C++, C#, .NET The task is known for Lucene community, but surprisingly I can't google out somewhat good info on it. Of course, I tried ...
    ValeryValery
    Aug 20, 2009 at 2:28 pm
    Aug 21, 2009 at 12:43 pm
  • Hi, Since moving our app to Java 6 and Tomcat 6, we have started getting occasional exceptions of the form: java.io.IOException: Stream closed at sun.nio.cs.StreamDecoder.ensureOpen(Unknown Source) ...
    Chris BamfordChris Bamford
    Aug 27, 2009 at 2:12 pm
    Sep 15, 2009 at 5:38 pm
  • Hi, I am trying to index documents and when all is complete and optimize is called I get IFD [main]: setInfoStream deletionPolicy=org.apache.lucene.index.KeepOnlyLastCommitDeletionPolicy@4fced0 IW 0 ...
    RishisinghalRishisinghal
    Aug 13, 2009 at 9:19 am
    Aug 16, 2009 at 5:56 pm
  • Hi I'd like to extend Lucene's FieldCache such that it will read native values from a different place (in my case, payloads). That is, instead of iterating on a field's terms and parsing each String ...
    Shai EreraShai Erera
    Aug 20, 2009 at 11:50 am
    Sep 10, 2009 at 5:03 pm
  • Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the second release candidate for Lucene 2.9. Please ...
    Mark MillerMark Miller
    Aug 28, 2009 at 7:03 pm
    Sep 9, 2009 at 2:34 pm
  • Hi, In my indexer app (based on the IndexFiles.java demo), I am adding the "path" field: doc.add(new Field("path", f.getPath(), Field.Store.YES, Field.Index.ANALYZED)); Per Luke, the full path (e.g., ...
    OhayaOhaya
    Aug 6, 2009 at 8:04 pm
    Aug 7, 2009 at 1:22 pm
  • Hi, I've a single index of size 87GB containing around 50M documents. When I search for any query, best search time I observed was 8sec. And when query is expanded with synonyms, search takes minutes ...
    Prashant ullegaddiPrashant ullegaddi
    Aug 3, 2009 at 4:34 am
    Aug 4, 2009 at 1:18 pm
  • Hi all! I'm currently running a big lucene index and one of my main concerns is the integrity of the data entered. A few things come to mind, like enforcing that certain fields be non-blank, forcing ...
    Daniel ShaneDaniel Shane
    Aug 13, 2009 at 2:34 pm
    Aug 26, 2009 at 10:37 pm
  • Hi, I'm starting to work on an app to list all of the terms in the "path" field. I'm including the beginning of my code below. When I run this, pointing it to a directory named "index" containing the ...
    OhayaOhaya
    Aug 2, 2009 at 2:10 am
    Aug 2, 2009 at 11:29 pm
  • Hi I am getting this issue in Lucene2.4 when I try to merge multiple IndexWriters(generally 6) sh-3.2# Exception in thread "Lucene Merge Thread #5" org.apache.lucene.index.MergePolicy$MergeException: ...
    Sumanta BhowmikSumanta Bhowmik
    Aug 20, 2009 at 7:44 am
    Aug 28, 2009 at 7:39 am
  • Hi, I am trying to build a query that looks like the following: url:(+news +politics)^1.5 content:(+news +politics)^2.0 But I can't seems to find any reference to it. I try hardcoding it like the ...
    Bourne71Bourne71
    Aug 12, 2009 at 9:09 am
    Aug 13, 2009 at 1:38 pm
  • Hi, I'm using the SnapshotDeletionPolicy class to backup my index. I basically call the snapshot() method from the class SnapshotDeletionPolicy at some point, get a list of files that changed, copy ...
    Lucas Nazário dos SantosLucas Nazário dos Santos
    Aug 14, 2009 at 6:48 pm
    Aug 18, 2009 at 1:52 pm
  • Hey there, We're trying to add foreign language support into our new search engine -- languages like Arabic, Farsi, and Urdu (that don't work with standard analyzers). But our data source doesn't ...
    Bradford StephensBradford Stephens
    Aug 6, 2009 at 7:46 pm
    Aug 10, 2009 at 7:00 pm
  • Hi want the query "R.E.S" to match "R.E.S" I use StandardFilter in my analyzer below and the description says: 'Splits words at punctuation characters, removing punctuation. However, a dot that's not ...
    Paul TaylorPaul Taylor
    Aug 6, 2009 at 2:03 pm
    Aug 6, 2009 at 8:13 pm
  • Hi, I've indexed some 50million documents. I've indexed the target URL of each document as "url" field by using StandardAnalyzer with index.ANALYZED. Suppose, there is a wikipedia page with ...
    Prashant ullegaddiPrashant ullegaddi
    Aug 2, 2009 at 10:28 am
    Aug 2, 2009 at 6:39 pm
  • Hi I would like to contribute/help in the development of Lucene and I'm not sure where to start. I understand Lucene is a mature project with some really great contributors and I was wondering ...
    Amin Mohammed-ColemanAmin Mohammed-Coleman
    Aug 12, 2009 at 9:51 am
    Aug 13, 2009 at 7:59 am
  • Hi, I've noticed a kind of strange problem with term counts and actual terms. Some background: I wrote an app that creates an index, including a "path" field. I am now working on an app (code was in ...
    OhayaOhaya
    Aug 2, 2009 at 8:32 am
    Aug 2, 2009 at 7:29 pm
  • I already know about this, but I want to give a customized score for all documents in collection, independent if wache document is or isn't relevant to the vector model. The similarity function is ...
    Fabrício RaphaelFabrício Raphael
    Aug 25, 2009 at 3:03 pm
    Aug 28, 2009 at 6:59 pm
  • You mean that you do not need score calculation therefore you do not want results sorted by relevancy. Just you need is a Boolean Retrieval Model, right? All results will have ConstantScore (0 or 1). ...
    AHMET ARSLANAHMET ARSLAN
    Aug 22, 2009 at 11:45 am
    Aug 24, 2009 at 4:41 pm
  • Hi, I am trying to make a decision on weather or not I can use Lucene for my requirements, which mainly include data tagging. I have to be able to parse or index a .txt file and then be able to ...
    xs2Abhishekxs2Abhishek
    Aug 11, 2009 at 9:28 pm
    Aug 12, 2009 at 7:33 pm
  • Hi, I have an app to initially create a Lucene index, and to populate it with documents. I'm now working on that app to insert new documents into that Lucene index. In general, this new app, which is ...
    OhayaOhaya
    Aug 4, 2009 at 3:40 pm
    Aug 4, 2009 at 5:30 pm
  • Hello, I have question about KEYWORD type and searching/updating. I am getting strange behavior that I can't quite comprehend. My index is created using standard analyzer, which used for writing and ...
    Leonard GestrinLeonard Gestrin
    Aug 3, 2009 at 2:44 am
    Aug 4, 2009 at 2:40 pm
  • I've built a Lucene Directory implementation for jdbm, an embedded Java database. Part of the Directory API are two methods related to "file" modification dates: touchFile and fileModified. My ...
    CemerickCemerick
    Aug 25, 2009 at 2:57 pm
    Sep 2, 2009 at 2:36 am
  • Hi there, I wonder if someone can help? We have a successful Lucene app deployed on Tomcat which works well. As far as we can tell, our developers have observed all the guidelines in the Lucene FAQ, ...
    Chris BamfordChris Bamford
    Aug 26, 2009 at 1:19 pm
    Aug 27, 2009 at 4:00 pm
  • Hi, How can I get the score of a span that is the result of SpanQuery.getSpans() ? The score should can be the same for each document, but if it's unique per span, it's even better. I tried looking ...
    Eran SeviEran Sevi
    Aug 2, 2009 at 3:30 pm
    Aug 26, 2009 at 1:47 pm
  • Hi, This question is going to be a little complicated to explain, but let me try. I have implemented an indexer app based on the demo IndexFiles app, and a web app based on the luceneweb web app for ...
    OhayaOhaya
    Aug 20, 2009 at 9:36 pm
    Aug 21, 2009 at 4:30 am
  • Hi, I have been seeing an issue running MatchAllDocsQueries concurrently. Running one against a test index is very fast (70 ms). Running two concurrently can take 5-25 seconds on the same test index! ...
    Carl AustinCarl Austin
    Aug 6, 2009 at 12:21 pm
    Aug 6, 2009 at 1:09 pm
  • Hi, I was trying to download a nightly build jar, so I went to Lucene website and clicked on the link that redirected to: http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/ and I got a ...
    Adriano CrestaniAdriano Crestani
    Aug 4, 2009 at 9:40 pm
    Aug 6, 2009 at 5:28 am
  • I just want to see if it's safe to use two different analyzers for the following situation: I have an index that I want to preserve case with so I can do case-sensitive searches with my ...
    Max LynchMax Lynch
    Aug 12, 2009 at 3:20 am
    Dec 30, 2009 at 11:36 pm
  • Hi Lucene users, at the moment I have some problems with the locking mechanism of IndexWriter. Some times my application quits/terminates before I can close the IndexWriter. Then the "write.lock" ...
    Jan Peter StotzJan Peter Stotz
    Aug 30, 2009 at 5:25 pm
    Aug 30, 2009 at 6:34 pm
  • The Problem: periodically we see thousands of files get created from an IndexWriter in a Java process in a very short period of time. Since we started trying to track this, we saw an index go from ...
    Micah JaffeMicah Jaffe
    Aug 18, 2009 at 1:31 am
    Aug 27, 2009 at 11:29 pm
  • I was wondering if there is any way to directly use Lucene API to extract terms from a given string. My requirement is that I have a text document for which I need a term frequency vector ( after ...
    Joe_coderJoe_coder
    Aug 13, 2009 at 11:41 am
    Aug 13, 2009 at 1:28 pm
  • I just happened to benchmark a little modified version of lucene with a little modified version of sphinx :) Have posted my results here http://ai-cafe.blogspot.com Would also be updating more @ the ...
    AnshumAnshum
    Aug 13, 2009 at 4:43 am
    Aug 13, 2009 at 11:24 am
  • We periodically optimize large indexes (100 - 200gb) by calling IndexWriter.optimize(). It takes a heck of a long time, and I'm wondering if a more efficient solution might be the following: - Create ...
    NigelNigel
    Aug 5, 2009 at 4:15 pm
    Aug 11, 2009 at 3:34 pm
  • Hi, I am a newbie in lucene and am trying the 'indexing and searching' demo of lucene 1.4.3 using kaffe 1.0.6. After inputing the query, an error occurs as follows: Query: stringSearching for: string ...
    石川石川
    Aug 11, 2009 at 3:45 am
    Aug 11, 2009 at 8:29 am
  • hello all, thanks to lucene. Am using lucene 2.4.0 for my application. My doubt is , can i read the index for many number of times? i mean , i've a search application which reads the index , which is ...
    M.harigM.harig
    Aug 7, 2009 at 11:11 am
    Aug 8, 2009 at 7:19 am
  • We have an IndexWriter.optimize running on 4 Proc Xenon Java 1.5 Win2003 machine. We get a repeatable FileNotFoundException because the path to the file is wrong: ...
    Uwe GoetzkeUwe Goetzke
    Aug 31, 2009 at 3:40 pm
    Aug 31, 2009 at 10:40 pm
  • Hello all, I am having some content with text "attention". If is search using "att*", "attent*", the results are displayed. If i search for "attenti*" then no results are displayed. I am using ...
    GaneshGanesh
    Aug 31, 2009 at 6:07 am
    Aug 31, 2009 at 10:32 am
  • While indexing with the latest nightly build of Solr on Amazon EC2 the following JVM bug has occurred twice on two different servers. Post the log to a Jira issue? java version "1.6.0_07" Java(TM) SE ...
    Jason RutherglenJason Rutherglen
    Aug 28, 2009 at 9:57 pm
    Aug 28, 2009 at 10:48 pm
  • In the free first chapter of the new Lucene in Action book, it states that it's targetting Lucene 3.0, but on the Manning page for the book, it says the code in the book is written for 2.3. I'm ...
    TsuraanTsuraan
    Aug 26, 2009 at 9:58 pm
    Aug 27, 2009 at 3:29 pm
  • When running lucene, on a machine with a firewall, I got the following error message, which I think it must be related to the firewall. In fact, when I shut down the firewall, the error dissapears. ...
    David de la TorreDavid de la Torre
    Aug 26, 2009 at 8:03 am
    Aug 26, 2009 at 9:28 am
  • Hello, I'm trying to write a custom scorer that only uses the term frequency function from the DefaultSimilarity class, the problem is that documents with lower frequencies are returning with higher ...
    Chris SalemChris Salem
    Aug 19, 2009 at 8:20 pm
    Aug 20, 2009 at 1:30 pm
  • Hello, We're experiencing a problem using Lucene 2.4.1 and Compass 2.1.4 using wildcard search. Attribute values containing slashes can be searched using the full word, but not using wildcards. We ...
    Ueli KistlerUeli Kistler
    Aug 13, 2009 at 12:19 pm
    Aug 13, 2009 at 8:53 pm
  • I have a situation where I have a series of terms queries as part of a BooleanQuery. example: term: 'sole type' - leather BooleanClause.SHOULD_OCCURR term: 'title' - 'Men's Golf shoes' ...
    Christian BongiornoChristian Bongiorno
    Aug 12, 2009 at 8:01 pm
    Aug 13, 2009 at 7:37 am
  • Hi, I am fairly new to Lucene and have encounter a problem with the search function i am trying to create using Lucene. When I search, lets say "news sharing", then the results return and display. ...
    Bourne71Bourne71
    Aug 11, 2009 at 8:51 am
    Aug 12, 2009 at 9:09 am
  • I saw some discussion on the board but I'm not sure I've got quite the same problem. As an example, I have a query that might be a technical skill: SAP EM FIN AM I would like that to match a document ...
    Donna L GreshDonna L Gresh
    Aug 7, 2009 at 2:35 pm
    Aug 7, 2009 at 4:15 pm
  • Hello, when searching over multiple indices, we create one IndexReader for each index, and wrap them into a MultiReader, that we use for IndexSearcher creation. This is fine for searching multiple ...
    Christian ReuschlingChristian Reuschling
    Aug 4, 2009 at 9:50 am
    Aug 5, 2009 at 7:59 am
  • Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the first release candidate for Lucene 2.9. Please ...
    Mark MillerMark Miller
    Aug 27, 2009 at 10:18 pm
    Aug 28, 2009 at 5:08 pm
Group Navigation
period‹ prev | Aug 2009 | next ›
Group Overview
groupjava-user @
categorieslucene
discussions96
posts465
users107
websitelucene.apache.org

107 users for August 2009

Shai Erera: 30 posts Ohaya: 28 posts Michael McCandless: 27 posts Simon Willnauer: 24 posts Grant Ingersoll: 15 posts Phil Whelan: 15 posts Mark Miller: 14 posts Ian Lea: 11 posts Prashant ullegaddi: 11 posts AHMET ARSLAN: 10 posts Anshum: 9 posts Bourne71: 9 posts M.harig: 9 posts Rishisinghal: 9 posts Valery: 9 posts Robert Muir: 8 posts Ganesh: 7 posts Paul Taylor: 7 posts Sumanta Bhowmik: 7 posts Bradford Stephens: 6 posts
show more
Archives