FAQ

Search Discussions

105 discussions - 582 posts

  • Mike, we are definitely hitting something with this one! we had report from our QA chaps that our servers got stuck (limit is on 180 Seconds Request)... We are on average 14 Requsts per second.... ...
    Eks devEks dev
    Jul 14, 2009 at 10:25 pm
    Jul 17, 2009 at 12:16 am
  • Hi, We have a very large lucene index that we're developing that has a field of email addresses. (Actually mulitple fields with multiple emails addresses, but I'll simplify here) Each document will ...
    Phil WhelanPhil Whelan
    Jul 30, 2009 at 6:12 pm
    Aug 11, 2009 at 11:21 pm
  • Hi, I'm trying to use Lucene to query a domain that has the following structure Student 1-------* Attendance *---------1 Course The data in the domain is summarised below Course.name ...
    Donal MurtaghDonal Murtagh
    Jul 29, 2009 at 8:42 pm
    Jul 30, 2009 at 10:15 pm
  • Hi All! Let say I have a filter that produces new tokens based on the original ones. How bad will it be if my filter sets the start of each token to 0 and end to the length of a token? An example ...
    OBenderOBender
    Jul 20, 2009 at 2:41 pm
    Jul 20, 2009 at 9:07 pm
  • I was trying to do a regex search with the lucene and JavaUtilRegexCapabilities. The code used is : RegexQuery query = new RegexQuery(new Term("contents","(?m)hello.*(\r[^#]*)This is to be ...
    Ba3Ba3
    Jul 26, 2009 at 1:52 pm
    Jul 29, 2009 at 5:14 pm
  • Hi, I'm just starting to work with Lucene, and I guess that I learn best by working with code, so I've started with the demos in the Lucene distribution. I got the IndexFiles.java and IndexHTML.java ...
    OhayaOhaya
    Jul 28, 2009 at 12:55 pm
    Jul 28, 2009 at 6:13 pm
  • hello all We've got 100GB of data which has doc,txt,pdf,ppt,etc.., we've separate parser for each file format, so we're going to index those data by lucene. (since we scared of Nutch setup , thats ...
    M.harigM.harig
    Jul 22, 2009 at 6:07 am
    Jul 23, 2009 at 8:26 am
  • I want to store in the index a data structure and load it back at search time. Is it safe to serialize the java object store it and load it back later ? Presumably I need to store it binary, right ? ...
    MilleBiiMilleBii
    Jul 3, 2009 at 7:32 pm
    Jul 4, 2009 at 8:52 am
  • If I understand lucene correctly, when doing multiple simultaneous searches on the same IndexSearcher, they will basically all do their own index scans and collect results independently. If that's ...
    TsuraanTsuraan
    Jul 22, 2009 at 4:59 pm
    Jul 23, 2009 at 1:01 pm
  • Lukas MichelbacherLukas Michelbacher
    Jul 4, 2009 at 4:37 pm
    Jul 14, 2009 at 4:08 pm
  • Hi, new here. I recently started using lucene and had encounter a problem.I crawl and index a number of documents. When i perform a search, lets say "tall fat", by right the results that matches all ...
    Bourne71Bourne71
    Jul 31, 2009 at 10:51 am
    Aug 4, 2009 at 1:36 am
  • Is Lucene capable of handling UCS4 data natively? Thanks, Mike --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org ...
    Michael ThomsenMichael Thomsen
    Jul 31, 2009 at 2:08 pm
    Jul 31, 2009 at 7:25 pm
  • Hi! I'm with a error follow: java.lang.NoSuchMethodError: org.apache.lucene.store.FSDirectory.getDirectory(Ljava/io/File;)Lorg/apache/lucene/store/FSDirectory; In the code: File dirIndex = new ...
    Fabrício RaphaelFabrício Raphael
    Jul 29, 2009 at 2:36 pm
    Jul 29, 2009 at 4:36 pm
  • Hi, I have an application in which documents are added upon receiving a user request and a background thread is needed to remove old documents. I have an IndexWriter opened on a Directory that adds ...
    Chengdu HuangChengdu Huang
    Jul 28, 2009 at 4:26 am
    Jul 29, 2009 at 9:49 am
  • Hi, In the documents which contain the volunteer information : Doc1 : volunteer krish volunteer john volunteer Raj ... Doc2 : volunteer krish volunteer Raj volunteer Ganesh Doc3 : volunteer krish ...
    Ba3Ba3
    Jul 22, 2009 at 6:49 am
    Jul 24, 2009 at 6:21 am
  • Hello all I am sorting on datetime with minute resolution. It easily reaches the maximum heap size. I am having almost 100M records and it is using 1.5 GB. I am now in a situitation to stop sorting ...
    GaneshGanesh
    Jul 21, 2009 at 8:26 am
    Jul 23, 2009 at 9:46 am
  • I think this is a common problem, but don't know the correct solution. Users were doing queries on a numeric field such as qdur:[73 TO 117] and expecting to find all the values within but this fails ...
    Paul TaylorPaul Taylor
    Jul 29, 2009 at 10:04 am
    Aug 4, 2009 at 6:46 pm
  • http://issues.apache.org/jira/browse/LUCENE-1406 http://issues.apache.org/jira/browse/LUCENE-153 based on this, there are two options: 1- using the aramorph library 2- moving the code from trunk to ...
    WalidWalid
    Jul 23, 2009 at 11:33 am
    Aug 3, 2009 at 4:40 pm
  • Hi, I tried searching: "Apache Jakarta"~10 Nothing was returned. What might be wrong? Regards, Prashant.
    Prashant ullegaddiPrashant ullegaddi
    Jul 16, 2009 at 12:34 pm
    Jul 17, 2009 at 1:52 pm
  • Hello, I'm trying to search for the terms like c++ but the parser is stripping off the ++. I tried escaping the ++ with slashes but it's still stripping it off. I could replace + with "plus", is that ...
    Chris SalemChris Salem
    Jul 15, 2009 at 4:20 pm
    Jul 16, 2009 at 5:08 pm
  • Hi This question has probably been asked before so apologies for asking it again. Just to confirm that it is ok to use a single index writer in a web application and only close that single instance ...
    Amin Mohammed-ColemanAmin Mohammed-Coleman
    Jul 1, 2009 at 7:57 am
    Jul 2, 2009 at 8:05 am
  • Hi, I am currently creating a search engine and will need to generate a query like the following: title:(+chemistry +"national curriculum") its mention that it can be done using the QueryParser but ...
    Blazingwolf7Blazingwolf7
    Jul 28, 2009 at 8:29 am
    Jul 31, 2009 at 3:56 am
  • Hi, I have a question regarding RAMDirectory. I have a 5 GB index on disk and it is opened like the following: searcher = new IndexSearcher (new RAMDirectory (indexDirectory)); Approximately how much ...
    Dragon FlyDragon Fly
    Jul 23, 2009 at 1:37 pm
    Jul 24, 2009 at 8:21 am
  • Hi, I am doing a search on my index for a query like this: query = "\"Term 1\" \"Term 2\" \"Term 3\"" Where I want to find Term 1, Term 2 and Term 3 in the index. However, I only want to search for ...
    Max LynchMax Lynch
    Jul 23, 2009 at 6:29 pm
    Jul 23, 2009 at 8:00 pm
  • We have code (using Lucene 2.4.1) that will build a query that looks like: fielda:"ruz an"~2 OR fieldb:"ruz an"~2 OR fieldc:"ruz an"~2 When passed to a MultiFieldQueryParser and parsed it comes back ...
    Philip PuffinburgerPhilip Puffinburger
    Jul 21, 2009 at 6:46 pm
    Aug 7, 2009 at 9:26 am
  • Hi, I was wonder if there is a list of special characters for the standard analyzer? What I mean by "special" is characters that the analyzer considers break characters. For example, if I have ...
    OhayaOhaya
    Jul 31, 2009 at 2:12 am
    Jul 31, 2009 at 3:13 pm
  • Hi All, I am using Lucene 2.2.0 and have created an Index with some text including values in $, £ and euros too! But I can't search for text that includes $, £ or euro. I checked the index with Luke ...
    CbowditchCbowditch
    Jul 29, 2009 at 9:56 am
    Jul 30, 2009 at 8:38 pm
  • Hi there, I have a question… we have two querys which only different is the fact that Query_1 includes phrase queries where Query_2 has the phrase query but converted into a Boolean query. When each ...
    Naranjo, PedroNaranjo, Pedro
    Jul 23, 2009 at 9:44 pm
    Jul 28, 2009 at 4:28 pm
  • Hi, I'm relatively new to Lucene. I have the following case: I have indexed a bunch of documents. I then, query the index using IndexSearcher and retrieve the documents using Hits (I do know this is ...
    Anuj BhattAnuj Bhatt
    Jul 23, 2009 at 2:57 am
    Jul 28, 2009 at 11:30 am
  • Hi, We are writing to an index using IndexWriter with setMergeFactor() set to 10. After inserting several documents, a merge occurs, but our observation is that each segment only has a single ...
    Venkat RanganVenkat Rangan
    Jul 26, 2009 at 5:24 am
    Jul 26, 2009 at 3:04 pm
  • Hello, I would like to use a stemming analyser similar to KStem or PorterStem to provide access to a wider search scope for our users. However, at the same time I also want to provide the ability for ...
    Robert CorbettRobert Corbett
    Jul 22, 2009 at 1:08 pm
    Jul 23, 2009 at 4:08 am
  • hello there i like to re-rank lucene TopDoc result set. where shall i start thanks
    Henok sahiluHenok sahilu
    Jul 22, 2009 at 1:40 pm
    Jul 22, 2009 at 2:11 pm
  • I am doing sorting on DateTime with minute resolution. I am having 90 million of records and sorting is consuming nearly 500 MB. 30% records are not part of primary result set and they don't have ...
    GaneshGanesh
    Jul 17, 2009 at 9:12 am
    Jul 21, 2009 at 5:43 pm
  • Hi All, I need to make ? and ! characters to be a separate token e.g. to split [how are you?] in to 4 tokens [how], [are], [you] and [?] what would be the best way to do this? Thanks
    OBenderOBender
    Jul 17, 2009 at 6:29 pm
    Jul 17, 2009 at 10:19 pm
  • Hello, Are there any suggestions / best practices for using Lucene for searching non-linguistic text? What I mean by non-linguistic is that it's not English or any other language, but rather product ...
    JesLJesL
    Jul 16, 2009 at 1:04 pm
    Jul 16, 2009 at 6:04 pm
  • Hello, Over in Solr land, I'm facing a problem while upgrading the lucene version to trunk. Solr has a QueryElevationComponent which is used to boost certain documents to the top. It pre-processes ...
    Shalin Shekhar MangarShalin Shekhar Mangar
    Jul 15, 2009 at 7:56 am
    Jul 15, 2009 at 11:55 am
  • Hi, I have an issue when optimizing a large index (40GB). Basically the optimize falls over with an IOException stating there are not enough system resources, within a RandomAccessFile.readBytes call ...
    Carl AustinCarl Austin
    Jul 3, 2009 at 10:12 am
    Jul 3, 2009 at 4:33 pm
  • Hello again! Yes, I know some of us are still recovering from OSCON. It's time for another delicious meetup to chat about Hadoop, HBase, Solr, Lucene, and more! UW is quite a pain for us to access ...
    Bradford StephensBradford Stephens
    Jul 27, 2009 at 7:16 pm
    Aug 5, 2009 at 5:38 pm
  • Is there any difference between using QueryParser and MultiFieldQueryParser when have single default search field ? Depending on how many default search fields on an searching an index I select which ...
    Paul TaylorPaul Taylor
    Jul 31, 2009 at 12:37 pm
    Jul 31, 2009 at 3:08 pm
  • How to get the number of times a term occurs in the Lucene index? Regards, Prashant.
    Prashant ullegaddiPrashant ullegaddi
    Jul 30, 2009 at 6:42 pm
    Jul 31, 2009 at 10:51 am
  • Hi, I am trying to index information in some proprietary-formatted files. In particular, these files contain some IP addresses in dotted notation, e.g., aa.bb.cc.dd. For my initial test, I have a ...
    OhayaOhaya
    Jul 30, 2009 at 1:36 pm
    Jul 30, 2009 at 2:38 pm
  • Hi folks! After deleting documents from the index it can happen that fields become unused (i.e. no document has this field anymore). And IndexReader.getFieldNames() still returns these unused fields, ...
    Khin, GeraldKhin, Gerald
    Jul 9, 2009 at 1:40 pm
    Jul 25, 2009 at 3:23 am
  • Hi folks, I just upgrading Hibernate Search library of my app and so I had to upgrade Lucene too and pass from 2.2 to 2.4 version. In Lucene 2.4 the ISOLatin1AccentFilter class has changed and I ...
    Luther blissetLuther blisset
    Jul 24, 2009 at 9:41 am
    Jul 24, 2009 at 11:39 am
  • Hi, We have some 50M pages, and we also have computed PageRanks of those pages. What's the best way to combine lucene's score with PageRank? Regards, Prashant.
    Prashant ullegaddiPrashant ullegaddi
    Jul 19, 2009 at 11:54 am
    Jul 23, 2009 at 4:52 pm
  • Hello all, I am using Lucene v2.4.1. I am getting following exception while doing search on 4 indexes using ParallelMultiSearcher. ...
    GaneshGanesh
    Jul 13, 2009 at 6:06 am
    Jul 13, 2009 at 2:09 pm
  • Hi All, I'm using the default setup of lucene (no custom analyzers configured) and came across the following issue: In Hindi if there is a letter with a diacritic in a phrase lucene will find the ...
    OBenderOBender
    Jul 10, 2009 at 7:10 pm
    Jul 13, 2009 at 10:37 am
  • Hi, I am trying to solve the following problem: In my index I have a "url" field added as Field.Store.YES, Field.Index.NOT_ANALYZED and I must use this field as a "key" to identify a document. The ...
    RafRaf
    Jul 3, 2009 at 4:27 pm
    Jul 4, 2009 at 4:02 pm
  • Hi, I still am new to Lucene, but I think I have an initial indexer app (based on the demo IndexFiles app) working, and also have a web app, based on the demo luceneweb web app working. I'm still ...
    OhayaOhaya
    Jul 31, 2009 at 4:43 pm
    Jul 31, 2009 at 8:33 pm
  • Hi All, We have a system with a lucene index with 100GB and growing fast. I wonder whether there is an efficient way to backup it taking into account only the changes among old and new version of the ...
    Alexandre Leopoldo GonçalvesAlexandre Leopoldo Gonçalves
    Jul 22, 2009 at 1:40 pm
    Jul 22, 2009 at 2:43 pm
  • Hi, I have a situation, where stored, un-indexed fields can contain potentially large amounts of data. Is it possibly to read the contents of a field incrementally? That is, do not load the complete ...
    Günter LadwigGünter Ladwig
    Jul 14, 2009 at 3:12 pm
    Jul 15, 2009 at 1:02 pm
Group Navigation
period‹ prev | Jul 2009 | next ›
Group Overview
groupjava-user @
categorieslucene
discussions105
posts582
users117
websitelucene.apache.org

117 users for July 2009

Michael McCandless: 45 posts Eks dev: 31 posts Uwe Schindler: 26 posts Erick Erickson: 25 posts Simon Willnauer: 21 posts Shai Erera: 20 posts Matthew Hall: 19 posts OBender Hotmail: 17 posts Ohaya: 17 posts Robert Muir: 16 posts Prashant ullegaddi: 14 posts AHMET ARSLAN: 13 posts Ganesh: 13 posts Ian Lea: 13 posts Grant Ingersoll: 12 posts Phil Whelan: 12 posts Ba3: 11 posts Chris Hostetter: 9 posts Steven A Rowe: 9 posts Henok sahilu: 8 posts
show more
Archives