82 discussions - 347 posts

  • Hi, I build a RAMDirectory on top of an FSDirectory, and would like the writer associated with the RAMDirectory to periodically write to the hard drive. Is this achievable? Thanks.
    Feb 5, 2012 at 6:57 am
    Feb 6, 2012 at 4:55 pm
  • Hi, I have a little bit of an unusual set of requirements, and I am looking for advice. I have researched the archives, and seen some relevant posts, but they are fairly old and not specifically a ...
    Peter Miller
    Feb 6, 2012 at 2:52 am
    Feb 9, 2012 at 2:12 pm
  • Hi I have a noobie question. I am trying to use the SweetSpotSimilarity (SSS) class ...
    Peyman Faratin
    Feb 15, 2012 at 2:40 pm
    Mar 6, 2012 at 10:59 pm
  • Hello, I'm trying to create a Lucene Query that will take a term and expand it to include common OCR errors (for example, 'cl' is often misread as 'd', so a search for 'clog' should also hit 'dog') ...
    Alan Woodward
    Feb 28, 2012 at 12:34 pm
    Mar 2, 2012 at 9:26 pm
  • My apologies if this answer is readily available someplace, I've searched around and not found a definitive answer. I'd like to run a query for documents that _do not_ contain particular indexed ...
    Tim Eck
    Feb 16, 2012 at 8:59 pm
    Oct 26, 2012 at 1:18 am
  • 3.5.0: I passed a fixed size executor service with one thread, and then with two threads, to the IndexSearcher constructor. It hung. With three threads, it didn't work, but I got different results ...
    Benson Margulies
    Feb 19, 2012 at 2:08 pm
    Feb 20, 2012 at 11:56 am
  • Hi, can lucene-3.0.3 be used for searching text in PDF, xlsx, docx, doc, xls, msg, and TXT files? Is there a common function to accomplish this? Please help me on this. Thanks Prasad
    Prasad KVSH
    Feb 1, 2012 at 1:11 pm
    Feb 2, 2012 at 6:02 pm
  • Hello! I have a small issue with the QueryParser in my program. It uses my custom filter to parse its queries, but I get unexpected results when I take input from the keyboard. To ...
    Feb 25, 2012 at 6:38 am
    Mar 2, 2012 at 5:57 am
  • A long-running program of mine (which Uwe's read a model of) slowly keeps adding merge threads. I count 22 at the moment. Each one shows up, runs for a bit, and then goes to sleep for, seemingly ...
    Benson Margulies
    Feb 20, 2012 at 2:05 am
    Feb 27, 2012 at 3:59 pm
  • Hi, I want to use Lucene with the following scoring logic: When I index my documents I want to set for each field a score/weight. When I query my index I want to set for each query term a ...
    Yuval Kesten
    Feb 21, 2012 at 3:19 pm
    Feb 23, 2012 at 12:21 pm
  • Hi all. We have 1..N indexes for each time someone adds some data. Each time they can choose different tokenisation settings. Because of this, each text index has its own query parser instance ...
    Feb 15, 2012 at 12:40 am
    Feb 15, 2012 at 1:27 am
  • Hello, I want to implement my custom filter. My question is quite simple, but I cannot find a solution to it no matter how I try: how can I access the TermAttribute of the token following the one I ...
    Feb 9, 2012 at 7:19 pm
    Feb 9, 2012 at 10:15 pm
  • Hi all. I've found a rather frustrating issue which I can't seem to get to the bottom of. Our application will crash with an access violation around the time when the index is closed, with various ...
    Feb 1, 2012 at 12:17 am
    Feb 1, 2012 at 11:45 am
  • Hi, I want to customize the indexing of some specific kind of files I have. I am using 2.9.3 but upgrading is possible. This is how my file's data looks ***************************** Data for 2010 ...
    Prakash Reddy Bande
    Feb 27, 2012 at 4:56 pm
    Feb 27, 2012 at 7:43 pm
  • I have a solr instance with about 400m docs. For text searches it is perfectly fine. When I do searches that calculate the amount of times a word appeared in the doc set for every day of a month, it ...
    Jason Toy
    Feb 23, 2012 at 6:25 am
    Feb 23, 2012 at 1:39 pm
  • Hi all, for some reason, we need empty numeric field values (to ensure that the length of the field value list is constant). We tried to add an empty String-Fieldable instead in the case a value is ...
    Christian Reuschling
    Feb 15, 2012 at 11:58 am
    Feb 15, 2012 at 2:04 pm
  • Hello, I am currently evaluating Lucene 3.5.0 for upgrading from 3.0.3, and in the context of my usage, the most important parameter is index writing throughput. To that end, I have been running ...
    Vitaly Funstein
    Feb 9, 2012 at 4:28 am
    Feb 11, 2012 at 5:11 am
  • Hi, I am using NRTManager and NRTManagerReopenThread. Though I don't close either the writer or the reopen thread, I receive an AlreadyClosedException as follows. My initiating NRTManager and ...
    Feb 8, 2012 at 5:20 am
    Feb 8, 2012 at 1:09 pm
  • Hi List Apologies for such a long message. I have tried to include everything, that you might need to know to answer my question. I am having difficulties understanding how or what ...
    Feb 2, 2012 at 4:57 pm
    Feb 3, 2012 at 5:28 pm
  • So I subclass QueryParser and give it the query "dug up"; debugging then shows it calls getFieldQuery(String field, String queryText, boolean quoted) twice, once with queryText=dug and once with queryText=up ...
    Paul Taylor
    Feb 1, 2012 at 9:32 pm
    Feb 2, 2012 at 8:26 am
  • Hey guys, We have been getting great feedback from people on this list and wanted to let you guys know of major updates that we have made to the Lucene Architecture/Documentation site that we have ...
    Vineet Sinha
    Feb 29, 2012 at 10:19 pm
    Mar 1, 2012 at 4:09 am
  • Hello all, I was using DateTime as String and now I am using NumericField. Using NumericField takes more heap and storage space than the earlier String version. Is it good to move to NumericField or ...
    Feb 28, 2012 at 10:15 am
    Feb 28, 2012 at 11:41 am
  • Hi, Let's say I have 6 documents and each document has 2 fields (i.e. CustomerName and OrderDate). For example: Doc 1 John 20120115 Doc 2 Mary 20120113 Doc 3 Peter 20120117 Doc 4 Kate 20120208 Doc 5 ...
    Dragon Fly
    Feb 26, 2012 at 1:31 pm
    Feb 27, 2012 at 3:30 pm
  • This is a pretty simple question to answer, but I have customers asking me how this is supposed to work and I'm having trouble explaining it. I have an app that indexes emails so there are plenty of ...
    Charlie Hubbard
    Feb 16, 2012 at 5:19 pm
    Feb 26, 2012 at 2:13 pm
  • Hello, I'm trying to understand the behavior of CustomScoreQuery. It seemed to me, that default CustomScoreQuery(Query subQuery, ValueSourceQuery valSrcQuery) should return a score that is a product ...
    Dominika Puzio
    Feb 16, 2012 at 8:39 pm
    Feb 21, 2012 at 4:54 pm
  • Hi, I have one index which mixes multiple categories. How can I separate it by category? I would like to save each category into a different folder. A code example would be great. Thanks
    Feb 19, 2012 at 1:27 pm
    Feb 20, 2012 at 9:39 am
  • If I have a lot of segments, and an executor service in my searcher, the following runs out of memory instantly, building giant heaps. Is there another way to express this? Should I file a JIRA that ...
    Benson Margulies
    Feb 19, 2012 at 2:22 pm
    Feb 19, 2012 at 3:27 pm
  • Hi, I've been looking for a short circuit AND operator in Lucene or a way to do subquerying. Basically for queries such as field1:foo AND field2:*bar, I think it would be highly beneficial to ...
    Delalande, Thierry
    Feb 15, 2012 at 11:34 am
    Feb 16, 2012 at 11:14 pm
  • Hello all, we may have had this debate frequently in the group, yet I want to clarify one more time. I was using multiple indexes (one index per week) with previous versions of Lucene (2.4 - ...
    Feb 23, 2012 at 7:31 am
    Feb 23, 2012 at 11:26 am
  • Hi, when I am adding three documents I am not getting the top matched text at the top, but when I have only two documents it displays properly, as shown in the following text. I am using default similarit ...
    A Z
    Feb 14, 2012 at 6:17 pm
    Feb 20, 2012 at 5:28 pm
  • Hi guys, I hope I'm sending this to the right place. I have this possible idea in mind (still fuzzy, but enough to describe this), and I was wondering if Lucene or Solr could help in this. I've ...
    Pedro Ferreira
    Feb 15, 2012 at 6:05 pm
    Feb 20, 2012 at 5:23 pm
  • Hi, I need to manage multiple applications, each having its own writer, all on the same FSDirectory. How can I make this work? I encounter quite a few exceptions. Thanks
    Feb 14, 2012 at 4:49 pm
    Feb 14, 2012 at 5:56 pm
  • Hi, Is there a way to improve query performance when using a leading * as a wildcard on a path property? I have hundreds of queries to run on a lucene index (~250mo). Executing those queries without ...
    Feb 13, 2012 at 4:38 pm
    Feb 13, 2012 at 5:06 pm
  • Hi there, I am currently working on a search engine based on Lucene and have some issues because Java is not my regular programming language, which makes things a bit hard. What I was wondering about ...
    Feb 13, 2012 at 2:32 pm
    Feb 13, 2012 at 5:02 pm
  • Assume we have a Lucene index over which several types of analyses are performed. Assume that the conclusions of some analysis require that new tokens be added to existing documents in the index. For ...
    Arnon Mazza
    Feb 1, 2012 at 2:05 pm
    Feb 2, 2012 at 8:57 pm
  • Hello all, I am upgrading from 3.0.3 to 3.5.0. 1) NumberTools is deprecated. I am converting long to string and storing it in the index. Now this is deprecated. If I replace this API with NumericUtils / ...
    Feb 1, 2012 at 8:44 am
    Feb 1, 2012 at 10:18 am
  • We tracked down a large memory leak (effectively a leak anyway) caused by how Analyzer uses CloseableThreadLocal. CloseableThreadLocal.hardRefs holds references to Thread objects as keys. The ...
    Matthew Bellew
    Feb 29, 2012 at 5:18 pm
    Mar 2, 2012 at 11:57 pm
  • Lucene (using 3.5) seems to be caching field values for documents (after they have been retrieved) and I am hoping someone can provide more information on how and where exactly the field values are ...
    Rose, Stuart J
    Feb 24, 2012 at 9:19 pm
    Feb 24, 2012 at 11:20 pm
  • Hi, I am using Taxonomy Search to build a facet comprising things such as “/author/American/Mark Twain”. Since the word "author" has a synonym of "writer", can I use "writer" instead of "author" to ...
    Feb 23, 2012 at 4:48 am
    Feb 23, 2012 at 5:28 am
  • Trying out ShingleFilter, and the way it is documented implies that you can just add it to your analyzer and that's it, with no side effects except a larger index, but I read others implying you have ...
    Paul Taylor
    Feb 21, 2012 at 1:05 pm
    Feb 21, 2012 at 3:11 pm
  • Using Lucene 3.5.0, on a 32-core machine, I have coded something shaped like: make a writer on a RAMDirectory. start: Create a near-real-time searcher from it. farm work out to multiple threads, each ...
    Benson Margulies
    Feb 19, 2012 at 2:21 am
    Feb 19, 2012 at 11:07 am
  • Hello, I have a noobie question. I am trying to implement a small POC app. I have lots of sharded indexes in a folder and I am trying to read them like this: MultiReader reader = new ...
    Feb 16, 2012 at 8:35 am
    Feb 16, 2012 at 11:14 am
  • Hi, my application will run forever. When is a good time to refresh the writer (and merge the segments)? Thanks
    Feb 13, 2012 at 6:17 pm
    Feb 14, 2012 at 4:45 pm
  • Hello, I want to score span queries based on the simple presence or absence of a hit (I'm not interested in Tf or Idf here), with a possible boost on specific spans. I've already extended ...
    Alan Woodward
    Feb 13, 2012 at 11:39 am
    Feb 13, 2012 at 1:34 pm
  • Hi, I have about 6.5 million documents, which lead to a 1.5G index. Searching a couple of terms, like "dvd" and "price", takes about 0.1 second. I am afraid that our data will grow rapidly ...
    Feb 8, 2012 at 12:44 pm
    Feb 8, 2012 at 2:19 pm
  • My Index does NOT have a simple UID, it uses the file PATH to the file as the unique key. I was implementing a CustomScoreQuery which not only tweaked the score it also wanted to write down which ...
    Paul Allan Hill
    Feb 4, 2012 at 12:10 am
    Feb 6, 2012 at 11:12 pm
  • Hi, I learnt about Lucene from Google and I thought of implementing it at my company. I don't want to use Lucene as a web search application. I have a large backup storage which consists of html ...
    Dheeraj Kv
    Feb 1, 2012 at 8:56 am
    Feb 6, 2012 at 10:01 am
  • Hi, I have an issue with Lucene 2.9.4 and sorting of wildcard queries. If I set a boost to some documents during indexing like this: doc.setBoost(1000.00); and execute a query like this ...
    Lutz Fechner
    Feb 1, 2012 at 10:42 am
    Feb 1, 2012 at 11:53 am
  • Using Lucene 3.5, I created a query parser based on the dismax parser, but in order to get matches on misspellings etcetera I additionally do a fuzzy search and a wildcard search ...
    Paul Taylor
    Feb 3, 2012 at 3:01 pm
    Mar 8, 2012 at 10:00 pm
  • Hi all, I have a question. Is there a way to distinguish queries like 'hotel' and 'hotel restaurant', queries with overlapping patterns, effectively? For example, if I want the search to return ...
    ☼ 林永忠 ☼ (Yung-chung Lin)
    Feb 28, 2012 at 6:26 pm
    Feb 29, 2012 at 9:44 am
Group Overview
group: java-user @

100 users for February 2012

  • Ian Lea: 31 posts
  • Uwe Schindler: 27 posts
  • Cheng: 26 posts
  • Mike McCandless: 17 posts
  • Benson Margulies: 12 posts
  • Ganesh: 11 posts
  • Damerian: 9 posts
  • Erick Erickson: 9 posts
  • Paul Allan Hill: 9 posts
  • Superruiye: 9 posts
  • Robert Muir: 8 posts
  • Alan Woodward: 7 posts
  • Paul Taylor: 7 posts
  • Steven A Rowe: 7 posts
  • Chris Hostetter: 6 posts
  • Prasad KVSH: 6 posts
  • Trejkaz: 6 posts
  • Li Li: 5 posts
  • Peter Miller: 5 posts
  • Yuval Kesten: 5 posts