75 discussions - 359 posts

  • Hi, We're currently in the process of switching many of our screens from MySQL to Lucene because MySQL simply dies because we have too much data and it's becoming too long to generate the stats we ...
    Michel Nadeau
    Apr 1, 2010 at 1:18 am
    Apr 3, 2010 at 2:25 am
  • We are seeing a situation where the IndexWriter is using up the Java Heap space and only releases memory for garbage collection upon a commit. We are using the default RAMBufferSize of 16 mb. We are ...
    Woolf, Ross
    Apr 1, 2010 at 10:58 pm
    May 19, 2010 at 5:07 pm
  • Hi, It seems like my IndexWriter after committing and optimizing has a retained size of 140MB. See [1] for a screenshot of the heapdump analysis done with Eclipse MAT. Of those 140MB 67MB are retained ...
    Ruben Laguna
    Apr 7, 2010 at 8:36 pm
    Apr 9, 2010 at 4:55 am
  • Hi. I am searching for some guidance on right memory options for my Search Server application. How much memory a lucene based application should be given? Till a few days back I was running my search ...
    Samarendra Pratap
    Apr 27, 2010 at 12:52 pm
    Apr 29, 2010 at 7:47 am
  • I'm putting on a talk at Lucene Eurocon (http://lucene-eurocon.org/sessions-track1-day2.html#1) on "Practical Relevance" and I'm curious as to what people put in practice for testing and improving ...
    Grant Ingersoll
    Apr 29, 2010 at 2:15 pm
    May 5, 2010 at 4:41 pm
  • Hello, I am trying to determine begin and end offsets for terms and phrases matching a query. Is there a way using either the highlighter or fast vector highlighter in contrib? I have already ...
    Stephen Greene
    Apr 16, 2010 at 3:50 pm
    Apr 27, 2010 at 4:30 pm
  • Hello, I'm using Lucene to index and search through a collection of Chinese documents. However, I'm noticing an odd behavior in query parsing/searching. Given the two queries below: (Ci refers to ...
    Wei Ho
    Apr 29, 2010 at 7:50 pm
    May 11, 2010 at 11:56 pm
  • Hi, folks. I am using PyLucene and doing a lot of get tokens. lucene.py reports version 2.4.0. It is rpath linux with 8GB of memory. Python is 2.4. I'm not sure what the maxheap is, I think that it ...
    Herbert Roitblat
    Apr 10, 2010 at 3:23 am
    Apr 14, 2010 at 8:50 pm
  • Just out of curiosity, why does LUCENE-1377 have a minor priority? https://issues.apache.org/jira/browse/LUCENE-1377 Don't people index, filter, search HTML, perhaps more than any other format? Looks ...
    Apr 23, 2010 at 8:48 pm
    Apr 27, 2010 at 7:33 pm
  • On a small index that I have I'd like to query certain fields by adding wildcards on either side of the term: foo - *foo*. I realize the performance implications but there are some cases where these ...
    Christopher Condit
    Apr 30, 2010 at 8:11 pm
    May 1, 2010 at 5:43 pm
  • Hi, I am trying to use the MultiFieldQueryParser to search "title" and "desc" fields. However the Lucene API appears to only let me provide a single search term. Is it possible to use multiple search ...
    Apr 15, 2010 at 10:35 pm
    Apr 20, 2010 at 2:23 pm
  • Hello. Is it possible to combine PrefixQuery and FuzzyQuery? The search on a term should both be fuzzy but also match with results that just begin with that token (or an approximation of that token). ...
    Lukas Österreicher
    Apr 19, 2010 at 2:44 pm
    Apr 19, 2010 at 4:54 pm
  • Hi, I am trying to chase an issue in our code and it is being quite difficult. We have seen two instances (see below) where we get the same error. I have been trying to reproduce but it has been ...
    Apr 14, 2010 at 11:41 am
    Apr 16, 2010 at 3:22 pm
  • Hi, is it normal for indexing time to increase up to 10 times after introducing NumericField instead of Field (for two fields)? I've changed two date fields from String representation (Field) to ...
    Tomislav Poljak
    Apr 14, 2010 at 6:13 pm
    Apr 15, 2010 at 12:15 pm
  • Hello! I am a beginner with Lucene. I'm needing to do the following: I have a text file with the following terms: "Lucene in action" "Lucene" and a file with the following sentences: 1 - "Lucene in ...
    Fotos fotos
    Apr 10, 2010 at 12:25 am
    Apr 15, 2010 at 6:37 am
  • I'm getting a ClosedChannelException from IndexWriter.getReader(). I don't think the writer has been closed and, if it were, I would expect an AlreadyClosedException as described in the API ...
    Apr 8, 2010 at 6:15 pm
    Apr 8, 2010 at 10:19 pm
  • Hi all, I am new to Lucene and I want to ask about range score that Lucene used, because I got score greater than 1. I'm using lucene-3.0.1 and using MoreLikeThis to do document similarity and ...
    Clara Vania
    Apr 27, 2010 at 10:03 am
    Apr 28, 2010 at 3:18 am
  • I am analyzing this with my custom analyzer: String s = "mail77 mail88888 tc ro45mine durante ...
    Apr 21, 2010 at 1:22 pm
    Apr 21, 2010 at 6:02 pm
  • I'm trying to use Highlighter with QueryScorer after reading: https://issues.apache.org/jira/browse/LUCENE-1685 The problem is: I'm not getting a result unless my query term is an exact match. Am ...
    Apr 29, 2010 at 8:22 pm
    Apr 29, 2010 at 10:10 pm
  • Greetings to all. I have read at so many places that we should not open a Searcher for each request for the sake of performance, but I have always been wondering whether it is actually Searcher or ...
    Samarendra Pratap
    Apr 22, 2010 at 10:39 am
    Apr 24, 2010 at 4:25 pm
  • I am encountering a strange issue. I have a CustomStopAnalyzer. If I do this (supporting code taken from AnalyzerUtils in LIA3 source code Mike uploaded): Analyzer customStopAnalyzer = new ...
    Apr 20, 2010 at 4:00 pm
    Apr 21, 2010 at 11:02 am
  • We are in the process of removing the deprecated api from our code to move to version. One of the deprecation is, the queryparser now expects a version parameter in the constructor. I also have read ...
    Siraj Haider
    Apr 12, 2010 at 9:14 pm
    Apr 13, 2010 at 2:39 pm
  • dear all, I have a problem using lucene in NFS. A scheduler runs periodically generating reports in pdf format and saves it to a file server. The drive of the file server is mounted to the scheduler ...
    Vijay Veeraraghavan
    Apr 30, 2010 at 7:39 am
    Apr 30, 2010 at 3:02 pm
  • Hi! we are using Lucene 2.4.1 in our app. It works great so far, but now a customer ran into a strange problem. During the day, the search index is updated regularly with the newest changes in the ...
    Anna Hunecke
    Apr 29, 2010 at 10:51 am
    Apr 30, 2010 at 9:07 am
  • Using Lucene.Net I've built an index of documents. The documents also have a unique identifier (my identifier, not the lucene index's id). The unique identifers are also a sort order of new-ness ...
    Ravi Patel
    Apr 22, 2010 at 7:59 pm
    Apr 24, 2010 at 2:11 pm
  • Hi all, I have a question about usage of lucene, I want to figure out how I can get one or all posting lists, after adding a document to the index, but without materializing it in files. So after I ...
    Yağız Kargın
    Apr 20, 2010 at 7:40 am
    Apr 21, 2010 at 1:20 pm
  • Hi, I have indexed the following two fields: org_id - NOT_ANALYZED, org_name - ANALYZED. However when I try to search by org_id, for example, 12345, I get no hits. I am using the StandardAnalyzer to ...
    Apr 19, 2010 at 5:41 pm
    Apr 19, 2010 at 6:48 pm
  • I am building an online application where I want to provide search functionality to users and each user is to search only within his own data. Can you give me some ideas about the structure of the ...
    Erdinc Yilmazel
    Apr 18, 2010 at 9:37 pm
    Apr 19, 2010 at 6:25 pm
  • Hi, I would like to have example of adding payload for lucene 3.0. I found an example on internet which uses BoostingTermQuery but I couldn't find this class in Lucene 3.0.0 jar and I was wondering ...
    Apr 16, 2010 at 11:58 am
    Apr 19, 2010 at 1:24 pm
  • Hi All, I am implementing a search function for address by hibernate search which is based on lucene. The class definition as following: @Indexed public class Address implements Cloneable { ...
    Apr 15, 2010 at 10:30 am
    Apr 19, 2010 at 10:32 am
  • Hello, I have document for which I'd like to index an array of indexes. For example, there is a product that belongs to categories with IDs 12, 15, 16, 145, 148. I'd like to index these categories, ...
    Kristjan Siimson
    Apr 14, 2010 at 7:16 pm
    Apr 14, 2010 at 10:19 pm
  • Hi all, Please let me know if this should be posted instead to the Lucene java-dev list. We have very large tis files (about 36 GB). I have not been too concerned as I assumed that due to the ...
    Burton-West, Tom
    Apr 12, 2010 at 9:58 pm
    Apr 13, 2010 at 4:32 pm
  • Is there currently a way to take a query, run it on multiple hosts containing different indexes, then merge the results from each host to present to the user? It looks like Solr can handle multiple ...
    Shaun Senecal
    Apr 25, 2010 at 10:03 am
    May 11, 2010 at 4:12 am
  • Hi All, I have a question regarding the new Lucene query parser framework in the contribs project. My company's project is running on top of 2.4.0 release of Lucene. I am trying to evaluate the new ...
    Kannan chandrasekaran
    Apr 29, 2010 at 12:44 am
    Apr 29, 2010 at 5:22 am
  • I have a situation similar to the following that I'm trying to solve: I have a field in my document that contains a range of numbers. Say, for example, the universe of numbers is the range of ...
    Jeremy Volkman
    Apr 24, 2010 at 3:41 am
    Apr 27, 2010 at 10:43 pm
  • Hi all, Please suggest VM options for faster lucene search for 23G index. I am using lucene version 2.9.2 and java version 1.6 . -- Er. Harsh Srivastava ...
    Harsh Srivastava
    Apr 26, 2010 at 6:56 am
    Apr 26, 2010 at 8:20 am
  • Hi, does Lucene search use short-circuiting when I execute a query like: A:10 AND b:20 AND c:30 In general, can the position of field names impact search performance, e.g. if field A with value 10 is more ...
    Apr 21, 2010 at 10:21 am
    Apr 21, 2010 at 1:24 pm
  • Hello, I would like to query based on a start and end date. I was thinking something like this start_date: [20000101 TO <todays date ] end_date: [<todays date TO 20900101] Would this work for me? Our ...
    Apr 16, 2010 at 1:24 pm
    Apr 21, 2010 at 1:11 pm
  • Hi, Can a NumericRangeQuery be one of several Queries inside a complex BooleanQuery? When I do this my NumericRangeQuery seems to automagically be converted to a TermRangeQuery. Thanks, Paul
    Murdoch, Paul
    Apr 16, 2010 at 6:11 pm
    Apr 20, 2010 at 1:26 am
  • Hi all, say I have an Index with one field named "category". There are two documents one with value "(testvalue)" and one with value "test value". Now someone searches with "test". My search engine uses ...
    Franz Roth
    Apr 14, 2010 at 11:42 am
    Apr 15, 2010 at 1:04 pm
  • Hi All, I got the following error while trying to optimize index sized 31 GB: Exception in thread "Lucene Merge Thread #3" P.Lucene.Expert.index.MergePolicy$MergeException: java.io.IOException: Data ...
    Liat oren
    Apr 15, 2010 at 7:36 am
    Apr 15, 2010 at 9:32 am
  • Hello All, I am kind of new to Lucene, and having problem filtering search results. Background: My Indexed documents have multiple bills and each bill has multiple versions. Each version of the same ...
    Sirish Vadala
    Apr 13, 2010 at 8:59 pm
    Apr 14, 2010 at 9:07 pm
  • Hi, folks. I appreciate the help people have been offering. Here is my problem. My immediate need is to get the tokens for a document from the Lucene index. I have a list of documents that I walk, ...
    Herbert Roitblat
    Apr 12, 2010 at 6:15 pm
    Apr 12, 2010 at 8:28 pm
  • I have a requirement where in the results have to be sorted in ascending order for few fields, and descending order for one field. Currently I am using: String[] sortOrder = { IFIELD_YEAR, ...
    Sirish Vadala
    Apr 29, 2010 at 5:09 pm
    Apr 29, 2010 at 5:20 pm
  • I'm trying to compile JCC, using python setup.py build This is what I get: ~/pylucene-2.9.2-1/jcc$ python setup.py build running build running build_py copying jcc/config.py - ...
    Herbert Roitblat
    Apr 27, 2010 at 9:38 pm
    Apr 27, 2010 at 11:43 pm
  • Hello Luceners, I am sure I'm not the only one having such a snippet in my dedicated analyzer: m.put("en", new SnowballAnalyzer("English")); m.put("es", new SnowballAnalyzer("Spanish")); m.put("de", ...
    Paul Libbrecht
    Apr 26, 2010 at 3:05 pm
    Apr 26, 2010 at 4:06 pm
  • Dear all, I want to provide hierarchical matching in Lucene. For example, in a document, there is a field and its value: transportation: vehicle and suppose there is a class/subclass relationship ...
    Wei Yi
    Apr 22, 2010 at 9:09 pm
    Apr 23, 2010 at 4:20 am
  • Hi, I'm building a BooleanQuery that may contain a NumericRangeQuery. The NRQ may be one of several sub-queries in the parent BooleanQuery. I wasn't able to make the NRQ function properly by ...
    Murdoch, Paul
    Apr 20, 2010 at 7:09 pm
    Apr 21, 2010 at 9:54 am
  • I'd like to use lucene to search text documents for the existence of a large list of search terms. I have a file that contains thousands of entries, one word per line. I was thinking about writing ...
    Fred Rahmanian
    Apr 20, 2010 at 8:43 pm
    Apr 20, 2010 at 9:27 pm
Group Overview
group: java-user @

100 users for April 2010

  • Michael McCandless: 29 posts
  • Uwe Schindler: 26 posts
  • Erick Erickson: 13 posts
  • Herbert Roitblat: 12 posts
  • Ian Lea: 12 posts
  • Jm: 11 posts
  • Justin: 11 posts
  • Ruben Laguna: 11 posts
  • Grant Ingersoll: 9 posts
  • Samarendra Pratap: 9 posts
  • Shai Erera: 9 posts
  • Henrib: 8 posts
  • Woolf, Ross: 8 posts
  • Michel Nadeau: 7 posts
  • Prasenjit mukherjee: 7 posts
  • Stephen Greene: 7 posts
  • Chris Hostetter: 5 posts
  • Chris Lu: 5 posts
  • Koji Sekiguchi: 5 posts
  • Suman Holani: 4 posts