62 discussions - 245 posts

  • I want to store the long type value in my index files like the following: NumericField priceField = new NumericField("price"); priceField.setDoubleValue(temp.getCurrentprice()); document.add(priceField) ...
    Jianwen lou
    Mar 27, 2012 at 7:21 am
    Apr 2, 2012 at 8:02 pm
  • I've posted a self-contained test case to github of a mystery. git://github.com/bimargulies/lucene-4-update-case.git The code can be seen at ...
    Benson Margulies
    Mar 6, 2012 at 1:36 pm
    Mar 6, 2012 at 3:08 pm
  • Hi Group, I am working on a Lucene search solution for multiple fields. So far, if the fields are of string type I am having no difficulties in retrieving using the MultiFieldQueryParser. For ...
    Deb Lucene
    Mar 14, 2012 at 3:33 pm
    Mar 28, 2012 at 2:47 pm
  • Hi, We migrated our project to Lucene 3.5 and used the TieredMergePolicy with all its defaults. Our project contains a relatively small index (+-90MB) and gets small writes every few seconds. The ...
    Tim Bogaert
    Mar 15, 2012 at 11:14 am
    Mar 15, 2012 at 5:07 pm
  • Under "LUCENE-1458, LUCENE-2111: Flexible Indexing", CHANGES.txt appears to be missing one critical hint. If you have existing code that called IndexReader.terms(), where do you start to get a ...
    Benson Margulies
    Mar 6, 2012 at 1:50 pm
    Mar 6, 2012 at 3:00 pm
  • Hello, I am using Lucene version 3.4 through the Python extension (pylucene) in a multi-threaded script. When I launch the script I sometimes get a fatal error message (log attached) and sometimes ...
    David Mosca
    Mar 29, 2012 at 2:35 pm
    Mar 30, 2012 at 2:12 pm
  • Hi all, I have a search application with 16 million documents that uses custom scores per document using a ValueSource. These values are updated a lot (and sometimes all at once), so I can't really ...
    Christoph Kaser
    Mar 27, 2012 at 7:29 am
    Apr 5, 2012 at 2:28 pm
  • Sorry, I'm coming up empty in Google here.
    Benson Margulies
    Mar 5, 2012 at 3:54 pm
    Mar 5, 2012 at 4:48 pm
  • I need to delete entries from a posting list. How do I do this in Lucene 4.0? I need to do this to test different pruning algorithms. Thanks in advance, ZP
    Zeynep P.
    Mar 19, 2012 at 10:24 am
    Apr 23, 2012 at 5:52 pm
  • Hello, I am using BlockGroupingCollector for the first time and I have a small problem with it. The indexing code is pretty much a copy of the one from the docs. Searching looks like this: Filter groupEndFilter ...
    Grzegorz Tańczyk
    Mar 8, 2012 at 9:44 am
    Mar 9, 2012 at 5:20 pm
  • Hello I've been looking at the BlockJoinQuery in Lucene 3.4.0 and would like to clarify my understanding. Suppose we have a parent document that we index with (say) 4 child documents. My ...
    Kiwi clive
    Mar 22, 2012 at 2:51 pm
    Aug 2, 2012 at 9:49 pm
  • Hello, I'm currently working out some problems when searching for Tibetan Characters. More specifically: /u0f10-/u0f19. We are using the StandardAnalyzer (3.4) and I've narrowed the problem down to ...
    Denis Brodeur
    Mar 30, 2012 at 4:47 pm
    Mar 30, 2012 at 6:12 pm
  • Dear, Bravo for your work on Lucene. A comment: ToParentBlockJoinQuery returns bad children (children of the previous adjacent block). This happens if the parent is the first of the list of document block ...
    Jean-Marc MORAS
    Mar 12, 2012 at 6:41 pm
    Mar 15, 2012 at 3:49 pm
  • I'm working on a project where I need to tag both the part of speech and other syntactic information on tokens so that this information is searchable. I have read the threads on the mailing list ...
    Mark McGuire
    Mar 14, 2012 at 4:38 pm
    Apr 10, 2012 at 7:21 am
  • Hi all, I read these two articles, http://blog.mikemccandless.com/2012/01/searching-relational-content-with.html and http://blog.mikemccandless.com/2012/01/tochildblockjoinquery-in-lucene.html, and wrote a ...
    Li Li
    Mar 23, 2012 at 4:27 am
    Mar 23, 2012 at 11:36 am
  • Hi Group, Sorry for cross posting! We need to index a document corpus (news articles) with some meta data features. The meta data are actually company names with some scoring (a double, between 0 to ...
    Deb Lucene
    Mar 21, 2012 at 2:21 pm
    Mar 21, 2012 at 4:04 pm
  • I'm running with 3.4 code and have studied up on all the API related to the optimize() replacements and understand I needn't worry about deleted documents, but I still want to ask a few things about ...
    Paul Hill
    Mar 7, 2012 at 1:01 am
    Mar 8, 2012 at 7:11 pm
  • Hi guys, will this be available in Lucene 3.6 or is it only going into version 4.0? Clive
    Kiwi clive
    Mar 29, 2012 at 11:56 am
    Mar 29, 2012 at 11:36 pm
  • Does the Lucene API allow adding new field values to an existing doc? For example, initially Doc 1 has (FieldName, Value): id = 1, schoolname = xyz, zipcode = pqr. After an update, can I keep the existing field values and ...
    Anupam Bhattacharya
    Mar 28, 2012 at 12:17 pm
    Mar 29, 2012 at 12:44 pm
  • I'm using a MemoryIndex to search in-memory content. I was wondering if there's a way to specify multiple values for the same field in a MemoryIndex. I've tried addField but it throws an exception ...
    Dave Seltzer
    Mar 7, 2012 at 2:39 pm
    Mar 12, 2012 at 3:34 pm
  • Hello, I'm trying to programmatically create a query but can't get it working. The query should return all results that match some prefix, but not any results that /exactly/ match the prefix (in the ...
    Mar 7, 2012 at 1:53 pm
    Mar 7, 2012 at 2:10 pm
  • Dear all, we started using Lucene version 3.0.3. We have different types of documents, like PDF, XLS, XLSX, DOC, DOCX, TXT, etc., in a specified folder. We have created an index on these files (using ...
    Prasad KVSH
    Mar 7, 2012 at 9:44 am
    Mar 7, 2012 at 11:27 am
  • Hi chaps, JoinUtil.createJoinQuery() specifies a Query for the from side of the join. Is it possible to query over both sides of the join (while still providing the two join fields)? If not, what is ...
    Kiwi clive
    Mar 29, 2012 at 12:04 pm
    Apr 4, 2012 at 5:38 pm
  • I have a string 01a_b-_-c-d, which is tokenized as 01a_b c d, and the string a_b-_-c_d, which is tokenized as a b c d. Why is there a difference when there is a digit at the beginning? I am using ...
    Nilesh Vijaywargiay
    Mar 27, 2012 at 6:04 pm
    Mar 28, 2012 at 6:35 am
  • Hi, I am using lucene-3.5 and getting an OutOfMemoryError on a large indexing task of 100M documents. I am creating an index with 3 UUIDs as separate field values. I am using Store.YES on 1 of them ...
    Ken McCracken
    Mar 19, 2012 at 7:30 pm
    Mar 20, 2012 at 10:21 pm
  • Greetings! First off, I realize Lucene is a search engine and therefore does not possess many of the features of a database. That being said, I have encountered a particular use case where I need to ...
    Mar 14, 2012 at 7:54 am
    Mar 14, 2012 at 11:40 am
  • There is a mismatch between the score for a wildcard match and an exact match. I search for 1.4196585 = (MATCH) max plus 0.1 times others of: 0.3763506 = (MATCH) ConstantScore(recording:luve*), product ...
    Paul Taylor
    Mar 9, 2012 at 10:40 am
    Mar 9, 2012 at 11:46 pm
  • I have a number of fields that either only ever have a term frequency of 1 or I don't want them to be disadvantaged if they do have a greater term frequency, and I never boost the field so I disable ...
    Paul Taylor
    Mar 6, 2012 at 9:42 pm
    Mar 6, 2012 at 11:11 pm
  • Hi, I'm new to Lucene. Can anyone tell me how I can start learning about it with the code base? I have knowledge of the Endeca search engine and have worked on it. So, if anyone could relate it and give ...
    Rahul reddy
    Mar 5, 2012 at 6:58 pm
    Mar 5, 2012 at 9:37 pm
  • Is there a reason why this doesn't return a count? Would a JIRA requesting same be viewed with any sympathy?
    Benson Margulies
    Mar 4, 2012 at 9:42 pm
    Mar 4, 2012 at 9:52 pm
  • I am walking down the document in an index by number, and I find that I want to update one. The updateDocument API only works on queries and terms, not numbers. So I can call remove and add, but, ...
    Benson Margulies
    Mar 4, 2012 at 4:32 pm
    Mar 4, 2012 at 4:47 pm
  • I've updated my codebase from 3.4 to 3.5 and, as part of that, took the advice to no longer optimize my indexes. During testing everything seemed okay, but since releasing to Live I have noticed the load on the ...
    Paul Taylor
    Mar 2, 2012 at 5:06 pm
    Mar 2, 2012 at 7:25 pm
  • Hi, I'm new to Lucene. I've indexed some documents with Lucene and need to sanitize them to ensure that they do not have any social security numbers (3-digits 2-digits 4-digits). (How) Can I write a ...
    Su ha
    Mar 2, 2012 at 7:22 am
    Mar 2, 2012 at 9:22 am
  • Hello all, has anyone tried Lucene on 64-bit? Please share your experience of its performance: how big are your database and RAM? Regards, Ganesh ...
    Mar 1, 2012 at 9:00 am
    Mar 2, 2012 at 5:03 am
  • Hi https://builds.apache.org/job/Lucene-trunk/javadoc/core/org/apache/lucene/document/DocValuesField.html The documentation at the above link indicates that the optimal way to add a DocValues field ...
    Sudarshan Gaikaiwari
    Mar 1, 2012 at 7:21 am
    Mar 1, 2012 at 2:25 pm
  • Dear list, I have a quite specific issue on which I would appreciate very much having some thoughts before I start the actual implementation. Here's my task description: I would like to index corpora ...
    Carsten Schnober
    Mar 6, 2012 at 2:40 pm
    Apr 11, 2012 at 4:13 pm
  • Hi all, I have upgraded my code to the latest trunk version. In my code I was using indexReader's function getTermFreqVector(spans.doc(), "Text", mapper); and my mapper based on the TermVectorMapper ...
    Rok Rejc
    Mar 23, 2012 at 1:30 pm
    Mar 23, 2012 at 5:21 pm
  • I'm new to Lucene and have searched the archives and FAQ for an answer on how to do something and have found no information. Our server is running Solr and is running out of memory when returning ...
    Doug Kunzman
    Mar 21, 2012 at 2:18 pm
    Mar 21, 2012 at 6:43 pm
  • Hi guys, I'm using Lucene for my project and I need to calculate how similar two (or more) documents are, using TF-IDF. How do I get TF-IDF with Lucene? Any insights on this? Thank you for your support ...
    Hassane Cabir
    Mar 12, 2012 at 5:38 pm
    Mar 13, 2012 at 2:25 am
  • Hello, I have a number of operations that I want to apply to a TokenStream, supplementing the original tokens with modified forms. For example, I want to reverse tokens, to allow prefix wildcard ...
    Alan Woodward
    Mar 12, 2012 at 4:47 pm
    Mar 12, 2012 at 4:52 pm
  • I have an ID field that contains about 100,000 unique ids. If I want to query all records with ids [1-100], how should I be doing this? I tried doing it the following way ...
    Kushal Dave
    Mar 6, 2012 at 4:35 pm
    Mar 6, 2012 at 5:15 pm
  • Hi there, is Java 7 now safe to use with Lucene? If so, is there a minimum Lucene version I must use with it? Thanks, - Chris
    Chris Bamford
    Mar 6, 2012 at 1:13 pm
    Mar 6, 2012 at 1:42 pm
  • Thank you to Simon Willnauer the Lucene PMC chair for taking some time to speak with us. You can download the podcast at: http://www.rce-cast.com/Podcast/rce-67-lucene-indexing-and-search.html If you ...
    Brock Palen
    Mar 5, 2012 at 7:36 pm
    Mar 5, 2012 at 9:25 pm
  • There's no javadoc on here yet, and I am a little puzzled by the fact that it is returning null for me. Does that imply that there can't be any deleted docs known to the reader? ...
    Benson Margulies
    Mar 5, 2012 at 6:54 pm
    Mar 5, 2012 at 7:11 pm
  • TopDocs top = searcher.search(contextQuery, filter, maxDocsToRetrieve); Which document fields are included in the calculation of the scores in the returned items? All fields? All fields touched in ...
    Benson Margulies
    Mar 4, 2012 at 11:10 pm
    Mar 5, 2012 at 4:10 pm
  • I'm testing lucene-gosen for Japanese tokenization and wondering what the differences are between the two jars provided (ipadic / chasen). In my preliminary testing, I'm not seeing any difference ...
    Thushara Wijeratna
    Mar 2, 2012 at 9:42 pm
    Mar 3, 2012 at 12:47 am
  • In the Javadoc page for the Similarity class, it says, "Lucene combines Boolean model (BM) of Information Retrieval with Vector Space Model (VSM) of Information Retrieval - documents "approved" by BM ...
    Mike O'Leary
    Mar 1, 2012 at 11:16 pm
    Mar 2, 2012 at 4:11 pm
  • If I happen to subclass MultiFieldQueryParser unnecessarily (thought I needed more than one default search but don't after all), would it have any impact on performance? Thanks, Paul ...
    Paul Taylor
    Mar 2, 2012 at 7:30 am
    Mar 2, 2012 at 9:24 am
  • Surge 2012, the scalability conference, September 27-28, Baltimore, MD has opened its CFP. Please visit http://omniti.com/surge/2012/cfp for details. -- Katherine Jeschke Director of Marketing and ...
    Katherine Jeschke
    Mar 30, 2012 at 2:02 pm
    Mar 30, 2012 at 2:02 pm
  • I'm using Lucene 2.9 and I need to display search results like filename, filepath, textContent, with the text content highlighting the matched string. When clicking on the search result, it will go to the exact file ...
    Neeraj shah
    Mar 30, 2012 at 4:43 am
    Mar 30, 2012 at 4:43 am
Group Overview
Group: java-user

77 users for March 2012

  • Michael McCandless: 28 posts
  • Benson Margulies: 22 posts
  • Uwe Schindler: 16 posts
  • Paul Taylor: 12 posts
  • Ian Lea: 10 posts
  • Robert Muir: 10 posts
  • Li Li: 9 posts
  • Paul Hill: 7 posts
  • Steven A Rowe: 7 posts
  • David Mosca: 5 posts
  • Deb Lucene: 5 posts
  • Ilya Zavorin: 5 posts
  • Jianwen lou: 5 posts
  • Ganesh: 4 posts
  • Kiwi clive: 4 posts
  • Tim Bogaert: 4 posts
  • Brandon Mintern: 3 posts
  • Damerian: 3 posts
  • Greg Bowyer: 3 posts
  • Grzegorz Tańczyk: 3 posts