55 discussions - 200 posts

  • Hi, According to the Lucene In Action (Second Edition), the section 2.11.2 "Accessing an index over a remote file system" explains that there are issues related to accessing a Lucene index across ...
    Jong Kim
    Oct 1, 2012 at 10:06 pm
    Oct 2, 2012 at 3:25 pm
  • The exception "read past EOF" has been bothering me for a long time; trace below. Exception in thread "Lucene Merge Thread #7" org.apache.lucene.index.MergePolicy$MergeException: java.io.IOException: read past ...
    Oct 31, 2012 at 3:28 am
    Nov 7, 2012 at 9:46 am
  • Hi, Prior to search I have a concrete list of Lucene Document Ids (different every time) and I want to limit my search only to those specific documents. Is there a way to do it? I don't need to check ...
    Oct 16, 2012 at 8:37 am
    Oct 22, 2012 at 2:17 pm
  • We have the need to re-index some fields in our application frequently. Our typical document consists of a) Many single-valued {long/int} re-indexable fields b) Few large-valued {text/string} static ...
    Ravikumar Govindarajan
    Oct 25, 2012 at 10:10 am
    Nov 7, 2012 at 3:53 pm
  • Hi all, Sorry if I'm asking an age old question but we have migrated to lucene 3.6.0 and I see StandardAnalyzer has changed its behaviour, particularly when tokenizing email addresses. From reading ...
    Kiwi clive
    Oct 24, 2012 at 10:42 am
    Oct 25, 2012 at 9:11 am
  • I am using DefaultSimilarity and did not boost any field while indexing. My index is comprised of the following fields: - Title - Author - Bookname - Description All of the 4 fields are indexed and ...
    Siraj Haider
    Oct 22, 2012 at 9:26 pm
    Oct 29, 2012 at 2:14 pm
  • I need to take an html page that I retrieve from my lucene search and highlight all of the terms that are part of the search. I need to skip over any html tags since I don't want any words in tags ...
    Scott Smith
    Oct 24, 2012 at 12:01 am
    Nov 6, 2012 at 12:19 pm
  • Hello I am using Lucene 3.4 and I have nearly the same question like Jan in his post: http://mail-archives.apache.org/mod_mbox/lucene-java-user/200801.mbox/%3C4791E482.60900%40gmx.de%3E I could found ...
    Willi Haase
    Oct 25, 2012 at 1:30 pm
    Oct 26, 2012 at 10:42 am
  • Hi All, I'm reading the docs of Apache Lucene. I just read through the analyser docs at docs/core/org/apache/lucene/analysis/package-summary.html. Here they have given a code snippet, I've ...
    Selvakumar netaji
    Oct 5, 2012 at 11:33 am
    Oct 8, 2012 at 1:14 pm
  • Hi, I'm currently trying to search on the following search string in my Lucene index: "2012/0.124.323". The java code to search for ('value' is my search string) ---- QueryParser queryParser = new ...
    Jochen Hebbrecht
    Oct 1, 2012 at 12:59 pm
    Oct 1, 2012 at 3:33 pm
  • I'm currently converting some lucene code to 4.0. It appears that you are no longer allowed to delete a document by its ID. Is that correct? Is my only option to figure some kind of query (which ...
    Scott Smith
    Oct 26, 2012 at 11:35 pm
    Oct 29, 2012 at 8:20 pm
  • so there are lots of Qs that are asked about wanting to modify a lucene document (i.e. remove fields, add fields....) but are told that one needs to reindex. No one ever answers the technical Q of ...
    Shaya Potter
    Oct 22, 2012 at 7:23 pm
    Oct 22, 2012 at 8:00 pm
  • Hello, I have some custom queries & scorer that need to able to construct the "global" docIds (doc + docBase). But when i use these in a QueryWrapperFilter they no longer work, because ...
    Thomas Matthijs
    Oct 8, 2012 at 9:13 am
    Oct 8, 2012 at 5:36 pm
  • Hi, I'm new to Lucene and I'm reading the docs on Lucene. I read through the Lucene Index File Format, so as an exercise I tried to open the Lucene index in a text editor. The editor opened with ...
    Oct 1, 2012 at 5:05 am
    Nov 16, 2012 at 9:45 am
  • I am using updateDocument() method to update my document in the lucene index. Here is how I am doing it. writer.updateDocument(new Term(Constants.DOC_ID_FIELD, doc.get(Constants.DOC_ID_FIELD)), ...
    Deepak Shakya
    Oct 17, 2012 at 12:53 pm
    Oct 17, 2012 at 1:43 pm
  • Hi, I'm currently researching using a WFST suggester on e.g. book titles. While our basic use cases are well covered, there seem to be at least three which aren't: * The possibility to associate a ...
    Oliver Christ
    Oct 30, 2012 at 1:45 pm
    Oct 30, 2012 at 9:22 pm
  • Hello, using Lucene 4.0.0b, I am trying to get a superset of all stop words (for an international app). I have looked around, and not found anything specific. Is this the way to go? CharArraySet ...
    Oct 27, 2012 at 2:53 am
    Oct 28, 2012 at 5:42 am
  • Is there anything in Lucene 4.0 that provides 'absolute' scoring so that i can compare the scoring results of different searches ? To explain if I do a search for two values fred OR jane and there is ...
    Paul Taylor
    Oct 25, 2012 at 11:11 am
    Oct 25, 2012 at 6:06 pm
  • Hi guys, In previous versions of Lucene there was a class TermPositions that could be obtained from IndexReader. Is there something that replaces it in Lucene 4.0.0? Also is there some documentation ...
    Ivan Vasilev
    Oct 25, 2012 at 1:51 pm
    Oct 25, 2012 at 3:40 pm
  • Hi to all, I am using Apache Lucene with the Java Swing API. For searching, I am indexing the contents of HTML files. But the problem is that I have to give the complete software to the client and we don't ...
    Oct 10, 2012 at 11:29 am
    Oct 25, 2012 at 12:39 pm
  • Hi, We are using Lucene-core and we reindex once a day, and plan to do it more often per day soon. During reloading of the Lucene indexes, some of the slave servers don't recover and we have to ...
    Raghavan Parthasarathy
    Oct 23, 2012 at 9:04 pm
    Oct 25, 2012 at 2:00 am
  • Hi, I want to remove the data from an indexed field, not the documents containing that data. i.e. Suppose I have a field person containing some person names and I want to remove some un-named data from ...
    Oct 22, 2012 at 6:58 am
    Oct 22, 2012 at 1:08 pm
  • Hi, I've modified the HyphenationCompoundWordTokenFilter to emit less subtokens because the original filter can emit all kinds of subtokens that have a very different meaning on their own. I've ...
    Markus Jelsma
    Oct 4, 2012 at 1:38 pm
    Oct 5, 2012 at 10:49 am
  • We recently switched from QueryParser to ComplexPhraseQueryParser (from lucene-queryparser-3.6.0.jar), and we've come across two separate problems. The first is that because it parses quoted ...
    Brandon Mintern
    Oct 26, 2012 at 10:37 pm
    Nov 2, 2012 at 7:49 am
  • Hello, I have a Lucene index created, and I would like to know how to delete the index entries for files that no longer exist on the computer. Is there a way to do this from Lucene, or do I have to go file by file ...
    ViTi No
    Oct 30, 2012 at 12:48 pm
    Oct 31, 2012 at 12:35 am
  • Converting some code to lucene 4.0, it appears that we can no longer set whether we want to store norms or termvectors using the "sugared" Field classes (e.g., StringField() and TextField). I gather ...
    Scott Smith
    Oct 29, 2012 at 10:57 pm
    Oct 30, 2012 at 5:30 pm
  • Hi Guys, I use the following code to index documents and set Payloads to term positions: public class TestPayloads_ { private static final String INDEX_DIR = "E:/Temp/Index"; public static void ...
    Ivan Vasilev
    Oct 29, 2012 at 5:44 pm
    Oct 30, 2012 at 8:26 am
  • Hi guys, I've recently moved from Lucene 2.3 to 3.6. The application uses CF format. With Lucene 2.3, I understood the interaction of merge factor etc. with respect to how many files were created in ...
    Kiwi clive
    Oct 27, 2012 at 7:45 pm
    Oct 29, 2012 at 8:23 pm
  • How do I determine if the index has been modified in 4.0? The ifchanged() and isChanged() appear to have been removed.
    Scott Smith
    Oct 26, 2012 at 11:55 pm
    Oct 29, 2012 at 8:21 pm
  • Hello. We have an index that, when created using Lucene 2.3.2, has a size of about 4G. Creating the same index (with the same parameters) with Lucene 3.6.0 results in an 11G index. Could someone shed ...
    Kiwi clive
    Oct 26, 2012 at 5:50 pm
    Oct 26, 2012 at 8:28 pm
  • Hi Guys, When executing: IndexWriterConfig iwc = new IndexWriterConfig(Version.LUCENE_40, new MyAnalyzer()); I have the following exception: Exception in thread "main" ...
    Ivan Vasilev
    Oct 26, 2012 at 3:47 pm
    Oct 26, 2012 at 3:58 pm
  • *Scotas* brings a remarkable advancement to Enterprise Text Search. Scotas combines and synchronizes the high-performance, full-featured Solr/Lucene text search engine with the industry-leading Oracle ...
    Maximiliano Keen
    Oct 23, 2012 at 8:35 pm
    Oct 23, 2012 at 9:06 pm
  • Hi all, The next Open Source Search Social is on the 23rd Oct at The Plough, in Bloomsbury. We usually get a good mix of regulars and newcomers, and a good mix of backgrounds and experience levels, ...
    Richard Marr
    Oct 11, 2012 at 9:00 pm
    Oct 21, 2012 at 7:24 pm
  • Hi all, Together with Grant Ingersoll and Robert Muir we have submitted a paper to the "SIGIR 2012 Workshop on Open Source Information Retrieval" held on 16 Aug 2012 in Portland ...
    Andrzej Bialecki
    Oct 9, 2012 at 12:00 pm
    Oct 9, 2012 at 5:35 pm
  • Hello, I'm trying to generate the standard tokenizer again using the jflex specification (StandardTokenizerImpl.jflex) but I'm not able to do so due to some errors (I would like to create my own ...
    Oct 4, 2012 at 11:43 pm
    Oct 5, 2012 at 12:11 am
  • Hi, Do you have any information about when the pruning package will be available for Lucene 4.0 ? Best Regards Thanks in advance ZP -- View this message in context ...
    Zeynep P.
    Oct 12, 2012 at 2:34 pm
    Feb 20, 2013 at 11:57 am
  • Hi all, I am a recent user of the Lucene platform and have some difficulty to migrate from V3.6 to V4.0 a small IR fulltext application that requires to retrieve the positions of occurrence into the ...
    Pierre-Francois Marteau
    Oct 22, 2012 at 8:50 am
    Nov 17, 2012 at 2:12 am
  • Hi all, Lucene 4 has introduced several state-of-the-art ranking functions. I was wondering how I could make use of those similarities. These models obviously use some more term and collection ...
    Parnab kumar
    Oct 30, 2012 at 2:20 pm
    Nov 4, 2012 at 12:54 am
  • Hi, I've got a setup in which I would like to perform an arbitrary query over one field (typically realised through a WildcardQuery) and the matches are returned as a SpanQuery because the result ...
    Carsten Schnober
    Oct 29, 2012 at 12:41 pm
    Oct 30, 2012 at 10:17 am
  • Hi, Looking for feedback on running Solr Core/ Tika parsing engine on Azure. There's one offering for Solr within Azure from Lucid works. This offering however doesn't mention Tika. We are looking at ...
    Aloke Ghoshal
    Oct 29, 2012 at 7:22 am
    Oct 29, 2012 at 1:47 pm
  • Hi to all, I started to use benchmark 4.0 to create submission report files with the following code: BufferedReader br = new BufferedReader(fr); QualityQuery qqs[] = qReader.readQueries(br) ...
    Zeynep P.
    Oct 17, 2012 at 1:48 pm
    Oct 17, 2012 at 2:59 pm
  • Example: 2 documents: doc1: title: taxi; doc2: title: taxi driver. Query: TermQuery: title:taxi. How could doc1 have a better score than doc2? That's a very basic example. By rewriting a query, ...
    emmanuel Gosse
    Oct 14, 2012 at 11:52 am
    Oct 14, 2012 at 7:59 pm
  • Hello List, I'm currently trying to update my Lucene 3.6-application to 4.0. Most of it works (although your migration guide lacks a bit of aspects I had to figure out myself), but for one fairly ...
    Arjen van der Meijden
    Oct 12, 2012 at 2:04 pm
    Oct 12, 2012 at 2:22 pm
  • Hi All, How do I incorporate machine-learned ranking components into Lucene? I do not want to do re-ranking of documents (i.e. I do not want to pre-fetch documents and then look into their features). I ...
    Parnab kumar
    Oct 9, 2012 at 7:05 pm
    Oct 9, 2012 at 7:29 pm
  • Assignment Name: Search Engine Consultant (Lucene/ SOLR & J2EE exp.) Work Location: Brooklyn, NY Scheduled Work Hours: 9am to 5pm Monday to Friday Duration: 12+ Months Interview: In person/ F2F ...
    Oct 20, 2012 at 12:38 am
    Oct 20, 2012 at 12:38 am
  • Hi, I am pretty new to solr and I am trying to create Nested Indexes. But i am not able to find proper documentation or to get it working. Following is my issue/scenario: We have a List of Providers ...
    Divya Arunachalam
    Oct 19, 2012 at 4:50 pm
    Oct 19, 2012 at 4:50 pm
  • Greetings, There is a new book from O'Reilly on Lucene and Solr, and you can submit a case study about your Lucene project for inclusion in the book. Feel free to contact me directly, ...
    Jason Rutherglen
    Oct 19, 2012 at 12:59 pm
    Oct 19, 2012 at 12:59 pm
  • Hi All, I am currently using the solr-4.0.0 to allow searching the logs for one of our project. I am testing with a dataset of around 17Gb containing only logs. We prepare each line as one document ...
    Jain Rahul
    Oct 18, 2012 at 6:34 pm
    Oct 18, 2012 at 6:34 pm
  • Hi folks, Is there a reason why the setMaxDocCharsToAnalyze() method of WeightedSpanTermExtractor() is protected? The class is a perfect fit for my requirement (enumerating the list of terms present ...
    Dawn Zoë Raison
    Oct 17, 2012 at 8:25 pm
    Oct 17, 2012 at 8:25 pm
  • Hello, Please send me your resume if you are interested in the below position. Position: Java with Lucene. Location: San Jose, CA. Long term contract. 7 to 10 yrs experience. Lucene search experience, ...
    Oct 17, 2012 at 6:51 pm
    Oct 17, 2012 at 6:51 pm
Period: Oct 2012
Group: java-user

72 users for October 2012

Jack Krupansky: 16 posts; Ian Lea: 13 posts; Selvakumar netaji: 10 posts; Scott Smith: 8 posts; Jong Kim: 7 posts; Kiwi clive: 7 posts; Ivan Vasilev: 6 posts; Michael McCandless: 6 posts; Vitaly Funstein: 6 posts; Siraj Haider: 5 posts; Sxam: 5 posts; Thomas Matthijs: 5 posts; Uwe Schindler: 5 posts; Erick Erickson: 4 posts; Parnab kumar: 4 posts; Robert Muir: 4 posts; Simon Willnauer: 4 posts; Willi Haase: 4 posts; Apostolis Xekoukoulotakis: 3 posts; Deepak Shakya: 3 posts