Search Discussions

60 discussions - 224 posts

  • I've built a search prototype feature for my application using Lucene, and it works great. The application monitors a remote system and currently indexes just a few core attributes of the objects on ...
    Becker, ThomasBecker, Thomas
    Feb 6, 2013 at 1:50 pm
    Mar 4, 2013 at 8:49 pm
  • I understand from the JIRA ticket(Lucene-3640) that the IndexSearcher.close() is no-op operation but not very clear on why it is a no-op? Could someone shed some light on this? We were using this ...
    Feb 18, 2013 at 6:33 pm
    Feb 19, 2013 at 6:47 pm
  • In Lucene 3.6 I had code that replicated a Dismax Query, and the search used fuzzy queries in some cases to match values. But I was finding the score attributed to matches on fuzzy searches was ...
    Paul TaylorPaul Taylor
    Feb 26, 2013 at 4:34 pm
    Apr 4, 2013 at 8:21 pm
  • Hi all, Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene Spellchecker) behaviour i think i found a bug when the input is a 6 letter word: - george - anthem - argued - fluent ...
    Samuel García MartínezSamuel García Martínez
    Feb 20, 2013 at 11:35 pm
    Feb 21, 2013 at 9:32 pm
  • Hello all, From the grouping javadoc, I read that fields that are supposed to be grouped should not be tokenized. I have an use case where the user has the freedom to group by any field during search ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Feb 18, 2013 at 2:13 pm
    Feb 20, 2013 at 5:58 am
  • Hello list, I have a scenario wherein I need an in-memory index as I need faster search. The problem goes like this : I have a list which contains a couple of thousands words. Each word has a ...
    Mohammad TariqMohammad Tariq
    Feb 11, 2013 at 3:15 pm
    Feb 12, 2013 at 8:20 pm
  • I'm looking for a way of making a query on words which contain wildcards (* or ?). In general, we use wildcards in query, not in the text. I haven't find anything in Lucene to build that ...
    Nicolas RoduitNicolas Roduit
    Feb 8, 2013 at 7:49 am
    Feb 8, 2013 at 8:18 pm
  • Just updating codebase from Lucene 3.6 to Lucene 4.1 and seems my tests that use NormalizeCharMap for replacing characters in the anyalzers are not working. Below Ive created a self-contained test ...
    Paul TaylorPaul Taylor
    Feb 20, 2013 at 11:29 am
    Feb 26, 2013 at 10:04 am
  • Dear all, I read from this page http://lucene.apache.org/core/4_1_0/grouping/index.html that, grouping is possible only when I do a block index and place a binary marker at the end of each block ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Feb 20, 2013 at 7:24 am
    Feb 25, 2013 at 6:49 am
  • What is equivalent to Document.setBoost() from Lucene 3.6 inLucene 4.1 ? --------------------------------------------------------------------- To unsubscribe, e-mail: <span class="m_body_email_addr" ...
    Paul TaylorPaul Taylor
    Feb 18, 2013 at 12:17 pm
    Feb 18, 2013 at 6:26 pm
  • Team, I am facing a strange issue with term queries and stored fields. Here is how I index and fetch the query results, Case 1 : doc.add(new StoredField("published", b.getPublished())); Query query = ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Feb 11, 2013 at 12:53 pm
    Feb 13, 2013 at 5:20 am
  • Hi! I'm sorry I didn't do any hard research on this, it's so quick to ask. ;) Is it possible to somehow find the count of each term in a set for each document returned by a query? For instance, if I ...
    Lars-Erik AabechLars-Erik Aabech
    Feb 22, 2013 at 9:14 am
    Feb 22, 2013 at 11:24 am
  • Hi, How can I add field to hold a Java map object in such way that the "[", "]", "," are preserved? Thanks!
    Feb 13, 2013 at 1:34 am
    Feb 14, 2013 at 9:50 am
  • Pierre Antoine DuBoDeNaPierre Antoine DuBoDeNa
    Feb 9, 2013 at 10:20 am
    Feb 10, 2013 at 3:14 pm
  • Hello, I want to implement a central index, and I heard about Lucene, so I would like to ask your help to install it and configure it. My OS is Windows 7/XP/Server 2008. If I could index just one ...
    Álvaro Vargas QuezadaÁlvaro Vargas Quezada
    Feb 5, 2013 at 12:58 pm
    Feb 6, 2013 at 3:31 pm
  • Hi, I have hacked together a small web front end to the Glimpse text indexing engine (see http://webglimpse.net/ for information). I am very happy with how Glimpse indexes and searches data. If I ...
    Mathias DahlMathias Dahl
    Feb 4, 2013 at 6:02 pm
    Feb 6, 2013 at 7:34 am
  • Hello all I am a lucene novice and trying to setup lucene in a .net app using lucene.net for searching through documents So far it has been fantastic, however given that the users expectations are ...
    Feb 25, 2013 at 3:24 am
    Feb 27, 2013 at 2:23 am
  • Hi, I am a Lucene user and I have a few questions about Lucene filters. I appreciate it if someone can shed light on this. 1. Is Lucene filters such as TermsFilter thread-safe in general? The ...
    Wei WangWei Wang
    Feb 25, 2013 at 1:24 am
    Feb 26, 2013 at 12:47 am
  • Hey Guys, I'm trying to figure out what would be a better approach to indexing when it comes to a large number of records (say 1 billion). As far as queries: 1. Only support exact matches (a field is ...
    Drew KutcharianDrew Kutcharian
    Feb 5, 2013 at 10:10 pm
    Feb 6, 2013 at 2:37 pm
  • Hello, Is there anything in the Lucene core/contrib that could help detect if a keyword is CJK or not? I was thinking that an okay heuristic might be to inspect if the keyword's characters unicode ...
    Gili NachumGili Nachum
    Feb 21, 2013 at 10:51 pm
    Mar 11, 2013 at 8:59 am
  • Hi all! I'm using SpanQuery.getSpans() method for retrieving the precise information about all the matches through all the documents in the index. My question, is it possible to to get the spans in ...
    Igor ShalyminovIgor Shalyminov
    Feb 23, 2013 at 6:47 pm
    Feb 25, 2013 at 10:50 am
  • We're using Lucene 3.3.0 and have an index with close to 10 million documents. When the application is started and the first search is performed, we open a read only IndexReader by calling ...
    Wendy MeadowsWendy Meadows
    Feb 13, 2013 at 6:28 pm
    Feb 14, 2013 at 2:21 pm
  • This is what I want, yes. But I begin to understand that this is not possible without storing additional meta-data as neither the index nor the documents preserve the type info (correct me if I'm ...
    Rolf VeenRolf Veen
    Feb 1, 2013 at 8:18 am
    Feb 6, 2013 at 10:30 am
  • I am using the TieredMergePolicy and using the compound index: TieredMergePolicy mergePolicy = new TieredMergePolicy(); indexWriterConfig.setMergePolicy(mergePolicy.setNoCFSRatio(1.0d)); Prior to ...
    Feb 1, 2013 at 5:51 pm
    Feb 1, 2013 at 11:44 pm
  • Hi List, I've encountered this problem using Solr (4.1.0), but as far as I could trace it, I think it belongs to this list. I have the following field type in my schema: <fieldType ...
    Paul AlexandrowPaul Alexandrow
    Feb 14, 2013 at 6:23 pm
    Mar 9, 2013 at 6:58 pm
  • This works in 3.6, but in 4.1 fails whats wrong with the code public void testTokenization() throws IOException { StringBuffer sb = new StringBuffer(); for(char i=0;i<100;i++) { Character c = new ...
    Paul TaylorPaul Taylor
    Feb 26, 2013 at 12:26 pm
    Feb 26, 2013 at 1:44 pm
  • Strange test failure after converting code from Lucene 3.6 to Lucene 4.1 public void testIndexPuid() throws Exception { addReleaseOne(); RAMDirectory ramDir = new RAMDirectory() ...
    Paul TaylorPaul Taylor
    Feb 19, 2013 at 8:57 pm
    Feb 19, 2013 at 10:56 pm
  • Any other Lucene/Solr enthusiasts attending ApacheCon in Portland next week? Maybe we can pick a time and place to grab a few beers and exchange some ideas and user knowledge. -- Nico Krijnen ...
    Nico KrijnenNico Krijnen
    Feb 19, 2013 at 11:21 am
    Feb 19, 2013 at 6:47 pm
  • Could any one throw light on this. Im using Lucene 3.0.3. I am having multiple shards and using ParallelMultiSearcher to search across shards. Exception: java.util.concurrent.ExecutionException ...
    Ganesh MGanesh M
    Feb 14, 2013 at 12:44 pm
    Feb 15, 2013 at 6:07 am
  • I'm starting with Lucene 4 and have built my own analyzer with stemming and synonyms. This works perfectly. I built a Lucene index with several documents (with an ID) containing a text (with ...
    Nicolas RoduitNicolas Roduit
    Feb 6, 2013 at 9:17 am
    Feb 8, 2013 at 7:34 am
  • The following query does not seems to work after we upgrade from 2.4 - 4.0 *+type:sometype +title:sometitle** Any ideas as to what are some of the places to look for? Is the above Query correct in ...
    Feb 26, 2013 at 11:52 pm
    Feb 27, 2013 at 4:54 am
  • This code worked in 3.6 but now throws nullpointer exception in 41, Im not expecting there to be a token created, but surely it shouldn't throw NullPointerException Tokenizer tokenizer = new ...
    Paul TaylorPaul Taylor
    Feb 26, 2013 at 12:30 pm
    Feb 26, 2013 at 2:22 pm
  • What replaces the computeNorm method in DefaultSimilarity in 4.1 Ive always subclassed DefaultSimilarity to resolve an issue whereby when document has multiple values in a field (because has one-many ...
    Paul TaylorPaul Taylor
    Feb 19, 2013 at 11:43 am
    Feb 26, 2013 at 10:48 am
  • In my 3.6 code I was adding numeric field to my index as follows: public void addNumericField(IndexField field, Integer value) { addField(field, NumericUtils.intToPrefixCoded(value)); } but I've ...
    Paul TaylorPaul Taylor
    Feb 25, 2013 at 10:51 am
    Feb 25, 2013 at 10:57 am
  • I know that general questions about aggregate functions have been asked here before a number of times, but I would like to figure out how to solve at least one specific subset of this issue. Namely, ...
    Vitaly FunsteinVitaly Funstein
    Feb 20, 2013 at 11:32 pm
    Feb 25, 2013 at 10:25 am
  • I am currently writing an indexer class to index texts from stdin. I also need the text to be tokenized and stored to access the termvector of the document. I tweaked the lucene indexer from the demo ...
    A. L. BenhenniA. L. Benhenni
    Feb 19, 2013 at 10:04 am
    Feb 19, 2013 at 11:51 pm
  • Hi, I have two questions 1.How to Get the enumeration of Terms Ending with a given word I saw we can get enumerations of word starting at a given word by Indexreader.terms(term())) method 2.Actually ...
    Feb 14, 2013 at 10:43 am
    Feb 18, 2013 at 10:24 pm
  • Hello, A bit off topic, but… could someone recommend a text summarizer? Something along the lines of Open Text Summarizer or such: http://libots.sourceforge.net What's the state of the art in text ...
    Petite AbeillePetite Abeille
    Feb 17, 2013 at 10:11 am
    Feb 17, 2013 at 1:25 pm
  • Hi, What is the equivalent analyzer of LimitTokenCountAnalyzer in 4.1? Thanks!
    Feb 14, 2013 at 12:46 pm
    Feb 14, 2013 at 2:07 pm
  • Finally I can group the results of my query, but thd sorter of the results is not my idea. what I want is the rank of *the group's total hits*. Can anyone help me ? thx ----- ...
    Feb 3, 2013 at 8:18 am
    Feb 9, 2013 at 7:25 am
  • hello, I have the following question: is it possible to combine the result of a custom score, with the normal relevance ranking score ? We'd like to have a scoring where documents that have a "title" ...
    Feb 8, 2013 at 10:41 am
    Feb 8, 2013 at 12:32 pm
  • Hello, I have an application where a great many documents may not have any terms after StandardAnalyzer has had its way with the body. In that case, depending on some other metadata, I may not wish ...
    Jon StewartJon Stewart
    Feb 6, 2013 at 7:32 pm
    Feb 6, 2013 at 7:56 pm
  • I'd like to compare the relevance scores that are returned when using the Similarity classes that are available in Lucene 4.x, and it seems like using the Benchmark component would be a good way to ...
    Michael O'LearyMichael O'Leary
    Feb 4, 2013 at 11:01 pm
    Feb 6, 2013 at 4:21 pm
  • Hi! I wonder where one can get information about current Lucene (v 4.1) core search classes - AtomicReader, CompositeReader, ReaderContexts - and how to use them properly for building custom search ...
    Igor ShalyminovIgor Shalyminov
    Feb 5, 2013 at 11:45 am
    Feb 6, 2013 at 8:40 am
  • Hello anyone, We recently ran into something people might not be fully aware of. Specifically, because codec jars require META-INF/services files in order to be discovered, and each codec has the ...
    Karl WrightKarl Wright
    Feb 4, 2013 at 4:39 pm
    Feb 4, 2013 at 4:45 pm
  • It is by design, and 2.4 works the same way. Are you closing or committing your IndexWriter after each added document? Because if you add 100 docs you should not see 100 versions of these files, only ...
    Michael McCandlessMichael McCandless
    Feb 1, 2013 at 11:51 am
    Feb 1, 2013 at 5:43 pm
  • rt, I'm totally puzzled, Can anyone explain it with an example ? thx. ----- -------------------------- Email: <span class="m_body_email_addr" title="a9ecd3c1009b3a05b4391c2a8698fbd8" ...
    Feb 1, 2013 at 10:13 am
    Feb 1, 2013 at 11:38 am
  • Hi Mike, Thanks for your reply.. MY Scenario is I am creating Lucene Index with Two Fields 1.Filename 2.File Contents For Example I initially added fields FileName:-say LuceneInAction.pdf which is ...
    Feb 1, 2013 at 6:12 am
    Feb 1, 2013 at 9:43 am
  • FuzzyQuery constructor now takes parameter maxEdits instead of parameter minSimilarity. But I'm unclear how to map from the old value to the new value or whether they are unrelated and can no longer ...
    Paul TaylorPaul Taylor
    Feb 26, 2013 at 3:02 pm
    Feb 26, 2013 at 3:02 pm
  • Hi all, I am implementing query auto completion by using TSTAutocomplete in lucene 3.6.0. In my understanding, prefixCompletion returns a set of suggestion for completion. API document says it ...
    김성찬(Seongchan Kim)김성찬(Seongchan Kim)
    Feb 25, 2013 at 2:55 pm
    Feb 25, 2013 at 2:55 pm
Group Navigation
period‹ prev | Feb 2013 | next ›
Group Overview
groupjava-user @

75 users for February 2013

Paul Taylor: 20 posts Ian Lea: 19 posts Michael McCandless: 17 posts Ramprakash Ramamoorthy: 13 posts Jack Krupansky: 10 posts Uwe Schindler: 10 posts Saisantoshi: 6 posts Chris Bamford: 5 posts Mohammad Tariq: 5 posts Simon Willnauer: 5 posts Adrien Grand: 4 posts Cheng: 4 posts Eric Charles: 4 posts Igor Shalyminov: 4 posts Lars-Erik Aabech: 4 posts Pierre Antoine DuBoDeNa: 4 posts Samuel García Martínez: 4 posts Kumar: 3 posts Mathias Dahl: 3 posts Nicolas Roduit: 3 posts
show more