40 discussions - 152 posts

  • Team, A NumericDocValueField("TIME_SORT",unixTimeStamp value) is indexed along with other fields. During search time, I've used NumericDocValuesSorter to get Sorter.DocMap so that I can access the ...
    Ramprakash Ramamoorthy
    Mar 6, 2014 at 10:21 am
    Mar 10, 2014 at 9:54 am
  • Hi everyone When I query a lucene index, I get back a list of document ids. This index search is fast. Now for all documents matching the result I need a unique String field called "id" which is ...
    Rohit Banga
    Mar 21, 2014 at 8:36 pm
    Mar 28, 2014 at 4:39 am
  • Hi to all, I'm trying to use the Replicator (http://lucene.apache.org/core/4_7_0/replicator/index.html?org/apache/lucene/replicator/package-summary.html) to backup my indexes while indexing. We have ...
    Roberto Franchini
    Mar 13, 2014 at 4:22 pm
    Mar 24, 2014 at 4:34 pm
  • Dear Team, we are using Lucene in our product. It searches well, with high speed and performance, but Japanese, Chinese, and Korean text is not searched properly. We used QueryParser QueryParser ...
    Mar 21, 2014 at 1:07 pm
    Mar 26, 2014 at 7:50 am
  • I am trying to analyze some japanese web pages for presence of slang/adult phrases in them using lucene-analyzers-kuromoji-4.6.0.jar. While the tokenizer breaks up the word into proper words, I am ...
    Rahul Ratnakar
    Mar 10, 2014 at 5:58 pm
    Mar 11, 2014 at 1:25 am
  • Greetings We have a problem whereby Lucene 4.7 occasionally does not apply a filter query during searching. The problem is intermittent. One in thirty or so searches will return what appears to be an ...
    Mar 28, 2014 at 11:00 am
    Mar 28, 2014 at 3:48 pm
  • Hi all, I am using lucene to index a large corpus of text, with every word being a separate document (this is something I cannot change), and I am hitting a limitation of the CompositeReader only ...
    Artem Gayardo-Matrosov
    Mar 21, 2014 at 4:29 pm
    Mar 21, 2014 at 9:09 pm
  • Hi We're planning to upgrade lucene-analyzers-commons 4.3.0 to 4.6.1 . While running our unit test with 4.6.1 it fails at org.apache.lucene.analysis.Tokenizer on line 88 (setReader method). There it ...
    Joe Wong
    Mar 20, 2014 at 7:58 pm
    Mar 20, 2014 at 11:03 pm
  • Hi All, Is there any way I could construct a query that would not automatically exclude negation terms (such as "no", "not", etc)? For example, I need to find strings like "not happy", "no idea", ...
    Natalia Connolly
    Mar 17, 2014 at 7:03 pm
    Mar 18, 2014 at 3:05 pm
  • Hi :) My index (Lucene 3.5) contains a field called title. Its value is indexed (analyzed and stored) with the WhitespaceAnalyzer and can contain HTML entities such as ’ or ° My problem is ...
    Mar 3, 2014 at 5:10 pm
    Mar 4, 2014 at 11:43 am
  • Dear all, I am trying to compute the cosine similarity between several documents. I have an indexed directory A made using 10000 files and another indexed directory B made using 20000 files. All the ...
    Stefy D.
    Mar 20, 2014 at 9:05 pm
    Mar 21, 2014 at 3:04 pm
  • Hi folks, We are currently using Lucene 4.5 and we are hitting some bottlenecks and appreciate some input from the community. This particular index (the disk size for which is about 10GB) is ...
    Arvind Kalyan
    Mar 5, 2014 at 9:18 am
    Mar 5, 2014 at 9:53 pm
  • Hello, I have a question about index size. I am testing a program that uses Lucene to index a dataset. In the end, the resulting index size varies a little, since I haven't finished the tests at ...
    Jose Carlos Canova
    Mar 25, 2014 at 5:37 am
    Mar 25, 2014 at 12:42 pm
  • Hi there I am searching for a way to store very large documents in a Lucene 4.7 index and keep them ready to use the PostingsHighlighter for search result highlighting. I do not want to read the ...
    Mirko Sertic
    Mar 24, 2014 at 2:03 pm
    Mar 25, 2014 at 12:47 am
  • Hi, I have a few questions related to the updatable DocValues API... It would be great if I can get help. 1. Is it possible to provide updateNumericDocValue(Term term, Map<String,Long>), in case I wish to ...
    Ravikumar Govindarajan
    Mar 14, 2014 at 10:15 am
    Mar 15, 2014 at 7:18 am
  • Hello, I just posted an article on Comparing Document Classification Functions of Lucene and Mahout. http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.html ...
    Koji Sekiguchi
    Mar 7, 2014 at 2:31 am
    Mar 9, 2014 at 7:57 am
  • Hi, Is there any data type in Lucene that supports functions like SortedDocValues for any numeric (int, long, float, double) type? SortedDocValues only supports bytes; I want a data type that can get ...
    Yonghui Zhao
    Mar 5, 2014 at 12:01 pm
    Mar 5, 2014 at 12:58 pm
  • Hello, does Lucene provide a zero or one character wildcard (like ? in Perl RegEx)? Example of what I mean: "house%" finds "house" and "houses" As far as I know in Lucene the ? wildcard is for ...
    Sven Teichmann
    Mar 25, 2014 at 1:57 pm
    Mar 25, 2014 at 4:37 pm
  • I have a usage pattern where I need to package up and store away all files from an index referenced by multiple commit points. To that end, I basically call IndexWriter.commit(), followed by ...
    Vitaly Funstein
    Mar 21, 2014 at 12:35 am
    Mar 21, 2014 at 6:01 pm
  • Hi, I have started migrating my Lucene search application from version 2.9 to 4.7.0. Please suggest the best way and best practices for this. There are many files to rewrite. Thank you, Narasimha.
    NarasimhaRao DPNV
    Mar 20, 2014 at 6:39 pm
    Mar 20, 2014 at 11:10 pm
  • Hi I'm trying to submit a lucene query string to my index to return a data based on a numeric range. I'm using the syntax provided in the Query Parser Syntax document but the results I get indicate ...
    Matthew Petersen
    Mar 20, 2014 at 8:44 pm
    Mar 20, 2014 at 9:46 pm
  • Hi, I posted this question on the Solr mailing list but it has more to do with Lucene. I have a performance and scoring problem for phrase queries 1. Performance - phrase queries involving frequent ...
    Manuel Le Normand
    Mar 12, 2014 at 9:28 am
    Mar 12, 2014 at 2:13 pm
  • I'm working on a system that uses Lucene 4.6.0, and I have a couple of use cases for documents that modify themselves as they're being indexed. For example, we have text classifiers that we would ...
    Stephen Green
    Mar 11, 2014 at 6:34 pm
    Mar 11, 2014 at 8:40 pm
  • Hi, Phrases with stop words in them are not getting searched whereas a phrase without it gets searched using ComplexPhraseQueryParser/SpanQueryParser. SpanQueryParser reference ...
    Modassar Ather
    Mar 5, 2014 at 7:40 am
    Mar 7, 2014 at 6:04 am
  • Hello, I would like to use Apache *Lucene 4*.x and count words in the string, for example: "I loved cats, but now I really love dogs" - count "love" word in the String - result should be 2. I would ...
    Hollow Quincy
    Mar 28, 2014 at 2:35 pm
    Mar 28, 2014 at 6:43 pm
  • Hi, I would like to seek assistance regarding the following issue I'm encountering. I'm running Tomcat and have deployed Jena (2.6.2) and Lucene Core (2.9.0), whereby Jena invokes the Lucene classes ...
    Turri, Albert (ELS-NYC)
    Mar 25, 2014 at 5:29 pm
    Mar 25, 2014 at 5:34 pm
  • I was just trying to implement a StoredFieldsWriter[4.6.1] and found that finishDocument() method has an empty impl. Any reason for not declaring it abstract? We could easily miss over-riding it -- ...
    Ravikumar Govindarajan
    Mar 13, 2014 at 12:22 pm
    Mar 13, 2014 at 3:07 pm
  • I have a small set of document numbers as a query result collected with some non-scoring collector. Now, I want to send high-performant successive queries only in this document number scope, as part ...
    Christian Reuschling
    Mar 12, 2014 at 5:41 pm
    Mar 12, 2014 at 8:14 pm
  • Hello, what is the best method to score documents similar to default similarity, but the document frequency should be calculated per query against the matching result document set, not statically ...
    Christian Reuschling
    Mar 6, 2014 at 6:34 pm
    Mar 8, 2014 at 1:28 am
  • Hi; The Tf-Idf explanation says that: *idf(t)* appears for *t* in both the query and the document, hence it is squared in the equation. DefaultSimilarity does not square it. What is the explanation of ...
    Furkan KAMACI
    Mar 6, 2014 at 11:29 pm
    Mar 7, 2014 at 2:06 pm
  • Hi, is there a way I can search in a file to match my synonyms, instead of building a SynonymMap? My synonym list is going to be very large and I don't want to keep it in memory. Regards Geet
    Geet Gangwar
    Mar 3, 2014 at 2:02 pm
    Mar 3, 2014 at 5:10 pm
  • Sir, I am a PG student. My research topic is to optimize the indexing file [reduce index file size, RAM usage, CPU utilization, and create index with payload to improve searching speed]. Currently ...
    Mar 2, 2014 at 2:07 pm
    Mar 2, 2014 at 4:57 pm
  • Dear open source projectors: How to implement and search? I can't do it. ------------------ From Lmc in China
    Mar 1, 2014 at 12:52 pm
    Mar 1, 2014 at 5:30 pm
  • Hi, Please tell me how I can implement HitCollector in Lucene 4.7.0 when migrating from previous versions. I didn't find HitCollector in 4.7. Thanks & Regards, Narasimha.
    NarasimhaRao DPNV
    Mar 26, 2014 at 3:48 pm
    Mar 26, 2014 at 3:48 pm
  • We are using Lucene 3.6 to perform incremental indexing. We use an algorithm we found on the web to perform the incremental indexing. 1. For each file that we indexed, we create a UID field to ...
    Mar 24, 2014 at 5:02 pm
    Mar 24, 2014 at 5:02 pm
  • Hi all, Does anybody know of a way of getting a breakdown of the disk space a particular field takes up in a lucene index? I'm experimenting with different query-time and index-time field ...
    Alan Woodward
    Mar 21, 2014 at 10:34 am
    Mar 21, 2014 at 10:34 am
  • Hi , Want to Know Big Data / Hadoop ? If yes , join us for Webinar by industry experts at below link. *Webinar on Hadoop, Hosted by : Manoj , Research Director* *Join us for a webinar on Mar 19, 2014 ...
    Vivek Kumar
    Mar 19, 2014 at 12:21 pm
    Mar 19, 2014 at 12:21 pm
  • For example, given a new big department merged from three departments. A few employees worked for two or three departments before merging. That means, the attributes of one person might be listed ...
    Mobius ReX
    Mar 17, 2014 at 6:03 pm
    Mar 17, 2014 at 6:03 pm
  • Hi, the Apache Lucene/Solr committers decided with a large majority on the vote to require Java 7 for the next minor release of Apache Lucene and Apache Solr (version 4.8)! Support for Java 6 by ...
    Uwe Schindler
    Mar 12, 2014 at 2:07 pm
    Mar 12, 2014 at 2:07 pm
  • Hello, If I want to print out all strings within distance 2 of a given string, how do I do it? I'm totally new to Lucene, so below are some parts that I'm starting to assemble. Any tips are welcome ...
    Robert Servensen
    Mar 5, 2014 at 12:47 am
    Mar 5, 2014 at 12:47 am
Group Overview
group: java-user @
period: Mar 2014

60 users for March 2014

  • Michael McCandless: 19 posts
  • Uwe Schindler: 13 posts
  • Jack Krupansky: 6 posts
  • Ramprakash Ramamoorthy: 5 posts
  • Rohit Banga: 5 posts
  • Rahul Ratnakar: 4 posts
  • Roberto Franchini: 4 posts
  • Shai Erera: 4 posts
  • Artem Gayardo-Matrosov: 3 posts
  • Arvind Kalyan: 3 posts
  • G.Long: 3 posts
  • Herb Roitblat: 3 posts
  • Jamie: 3 posts
  • Joe Wong: 3 posts
  • Jose Carlos Canova: 3 posts
  • Manuel Le Normand: 3 posts
  • Natalia Connolly: 3 posts
  • Robert Muir: 3 posts
  • Tri Cao: 3 posts
  • Allison, Timothy B.: 2 posts