Hey Stack,
Funny you should ask - I was trying to look up that "...Primetime..." thread
yesterday and after not finding it I realized user@hbase messages were missing.
Check
http://mail-archives.apache.org/mod_mbox/hbase-user/?format=atom using
Chrome now. I see "error on line 12582 at column 11: PCDATA invalid Char value
27", which matches what I see in our logs (interestingly, Firefox eats the error
just fine). The bad news is that we missed some user@hbase messages. The good
news is that this should go away very soon (as the problematic message gets
pushed down and out of the top N items we fetch from there) and that we have a
mechanism to back-fill missing data. Sorry about this glitch. If we/you see
this happening again, we'll see if we can make the XML parser we use more
forgiving or find one that doesn't choke as easily.
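
For what it's worth, here is a rough sketch of what "more forgiving" could look
like. It is not what we actually run: it assumes the 27 in the error is decimal,
i.e. the ESC control character (0x1B), and that the feed is parsed as a string.
The function names are made up for illustration. The idea is to strip the
characters XML 1.0 does not allow before handing the text to the parser.

```python
import re
import xml.etree.ElementTree as ET

# Characters XML 1.0 forbids in character data: the C0 control characters
# except tab (0x09), LF (0x0A) and CR (0x0D), plus U+FFFE and U+FFFF.
# ESC (0x1B, decimal 27) falls in this range.
_INVALID_XML_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\ufffe\uffff]")


def strip_invalid_xml_chars(text):
    """Return text with characters that XML 1.0 does not allow removed."""
    return _INVALID_XML_CHARS.sub("", text)


def parse_feed(text):
    """Parse feed text, dropping illegal characters first (hypothetical helper)."""
    return ET.fromstring(strip_invalid_xml_chars(text))
```

The trade-off is that the offending character is silently dropped from the
message text, which seems acceptable for indexing but would not be for
archiving.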
Otis
----
Sematext ::
http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search ::
http://search-lucene.com/
----- Original Message ----
From: Stack <stack@duboce.net>
To: HBase Dev List <dev@hbase.apache.org>
Sent: Wed, April 13, 2011 1:50:21 PM
Subject: Otis, how do we know the age of the search-hadoop.com index?
I was looking for an email thread posted yesterday, "Append value to a
cell", and this morning it's not in the index. Perhaps the indexer
hasn't run in between?
Sorry for the question. It's your fault for providing us a service
we've since come to depend on.
St.Ack