Funny you should ask - I was trying to look up that "...Primetime..." thread
yesterday and after not finding it I realized user@hbase messages were missing.
Chrome now. I see "error on line 12582 at column 11: PCDATA invalid Char value
27", which matches what I see in our logs (interestingly, Firefox eats the error
just fine). The bad news is that we missed some user@hbase messages. The good
news is that this should go away very soon (as the problematic message gets
pushed down and out of top N items we fetch from there) and that we have a
mechanism to back-fill missing data. Sorry about this glitch. If we/you see
this happening, we'll see if we can make the XML parser we use more forgiving or
find one that doesn't choke as easily.
Sematext :: http://sematext.com/
:: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
----- Original Message ----
From: Stack <firstname.lastname@example.org>
To: HBase Dev List <email@example.com>
Sent: Wed, April 13, 2011 1:50:21 PM
Subject: Otis, how do we know the age of the search-hadoop.com index?
I was looking for an email thread posted yesterday, "Append value to a
cell", and this morning its not in the index. Perhaps the indexer
hasn't run in between?
Sorry for the question. Its your fault for providing us a service
we've since come to depend on.