we're facing a problem while reading AVRO files written with FLUME using
the AVRO Java API 1.5.4 into a HADOOP cluster. The Avro Data Store
complains about missing sync marker. Investigating the problem shows us,
that's perfectly right. The sync marker is missing. Thus we have a block
of the double size.
Our software packets:
rpm -qa | grep hadoop
This is pretty much all a basic cloudera
CDH3 Update 2 Packaging installation with a patched PIG version which is
CDH3 Update 3.
Did anyone had a similar issue? Does this ring a bell?