Lars George commented on HBASE-1684:

Stack, regarding your comment about whether both a reducer and a mapper are needed: for RestoreTable I am using both. The mapper reads from the backup files and then randomizes the rows by emitting each one under a random intermediate key. This is along the lines of what Ryan did with his pure randomizer MR class. That way all the RegionServers are hit equally.
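
To illustrate the random intermediate key idea, here is a minimal plain-Java sketch. It is not the attached RestoreTable class and uses no Hadoop classes: the class name `RandomKeyShuffleSketch`, the `randomize` method, and the use of a `TreeMap` to stand in for the MapReduce sort are all assumptions made for illustration. Backup files hold rows in key order, so writing them back as-is would load one region at a time; sorting on a random key spreads consecutive rows apart.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.TreeMap;

// Plain-Java stand-in for the map, shuffle and reduce phases (hypothetical names).
class RandomKeyShuffleSketch {

    static List<String> randomize(List<String> sortedRows, long seed) {
        Random random = new Random(seed);
        // The TreeMap plays the role of the MR sort on the intermediate key.
        TreeMap<Long, List<String>> byRandomKey = new TreeMap<>();

        // "Map" step: emit every row under a random intermediate key.
        for (String row : sortedRows) {
            long intermediateKey = random.nextLong();
            byRandomKey.computeIfAbsent(intermediateKey, k -> new ArrayList<>()).add(row);
        }

        // "Reduce" step: drop the random key and emit the rows in their new order.
        List<String> out = new ArrayList<>();
        for (List<String> rows : byRandomKey.values()) {
            out.addAll(rows);
        }
        return out;
    }
}
```

The output contains exactly the input rows, only in an order that no longer follows the row keys.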

For BackupTable I am using an IdentityTableMapper and encoding the data in the reducer so that it can be written out through TextOutputFormat. As we discussed a while ago with you and Jon, it should also be possible to use only a Mapper, do the encoding there, and set the number of reducers to 0, which hands the Mapper's records straight to TextOutputFormat.
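
With zero reducers, whatever line the mapper emits is what lands in the text file, so the encoding step has to produce a line that is safe for a line-oriented format. The message does not say which encoding the attached classes use; the sketch below assumes Base64 with a tab separator, and the names `BackupLineCodecSketch`, `encodeLine` and `decodeLine` are hypothetical. It also simplifies a row to a single value, whereas a real Result carries many KeyValues.

```java
import java.util.Base64;

// Hypothetical line codec: one row per line, fields Base64-encoded (an assumption).
class BackupLineCodecSketch {

    // Encode binary row key and value so the line contains no raw tabs or newlines.
    static String encodeLine(byte[] rowKey, byte[] value) {
        Base64.Encoder enc = Base64.getEncoder();
        return enc.encodeToString(rowKey) + "\t" + enc.encodeToString(value);
    }

    // Reverse of encodeLine; returns { rowKey, value }.
    static byte[][] decodeLine(String line) {
        String[] parts = line.split("\t", -1); // -1 keeps an empty trailing value
        if (parts.length != 2) {
            throw new IllegalArgumentException("expected 2 tab-separated fields");
        }
        Base64.Decoder dec = Base64.getDecoder();
        return new byte[][] { dec.decode(parts[0]), dec.decode(parts[1]) };
    }
}
```

The same codec can serve both directions: the backup side calls `encodeLine`, and the restore mapper calls `decodeLine` on each input line.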

Lastly, implementing Tool seems to be deprecated. The new mapreduce WordCount sample that comes with Hadoop 0.20 abandons it too. That is also why I changed RowCounter not to use it when I cleaned up the hbase.mapreduce package. The generic options are parsed with GenericOptionsParser directly inside main(), and the remaining arguments are then used for the specific MR job. I have done the same in the two attached classes.
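
The shape of that main() can be sketched without Hadoop on the classpath. The class below is a stand-in, not Hadoop's GenericOptionsParser: `GenericArgsSketch` is a hypothetical name, it only understands `-D key=value`, and the usage string and argument names are made up for illustration. It shows the split between generic options, which go into the configuration, and the remaining arguments, which belong to the job.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Stand-in for the "parse generic options, keep the rest" step (hypothetical class).
class GenericArgsSketch {
    final Map<String, String> conf = new LinkedHashMap<>();
    final List<String> remaining = new ArrayList<>();

    GenericArgsSketch(String[] args) {
        for (int i = 0; i < args.length; i++) {
            if (args[i].equals("-D") && i + 1 < args.length) {
                putProperty(args[++i]);   // generic option: goes into the configuration
            } else {
                remaining.add(args[i]);   // job-specific argument
            }
        }
    }

    private void putProperty(String keyValue) {
        int eq = keyValue.indexOf('=');
        if (eq <= 0) {
            throw new IllegalArgumentException("expected key=value, got: " + keyValue);
        }
        conf.put(keyValue.substring(0, eq), keyValue.substring(eq + 1));
    }

    public static void main(String[] args) {
        GenericArgsSketch parsed = new GenericArgsSketch(args);
        if (parsed.remaining.size() != 2) {
            System.err.println("Usage: [-D key=value ...] <tablename> <outputdir>");
            return;
        }
        // A real job would be configured from parsed.conf and the two arguments here.
        System.out.println("table=" + parsed.remaining.get(0)
                + " output=" + parsed.remaining.get(1) + " conf=" + parsed.conf);
    }
}
```

With the real Hadoop class, the equivalent step is constructing a GenericOptionsParser from the Configuration and the argument array and then calling getRemainingArgs().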

Backup (Export/Import) contrib tool for 0.20

Key: HBASE-1684
Project: Hadoop HBase
Issue Type: New Feature
Components: contrib
Affects Versions: 0.20.0
Reporter: Jonathan Gray
Assignee: Jonathan Gray
Fix For: 0.20.1

Attachments: HBASE-1684-v1.patch

Add a new Result/KeyValue based Export MapReduce job to contrib for 0.20.
Write it against the Hadoop 0.20 and HBase 0.20 MapReduce APIs and the HBase 0.20 client API (Result/Put).

Posted: Jul 22, '09 at 10:04p
Active: Sep 24, '09 at 4:55p
