Grokbase Groups HBase user July 2008

Search Discussions

66 discussions - 326 posts

  • Hi, after our issues ("Replay of HLog required", in a precious thread) with HBase, it seems that HBase has corrupted regions. We have, on the three region servers, errors stating that HBase cannot ...
    Renaud DelbruRenaud Delbru
    Jul 23, 2008 at 3:03 pm
    Jul 25, 2008 at 10:05 pm
  • Hi guys. Is there a way of retrieving multiple "rows" with one server call ? Something like MySQL's "where id in (a,b,c...) Or more like this. List<SortedMap<Text,byte[] rows = HTable.getRows(Text[] ...
    Marcus HerouMarcus Herou
    Jul 28, 2008 at 9:39 am
    Sep 2, 2008 at 8:40 pm
  • The first 0.2.0 release candidate is available for download: Please take this release candidate for a spin. Check the documentation, that unit ...
    Jul 22, 2008 at 10:25 pm
    Jul 25, 2008 at 3:12 pm
  • I am testing HBase 0.1.2 and am getting the following performance using RowCounter class (I had to modify the main() method of the original class because it contains some hardcoded parameters :-) ...
    Yair even-zoharYair even-zohar
    Jul 9, 2008 at 3:54 pm
    Jul 14, 2008 at 9:02 pm
  • Hello, I created a map/reduce process by extending the TableMap and TableReduce API but for some reason when I run multiple mappers, in the logs its showing that the same rows are being processed by ...
    Dru JensenDru Jensen
    Jul 30, 2008 at 8:34 pm
    Aug 4, 2008 at 5:41 pm
  • hi all, i have been reading the docs on HQL and thought it would be a great to use programatically. but after checking out the src, it seems it is only used for the shell. is that the intended usage ...
    lucio Piccolilucio Piccoli
    Jul 21, 2008 at 3:03 pm
    Jul 22, 2008 at 8:52 pm
  • Hi, I have installed Hadoop and HBase on two different linux machines one being Ubuntu and the other Xentos . I was also able to start slave(Xentos) from the master(Ubuntu). And the output of "jps" ...
    Srikanth BondalapatiSrikanth Bondalapati
    Jul 14, 2008 at 6:13 pm
    Jul 22, 2008 at 2:33 pm
  • hi all, it's a bit strange, but i cant find some class or method to get the 'size' of a created table - maybe the total size of all the HStores ? or is there any command in HQL can do this? Thanks. ...
    Jul 19, 2008 at 9:59 pm
    Jul 21, 2008 at 6:02 pm
  • Hi, I am using HBase 0.2.0-dev, Hudson Build #208. I am experiencing a problem with HBase that looks like the issue During an upload intensive task, ...
    Renaud DelbruRenaud Delbru
    Jul 18, 2008 at 9:12 am
    Jul 21, 2008 at 1:35 pm
  • Hi Guys I use hbase (amongst other things) to crawl some repos of infomation and util now I've been using the Nutch segment generation paradigm. I would very much like to skip the segment generation ...
    David AlvesDavid Alves
    Jul 31, 2008 at 1:07 pm
    Aug 8, 2008 at 12:02 am
  • I am having problems with my HRegionServers aborting following rolling their logs and thus not being able to report to the HMaster for a while. Eventually all 5 HRegionservers have exited and my MR ...
    William Clay MoodyWilliam Clay Moody
    Jul 23, 2008 at 10:18 pm
    Jul 24, 2008 at 9:23 pm
  • Hi All, I cannot start HBase master when setting root directory of HBase is a folder in HDFS. Hadoop version: 0.17.1 HBase version: 0.2.0 My hbase-site.xml configuration file <configuration <property ...
    Jul 24, 2008 at 9:55 am
    Jul 24, 2008 at 3:05 pm
  • As of Friday August 1, when the acquisition of Powerset by Microsoft closes, Michael Stack and I will not be able to make further contributions to HBase until a process is developed within Microsoft ...
    Jim KellermanJim Kellerman
    Jul 31, 2008 at 1:15 am
    Jul 31, 2008 at 10:29 pm
  • /// This is very close to the example in the javadoc already(Bytes,BatchUpdate) instead of (text/mapwritable), and i find it to be the easiest way to get people started/motivated with HBase. package ...
    Alex NewmanAlex Newman
    Jul 3, 2008 at 3:40 pm
    Jul 11, 2008 at 5:10 am
  • hbase-0.2.0 was supposed to target hadoop-0.17. When a new version of hadoop is release, should we publish a release of hbase that is compatible with it? --- Jim Kellerman, Senior Engineer; Powerset
    Jim KellermanJim Kellerman
    Jul 29, 2008 at 3:03 am
    Jul 29, 2008 at 7:22 am
  • I've had a little bit of weird behavior. I am opening a scanner in the configure method of a Map task to load a simple little in-memory map (I'd love to this with in-memory column stores, but that's ...
    Daniel LeffelDaniel Leffel
    Jul 22, 2008 at 6:01 pm
    Jul 23, 2008 at 11:01 pm
  • Greetings, I am right now using hbase in our project in "stand-alone" mode. It worked well until today I found the following message in the log: 2008-07-21 01:16:38,564 FATAL ...
    Yabo-Arber XuYabo-Arber Xu
    Jul 21, 2008 at 9:17 am
    Jul 21, 2008 at 8:02 pm
  • Hi all, I found that I can not stop thinking in RDBM way while designing tables for the application I am working on, so that I need your help. Can you please take a look at the tables below and ...
    Pavel LysovPavel Lysov
    Jul 14, 2008 at 4:17 pm
    Jul 15, 2008 at 8:24 am
  • I run a job on a old table webdata that I have had before I updated to latest trunk and everything runs fine then I made a new table just like this one with the name webdata_test and this is what I ...
    Billy PearsonBilly Pearson
    Jul 4, 2008 at 8:36 am
    Jul 15, 2008 at 3:59 am
  • Hi all, I'm having a little problem with our tests that use hbase. First, I run a test which generate all of the hbase tables, and exits. Then for each test, I copy over the hbase directory, and the ...
    Clint MorganClint Morgan
    Jul 8, 2008 at 12:24 am
    Jul 8, 2008 at 7:15 pm
  • Hi, I feel lack of mapreduce approach understanding and would like to ask some questions (mainly on its reduce part). Below is reduce job that gets values count for given row key and inserts ...
    Jul 30, 2008 at 2:10 pm
    Aug 1, 2008 at 1:49 pm
  • Hi all, I have a large dataset saved in a hadoop cluster, and now I want to copy these data from this hadoop cluster into another hadoop cluster, who can tell me how? Thank you very much ! Best ...
    Ma qiangMa qiang
    Jul 25, 2008 at 4:06 am
    Jul 25, 2008 at 4:44 am
  • I want to use hbase to maintain a very large dataset which needs to be updated pretty much continuously. I'm creating a record for each entity and including a creation timestamp column as well as ...
    Jul 18, 2008 at 8:42 pm
    Jul 21, 2008 at 8:04 pm
  • Heads up for anyone else running into this issue. I found that you cannot use internal classes for your Map or Reduce class if you're extending TableMap/TableReduce. If you try, you get a ...
    Daniel LeffelDaniel Leffel
    Jul 24, 2008 at 9:38 pm
    Jul 25, 2008 at 12:12 am
  • Hi! We are writing a MR job and want to store some intermediate result directly into a HDFS file instead of HBase. Is there an easy way of doing this or do you have to run a script from inside the ...
    Erik HolstadErik Holstad
    Jul 24, 2008 at 10:08 am
    Jul 24, 2008 at 5:24 pm
  • we've been trying for a couple of days (without success) to import our data into hbase. initially we ran into quite a few OOME errors, but we've seem to overcome that by adjusting our jvm memory heap ...
    Jul 23, 2008 at 7:25 pm
    Jul 23, 2008 at 8:51 pm
  • looking at our region logs, we've noticed that the compaction thread constantly runs into exceptions. the entire log is filled with something like this: ---------------------------------- 2008-07-22 ...
    Jul 22, 2008 at 7:42 pm
    Jul 23, 2008 at 7:47 pm
  • hi all, i'm writting a program to access my hbase table in a MR job. my first version is to get different values from get(row,column name), and now im changing to get one row each time into a map, ...
    Jul 15, 2008 at 12:54 pm
    Jul 16, 2008 at 9:47 am
  • Hi guys. A simple question: Is only the row key sorted in HBase ? What if you would like to obtain a scanner based on another column ? I thought the "auto" sorted feature was one of the reasons you ...
    Marcus HerouMarcus Herou
    Jul 14, 2008 at 7:37 pm
    Jul 14, 2008 at 8:13 pm
  • Hi, I've peeked at HBase code, TableSplit#getLocations(). I noticed that the method returns a random node for now. I was trying to think what should be returned if one wishes to have computation ...
    Naama KrausNaama Kraus
    Jul 14, 2008 at 5:00 am
    Jul 14, 2008 at 5:48 pm
  • Hi everyone, We would like to use Hbase and Hadoop. But when we tried to use real data with our test setup, we saw a lot of crashes and could not succeed to insert the amount of data we are trying to ...
    Marcus SchlüterMarcus Schlüter
    Jul 10, 2008 at 9:36 am
    Jul 11, 2008 at 5:09 am
  • Hi guys! I'm running my MR job that based on org.apache.hadoop.hbase.mapred.BuildTableIndex //map method: @Override public void map(HStoreKey key, MapWritable value, OutputCollector<Text, MapWritable ...
    Ruslan SalyakhovRuslan Salyakhov
    Jul 30, 2008 at 1:55 pm
    Aug 4, 2008 at 9:09 pm
  • Hi. What is the best practice in hbase when it comes to creating "mapping" tables between objects? Let's say you want to create two tables named "User" and "Role" where the user can be in many roles. ...
    Marcus HerouMarcus Herou
    Jul 22, 2008 at 1:34 pm
    Jul 29, 2008 at 2:17 pm
  • I have similar setup to the 0.1.2 but started from a clean startup. The hadoop 1.7.0 seems to be working fine. When I start the hbase master I get: java.lang.reflect.InvocationTargetException but the ...
    Yair Even-ZoharYair Even-Zohar
    Jul 28, 2008 at 4:50 pm
    Jul 28, 2008 at 5:01 pm
  • I'm running a hbase data import on 0.1.3. After 42million rows, the import fails with an RPC timeout exception. I've tried twice- once on a 2 node cluster and once on a 10 node cluster (ec2 with the ...
    Mark SnowMark Snow
    Jul 25, 2008 at 6:03 pm
    Jul 25, 2008 at 11:52 pm
  • It would be handy to be able to easily dump data from postgresql straight to hbase. Then keep the data in hbase up to date. I've made a simple python tool called hbreplic (I'm very willing to come up ...
    Tim SellTim Sell
    Jul 25, 2008 at 7:13 pm
    Jul 25, 2008 at 9:55 pm
  • I tried to do a big dump of data into hbase today. I'm not sure of the exact number of rows I sent it, but it was at least 6 million or so before my dumping app crashed. My app printed the following ...
    Tim SellTim Sell
    Jul 25, 2008 at 6:02 pm
    Jul 25, 2008 at 6:54 pm
  • I can't get any remote regions to start up (local ones work fine). I get this exception: Exception in thread "regionserver/" java.lang.NullPointerException at ...
    Daniel LeffelDaniel Leffel
    Jul 23, 2008 at 11:14 pm
    Jul 23, 2008 at 11:58 pm
  • Hi guys! It seems I'll need to be able to paginate over table contents. I'll need to get items starting from most recent and going past. Is there a good way to achieve that? Since the keys are sorted ...
    Pavel LysovPavel Lysov
    Jul 23, 2008 at 7:31 pm
    Jul 23, 2008 at 8:03 pm
  • 0.2.0 is now feature complete. TRUNK is frozen but for documentation and any critical-bug fixes. A few of us are running tests out on our little cluster to make sure the current TRUNK all basically ...
    Jul 21, 2008 at 8:56 pm
    Jul 22, 2008 at 10:07 pm
  • I would like to get all the versions of a row for a map-reduce task. Given the details below, I'm afraid I was just looking too hard and there's a simpler solution. Here is what I found out: 1) ...
    Yair Even-ZoharYair Even-Zohar
    Jul 21, 2008 at 3:49 pm
    Jul 21, 2008 at 7:51 pm
  • Hi, I apologize if this has been asked and answered, but the hadoop project website seems to not be responding right now, so I can't search the mail archive. A quick search of the emails I've ...
    Rick HangartnerRick Hangartner
    Jul 17, 2008 at 5:56 pm
    Jul 18, 2008 at 2:10 am
  • I'm reading but get confused about BLOCK and RECORD compression. In my understanding, the these two ...
    Rong-en FanRong-en Fan
    Jul 10, 2008 at 2:52 pm
    Jul 11, 2008 at 5:23 am
  • hi, Is it possible to control where a certain table data is located physically. For example if I know that one of my tables in the system will be heavily used then I would like it to be stored and ...
    Krzysztof SzlapinskiKrzysztof Szlapinski
    Jul 3, 2008 at 8:31 pm
    Jul 3, 2008 at 9:38 pm
  • Hi all, Quick question. I created a new column family (one created using IN_MEMORY). I expected iterating with a scanner to be much faster, but alas, it seems to operate at speeds comparable to ...
    Daniel LeffelDaniel Leffel
    Jul 3, 2008 at 2:19 am
    Jul 3, 2008 at 2:27 am
  • I get this when I run RowCounter in the hbase jar java.lang.IllegalAccessError: tried to access method org.apache.hadoop.ipc.Client.incCount()V from class org.apache.hadoop.ipc.HBaseClient at ...
    Billy PearsonBilly Pearson
    Jul 26, 2008 at 1:27 am
    Aug 5, 2008 at 10:32 pm
  • I looked at the code in the 0.2.0 and the args[0] is used twice c.set("hbase.master", args[0]); And // First arg is the output directory. c.setOutputPath(new Path(args[0])); Was anybody able to use ...
    Yair Even-ZoharYair Even-Zohar
    Jul 30, 2008 at 9:27 pm
    Aug 1, 2008 at 1:08 am
  • Is it possible to put the output from the reduce phase of job 1 to be the input to job number 2, or is the best way to write it to a HBase table or to the HDFS and the fetch it in the second job? Erik
    Erik HolstadErik Holstad
    Jul 31, 2008 at 2:43 pm
    Aug 1, 2008 at 12:47 am
  • I'm running into several problems with Hbase 0.2.0. 1) This mapreduce experiment, a modification of rowcounter, (using the exact same data) was running in parallel for hbase 0.1.2 2) I have tested ...
    Yair Even-ZoharYair Even-Zohar
    Jul 31, 2008 at 8:45 pm
    Jul 31, 2008 at 10:07 pm
  • I define HBaseConfiguration conf = new HBaseConfiguration(); And then the following line (my HBaseAdmin admin = new HBaseAdmin(conf); generated the below error: Exception in thread ...
    Yair Even-ZoharYair Even-Zohar
    Jul 29, 2008 at 5:56 pm
    Jul 29, 2008 at 6:58 pm
Group Navigation
period‹ prev | Jul 2008 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

57 users for July 2008

Jean-Daniel Cryans: 56 posts Stack: 53 posts Andrew Purtell: 21 posts Renaud Delbru: 19 posts Yair Even-Zohar: 16 posts Rick Hangartner: 12 posts Daniel Leffel: 11 posts Billy Pearson: 10 posts Jim Kellerman: 10 posts Erik Holstad: 7 posts Marcus Herou: 6 posts Thopham.asnet: 6 posts ZhaoWei: 6 posts Daniel Yu: 5 posts lucio Piccoli: 5 posts Naama Kraus: 5 posts Pavel Lysov: 5 posts Tim Sell: 5 posts Srikanth Bondalapati: 4 posts Dru Jensen: 4 posts
show more