Grokbase Groups HBase user May 2010

80 discussions - 458 posts

  • Hi Wanted to find the group's experience on HBase performance with increasing number of regions/node. Also wanted to find out if there is an optimal number of regions one should aim for? We are ...
    Jacob Isaac
    May 27, 2010 at 7:09 pm
    Jun 2, 2010 at 8:18 pm
  • I am wondering if anyone has thought about using HBase on other file systems like "Gluster". I think Gluster may offer much faster performance without exorbitant cost. With Gluster, you would have to ...
    Kevin Apte
    May 9, 2010 at 12:08 pm
    May 16, 2010 at 8:23 am
  • Hi everyone, I just read about IHbase and it seems like something I could try, but I haven't been able to find information (besides descriptions and advantages) regarding how to install it ...
    Renato Marroquín Mogrovejo
    May 12, 2010 at 2:26 pm
    May 17, 2010 at 8:59 am
  • Is the currently published schema the latest version? I'm just noticing differences in the entities that are being generated by JAXB and the data that they are marshalling to. The schema I'm using is ...
    Anthony Ikeda
    May 26, 2010 at 7:06 am
    May 29, 2010 at 6:44 pm
  • I'm not sure I understand why you distinguish small HFiles and a single behemoth HFile? Are you trying to understand more about disk space or I/O patterns? It looks like your understanding is ...
    Jonathan Gray
    May 17, 2010 at 8:06 pm
    May 25, 2010 at 5:43 pm
  • Hello, I'm new to HBase, so excuse me if I ask odd questions. I'm evaluating HBase from its documentation, and am attracted by its broad functionality such as transaction support, secondary index, ...
    Takayuki Tsunakawa
    May 7, 2010 at 1:49 am
    May 10, 2010 at 12:47 am
  • Hi, I would like to know how to configure HBase to improve the scanner fetching data from the table, or another method of using the scanner, as my database is very large and the scanner times out. Kind Regards, ...
    Michelan Arendse
    May 4, 2010 at 8:45 am
    May 7, 2010 at 8:50 pm
  • Hi, I'm executing HBase 0.20.4 over Hadoop 0.20.2 on Amazon EC2 and it had been running smooth for more than a month. Today it crashed apparently because of some Zookeeper exception. It's a ...
    Lucas Nazário dos Santos
    May 27, 2010 at 3:08 pm
    May 29, 2010 at 6:52 pm
  • Hi All, I'm new to hbase and columnar storage schemas, so any comments you have on the schema or the actual problem at hand are very much welcome. I'm using 0.20.4, initially testing as standalone on ...
    Oliver Meyn
    May 19, 2010 at 8:53 pm
    May 21, 2010 at 1:01 pm
  • Hello, I got the following error when I sent the mail. Technical details of permanent failure: Google tried to deliver your message, but it was rejected by the recipient domain. We recommend ...
    May 8, 2010 at 9:15 am
    May 9, 2010 at 3:24 am
  • Hi, First of all, thanks to all the HBase contributors for getting 0.20.4 out. We're planning on upgrading soon, and we're also looking forward to 0.20.5. Recently we've had a couple of problems ...
    James Baldassari
    May 8, 2010 at 3:27 am
    Jun 8, 2010 at 8:58 pm
  • We are working on a prototype to migrate our healthcare database, currently in Oracle, to HBase. Our java based web application allows end users to search patients on over 50 different criteria ...
    May 3, 2010 at 6:42 am
    May 27, 2010 at 3:29 pm
  • Hello, I finally got some decent hardware to put together a 1 master, 4 slave Hadoop/HBase cluster. However, I'm still waiting for space in the datacenter to clear out and only have 3 of the nodes ...
    Andrew Nguyen
    May 22, 2010 at 3:41 pm
    May 24, 2010 at 1:07 am
  • Hi, I would like to understand the meaning of the storefileIndexSize metric. Could someone point me to a definition or explain to me what it means? Also, we are performing a large table import ...
    Renaud Delbru
    May 17, 2010 at 2:27 pm
    May 19, 2010 at 5:20 pm
  • Hi, A few days ago, I had a discussion with other Japanese developers on hadoop-jp Google group. It was about HLog durability on the recent Hadoop releases (0.20.1, 0.20.2) I never looked at this ...
    Tatsuya Kawano
    May 17, 2010 at 4:02 am
    May 18, 2010 at 1:45 am
  • Hi, I am trying to put some data into HBase, following the sample code in the API below: public static void insertData(String rowid, String data) throws Exception { if(table==null) { table = new ...
    May 6, 2010 at 5:35 pm
    May 6, 2010 at 6:45 pm
  • Hello, I'm comparing HBase and Cassandra, which I think are the most promising distributed key-value stores, to determine which one to choose for the future OLTP and data analysis. I found the ...
    May 8, 2010 at 12:14 pm
    May 9, 2010 at 5:19 pm
  • Hi, Sorry for the multiple e-mails, it seems gmail didn't send my whole message last time! Anyway here it goes again... Whilst loading data via a mapreduce job into HBase I have started getting this ...
    Dan Harvey
    May 24, 2010 at 9:40 pm
    Jun 2, 2010 at 11:12 pm
  • Hi there, According to the previous email about HBase: minimal number of boxes. I would like to know how to deploy it for each component, if there are 60 machines for our production cluster. Trying ...
    May 26, 2010 at 3:09 am
    May 26, 2010 at 6:25 am
  • I am experiencing a problem in which Put operations transition from working just fine, to blocking forever. I am doing Put from a reducer. I have tried the following, but none of them prevents the ...
    Geoff Hendrey
    May 19, 2010 at 6:56 am
    May 20, 2010 at 1:26 am
  • Hi, Our HBase-backed app does about 95% reads and 5% writes on average, but once per hour we do a bulk update of several million rows (through HTable w/ large write buffer, not MR). Most of the time ...
    James Baldassari
    May 18, 2010 at 5:47 am
    May 18, 2010 at 6:38 pm
  • Hi. I added the following properties to hbase-site.xml: <property> <name>hbase.regionserver.class</name> <value>org.apache.hadoop.hbase.ipc.IndexedRegionInterface</value> </property> <property> <name> ...
    Michelan Arendse
    May 10, 2010 at 1:42 pm
    May 12, 2010 at 9:42 pm
  • Hi all, I have a table into which I have already imported some data, but I failed to import more data into it (still checking). For some reason I restarted the cluster, and in the Web interface I found out that ...
    Steven zhuang
    May 25, 2010 at 8:55 am
    May 28, 2010 at 6:07 am
  • Hi all, Let a set, S(X) = {a, b, c, d, e, f, .....}. I compute the values of the set in multiple MR job iterations i.e. multiple MR jobs would be run one after another several times. In each ...
    Raghava Mutharaju
    May 17, 2010 at 12:44 am
    May 20, 2010 at 6:27 am
  • Hi, We are implementing delete operation that can be rolled back. I.e. we don't want to delete particular row immediately before a set of rows are processed. I want to get wisdom from the community ...
    Ted Yu
    May 19, 2010 at 11:53 pm
    May 20, 2010 at 4:14 am
  • Hi, maybe I'll get help here :) I have 2 tables, UserToAdv and AdvToUsers. UserToAdv is simple: { "row_id" = [ {"adv:<id>":<counter>}, {"adv:<id>":<counter>}, .....about 100 columns ] only one kind ...
    Sebastian Bauer
    May 11, 2010 at 3:56 pm
    May 12, 2010 at 5:47 pm
  • Hi folks, I recently rewrote the Pig HBase loader to work with binary data, push down filters, and do other things that make it more versatile. If you use, or plan to use, both Pig and HBase, please ...
    Dmitriy Ryaboy
    May 3, 2010 at 7:23 pm
    May 4, 2010 at 4:31 pm
  • Hi all, I have a table "device_power" which has as its row key "device_id.timestamp", and a single column, "power". A new row is written whenever the power consumption of a device changes, which ...
    Oliver Meyn
    May 28, 2010 at 4:06 pm
    May 31, 2010 at 1:29 pm
  • What is the best option to just return the rowkeys when scanning a table range? Is it enough, not to set any family or column in the Scan()? Thx, Bob
    Bob Schulze
    May 21, 2010 at 8:55 am
    May 21, 2010 at 3:23 pm
  • Hi, We use HBase 0.20.3. The server running the master ran out of disk space. I cleaned up some space: /dev/mapper/VolGroup00-LogVol00 303346048 268141776 19795120 94% / But I still cannot stop hbase - ...
    Ted Yu
    May 19, 2010 at 4:17 am
    May 19, 2010 at 4:56 pm
  • Consider a search system with an inverted word index- in other words, an index which points to document location- with these columns- word, document ID and possibly timestamp. Given a word, how will ...
    Kevin Apte
    May 17, 2010 at 8:07 am
    May 17, 2010 at 5:53 pm
  • Hi, Currently we are designing an architecture for a Accounting SaaS and e-commerce website. As both of them will store financial data - transaction, redundancy, HA and data integrity is very ...
    Imran M Yousuf
    May 17, 2010 at 3:14 pm
    May 17, 2010 at 5:22 pm
  • For the model I am designing, read speed is the highest priority. That being said, I have a Customers table with information about Claims. Here is the design today: Table: Customers RowId: CustomerId ...
    N Kapshoo
    May 12, 2010 at 7:21 pm
    Jun 20, 2010 at 7:26 pm
  • Hi list, With HBase's great write speed, I was thinking it would be a good thing to switch an app that logs to a database to logging to HBase. I couldn't really find anyone else who's using it that ...
    Alex Thurlow
    May 24, 2010 at 10:45 pm
    Jun 3, 2010 at 12:41 pm
  • Following the links from the Releases page to the Download page leads me to a list of links to directories on ...
    Charles Woerner
    May 29, 2010 at 4:52 pm
    May 29, 2010 at 5:04 pm
  • Hi, A colleague and I are working on testing a few HBase features, notably bulk import (mentioned in ...
    Adam Silberstein
    May 25, 2010 at 5:57 am
    May 25, 2010 at 6:32 am
  • I am working with a small cluster, trying to nail down appropriate settings for block size. We will have a single table with a single column of data averaging 300k in size, sometimes upwards of 2mb, ...
    Jason Strutz
    May 18, 2010 at 4:33 pm
    May 19, 2010 at 3:07 pm
  • Hi, I am new to HBase and I am trying to use hbql on HBase. There I am getting the following exception Exception in thread "main" java.lang.NoClassDefFoundError: ...
    Nitin Goel
    May 17, 2010 at 10:57 am
    May 18, 2010 at 1:40 am
  • It's 5pm Friday, so I'm not going into a lot of detail, but we've also seen problems with 0.20.4. Specifically: 1. possible deadlock HBASE-2545 2. Regions not getting flushed with message that it ...
    Kevin Peterson
    May 15, 2010 at 12:30 am
    May 15, 2010 at 6:48 pm
  • Hi, after upgrading from 0.20.3 to 0.20.4 a list of tables almost immediately becomes inconsistent - master.jsp shows no tables even after creating test table in hbase shell, tables which were ...
    Viktors Rotanovs
    May 13, 2010 at 11:54 pm
    May 14, 2010 at 10:31 am
  • This is similar to a mail sent by another user to the group a couple of months back.. I am quite new to Hbase and I've been trying to conduct a basic experiment with Hbase.. 1. I am trying to load ...
    Vidhyashankar Venkataraman
    May 13, 2010 at 5:36 pm
    May 13, 2010 at 6:46 pm
  • Ok, here's my story in case anyone else encounters the same issue... My question is this... Why does the table descriptor/meta table information not match the .regioninfo in each region sub dir? Is ...
    May 12, 2010 at 1:04 am
    May 12, 2010 at 6:19 am
  • Hi, Suppose I want to keep aggregated data (e.g. counters) per day for a year or so (this way I can query HBase on a specific date range). Is the proper way to do this to use the in-built versions ...
    Lior Schachter
    May 9, 2010 at 3:13 pm
    May 10, 2010 at 7:07 am
  • HBase 0.20.0 is available for download: The Release Notes are available here [1]. This release includes critical fixes, some improvements and performance ...
    May 5, 2010 at 9:26 pm
    May 6, 2010 at 5:00 am
  • Hello, I am researching whether we can use HBase efficiently in our company. I need to be able to get/scan a list of rows for an array of key values (sorted, long type). The array size will be 1,000 ...
    May 4, 2010 at 10:16 pm
    May 5, 2010 at 6:30 pm
  • Hi HBase users, a question related to the previous one: if we want to limit the amount of data retrieved by a scanner, can we tell it not to scan after a number of rows is reached? If I look at another ...
    TuX RaceR
    May 4, 2010 at 4:03 pm
    May 5, 2010 at 4:11 am
  • Hi! I'm using a fresh version of trunk. I'm experiencing a problem where the invalid region locations are not removed from the cache of HCM. I'm only using scanners on the table and I receive the ...
    Miklós Kurucz
    May 3, 2010 at 10:12 pm
    May 3, 2010 at 10:32 pm
  • Hi folks, I have to use numerous search criteria and each having lots of distinct values. So, the secondary indexing like IHBase will require lots of memory. I think I require a custom index ...
    May 10, 2010 at 11:16 am
    Jun 4, 2010 at 3:39 am
  • The Wiki is a little out of date. I opened HBASE-2633 to address: - Andy ...
    Andrew Purtell
    May 30, 2010 at 7:54 pm
    Jun 2, 2010 at 4:03 am
  • Hi, my HBase crashed, here is the log, and I want to recover my region server. After trying "hbase/bin/ start regionserver" it failed to start. What's the problem to shutdown the ...
    May 28, 2010 at 3:16 am
    Jun 1, 2010 at 2:50 am
Group Overview
group: user @
categories: hbase, hadoop

93 users for May 2010

Stack: 46 posts
Jean-Daniel Cryans: 42 posts
Ryan Rawson: 28 posts
Jonathan Gray: 25 posts
Todd Lipcon: 20 posts
Vidhyashankar Venkataraman: 19 posts
Andrew Purtell: 12 posts
Jacob Isaac: 10 posts
Kevin Apte: 9 posts
MauMau: 9 posts
Edward Capriolo: 8 posts
James Baldassari: 8 posts
SingoWong: 8 posts
TuX RaceR: 8 posts
Michelan Arendse: 7 posts
Raghava Mutharaju: 7 posts
Takayuki Tsunakawa: 7 posts
Y_823910: 6 posts
Amandeep Khurana: 6 posts
Friso van Vollenhoven: 6 posts