Grokbase Groups HBase user March 2010

Search Discussions

138 discussions - 843 posts

  • We are currently researching how to use SPARQL against data in Hbase. I understand the use of Get and Scan classes in the Hbase API, but these search classes do not return data in the same way SPARQL ...
    Basmajian, RaffiBasmajian, Raffi
    Mar 31, 2010 at 3:27 pm
    Apr 10, 2010 at 11:38 pm
  • Hi All, I'm currently running into data loss issues when bulk loading data into HBase. I'm loading data via a Map/Reduce job that is parsing XML and inserting rows into 2 HBase tables. The job is ...
    Nathan HarkenriderNathan Harkenrider
    Mar 21, 2010 at 10:51 pm
    Mar 25, 2010 at 5:51 pm
  • Hi. I'm trying to create a scanner with custom (Hbase 0.20.3) filter and it looks like RegionServer throws an exception: org.apache.hadoop.hbase.UnknownScannerException: Name: -1 I have my custom ...
    Slava GorelikSlava Gorelik
    Mar 15, 2010 at 5:16 pm
    Apr 29, 2010 at 3:38 pm
  • Hello Everyone, Recently we often got these in our client logs: org.apache.hadoop.hbase.client.RetriesExhaustedException: Trying to contact region server for region ...
    Zheng LvZheng Lv
    Mar 15, 2010 at 10:41 am
    Mar 26, 2010 at 5:55 pm
  • Hi all, I've got a question about how everyone is using HBase. Is anyone using its as online data store to directly back a web service? The text-book example of a weblink HBase table suggests there ...
    Mar 9, 2010 at 3:46 pm
    Mar 10, 2010 at 9:58 am
  • For anyone not in the bay area, we had HUG9 last night. Links to the presentations below. JG From: On Behalf Of Jonathan Gray Sent: Thursday, March 11, 2010 1:57 PM ...
    Jonathan GrayJonathan Gray
    Mar 11, 2010 at 10:49 pm
    Mar 15, 2010 at 10:15 pm
  • Hello Hbase Users List, In the SQL world, you can choose column names that clearly describe a field (i.e. long names) I believe it is different in Hbase. Is it worth choosing the shortest possible ...
    TuX RaceRTuX RaceR
    Mar 12, 2010 at 7:49 pm
    Mar 15, 2010 at 11:22 pm
  • Hi, I'm trying to debug an issue where I am getting 'partial' failures. For some reason the region servers seem to end up with multiple 'live' servers on a node. (We start with 3 servers and the next ...
    Michael SegelMichael Segel
    Mar 3, 2010 at 4:44 pm
    Mar 5, 2010 at 4:26 pm
  • greetings, I have a cluster which contains a dozens of PCs with 4cores and 4GB ram each, at first there were only 5 regionservers in the cluster, now we have added 18 regionservers into the cluseter. ...
    Steven zhuangSteven zhuang
    Mar 5, 2010 at 1:54 am
    Mar 9, 2010 at 2:42 am
  • Hi, I'm evaluating Hbase as a NoSql DB for a large scale, interactive, web service with very high uptime requirements, and have a few questions to the community. 1. I assume you've seen this ...
    Eran KutnerEran Kutner
    Mar 4, 2010 at 10:03 am
    Mar 5, 2010 at 5:36 pm
  • Hi all, I have run into some trouble while trying to port SQL DB to Hbase. The problem is my SQL DB has around 500 tables (approx) and it is very badly designed. Around 45-50 tables could be ...
    Kranthi reddyKranthi reddy
    Mar 31, 2010 at 10:05 am
    Apr 14, 2010 at 12:30 pm
  • I am new to HBase and come from a rdbms background. After looking in the sample client code it seems fairly easy to query a single table using Get and Scan, but it's not so obvious how to join data ...
    Basmajian, RaffiBasmajian, Raffi
    Mar 19, 2010 at 3:34 pm
    Mar 23, 2010 at 6:58 am
  • hi, all, I have a table which may contain super big rows, e.g. with millions of cells in one row, 1.5GB in size. now I have problem at emitting data into the table, probably because of these super ...
    Steven zhuangSteven zhuang
    Mar 7, 2010 at 5:14 am
    Mar 15, 2010 at 1:28 am
  • As I do see this Exception really often in our logs. I wonder if this indicates a regular thing (within splits etc) or if this is something that should not normally happen. I see it often in Jira as ...
    Al LiasAl Lias
    Mar 31, 2010 at 11:38 am
    Apr 1, 2010 at 4:26 am
  • hi, guys, now I have a table with 30M rows, sometime we need to do scan on a range of 20M rows, each row contains only one column, with less than 50 Bytes data, each row key is a date string, ...
    Steven zhuangSteven zhuang
    Mar 27, 2010 at 3:02 am
    Mar 29, 2010 at 5:36 pm
  • [Copied to hbase-user@] Hi Lars, No, that means the scripts are not picking up a valid public AMI. I think you may be the first to attempt to use the EC2 scripts in the EU. One thing I did not ...
    Andrew PurtellAndrew Purtell
    Mar 20, 2010 at 8:00 pm
    Mar 27, 2010 at 7:49 pm
  • Hi, How do you guys deploy your Java program to the production cluster? I worked in Eclipse in a local linux machine in pseudo distributed mode. What would be the best way to deploy my program to the ...
    William KangWilliam Kang
    Mar 23, 2010 at 5:43 am
    Mar 24, 2010 at 5:54 am
  • Hi, this blogpost < backend describes lucandra[1], a project to use cassandra as a persistence backend for lucene indices. From ...
    Thomas KochThomas Koch
    Mar 9, 2010 at 2:47 pm
    Mar 23, 2010 at 9:52 am
  • This has come up before< but I'm still unclear as to whether this is possible or ...
    George StathisGeorge Stathis
    Mar 19, 2010 at 3:01 pm
    Mar 21, 2010 at 11:11 pm
  • Hi, I am trying to load data into IHBase table with few indexes and my region server is crashing. I analyzed heap dump and now I am under impression that IHBase index data structures eat all ...
    Andrey KolyadenkoAndrey Kolyadenko
    Mar 20, 2010 at 8:09 pm
    Mar 21, 2010 at 10:18 am
  • Hi. I'm looking is there functionality that allows me to perform multiple get (to retrieve number of rows by some row id's list) ? I think this can reduce number of round-trips between client and ...
    Slava GorelikSlava Gorelik
    Mar 4, 2010 at 3:01 pm
    May 4, 2010 at 5:45 pm
  • Hi, One of our region servers was shut down with the following messages in the log. It seems like communication with the zookeeper timed out and when it later reconnected, the session was expired and ...
    Peter FalkPeter Falk
    Mar 29, 2010 at 11:27 am
    Mar 31, 2010 at 4:18 pm
  • Hi, I am quite confused about the distributions of data in a HBase system. For instance, if I store 10 videos in 10 HTable rows' cell, I assume that these 10 videos will be stored in different data ...
    William KangWilliam Kang
    Mar 27, 2010 at 5:07 pm
    Mar 30, 2010 at 12:33 am
  • I am having the hardest time tying to connect from a running webapp in Tomcat 6.0 to a remote HBase instance. Whenever the webapp comes to: HBaseConfiguration config = new HBaseConfiguration(); There ...
    A LevineA Levine
    Mar 26, 2010 at 2:21 pm
    Mar 26, 2010 at 11:44 pm
  • Hi, There: I migrated my hbase cluster from network to for all my hbase machines, and now I can't read the contents of the tables any more. I run "bin/hbase shell", and then ...
    Jinsong HuJinsong Hu
    Mar 16, 2010 at 11:06 pm
    Mar 17, 2010 at 12:39 am
  • Ok, Still trying to track down some issues. I opened up an hbase shell and decided to use count to count the number of rows in a table. As it was running, count was flying along until it hit 150,000 ...
    Michael SegelMichael Segel
    Mar 17, 2010 at 4:26 am
    Mar 18, 2010 at 3:37 am
  • Hello List, I'll be accessing a table mainly in random access and I am looking for an efficient way of randomizing the keys. I thought about a MD5 hash of the ID of the record, but as MD5 returns a ...
    TuX RaceRTuX RaceR
    Mar 11, 2010 at 10:08 am
    Mar 12, 2010 at 2:27 pm
  • Hi we are having small job which indexes hbase table with lucene. Map process will just emit hbase rows and reduce will create lucene index. In the middle of map process, while reduce jobs doing ...
    Mar 7, 2010 at 8:07 pm
    Mar 8, 2010 at 9:29 pm
  • Hi guys, I am new to HBase and have several questions. Would anybody kindly answer some of them? 1. Why HBase could provide a low-latency random access to files compared to HDFS? 2. By default, Only ...
    William KangWilliam Kang
    Mar 1, 2010 at 8:37 pm
    Mar 4, 2010 at 7:15 pm
  • Hi all, Is anyone using elastic search as an indexing layer to HBase content? It looks to have a really nice API, and was thinking of setting up an EC2 test where I maintain an ES index storing only ...
    Tim RobertsonTim Robertson
    Mar 27, 2010 at 6:46 pm
    Jun 4, 2010 at 9:33 pm
  • Hi, One of my region server is still listed on the webpage Region Server, but it raised folloing message while running my program. 10/03/30 13:11:18 INFO ipc.HbaseRPC: Server at / ...
    Mar 30, 2010 at 5:18 am
    Mar 31, 2010 at 6:32 am
  • Hi all, maybe somebody could give me advice in the following situation: Currently HBase Scan interface provides ability to set up only first and last rows for MR scanning. Is it any way to get ...
    Andrey KolyadenkoAndrey Kolyadenko
    Mar 17, 2010 at 8:32 pm
    Mar 26, 2010 at 1:59 pm
  • Hello guys, I've got a problem while adding a filter to scanner in a client app which runs on the remote (not the one from HBase cluster) box. The same code works well and scan result is fetched very ...
    Alex BaranovAlex Baranov
    Mar 17, 2010 at 4:00 pm
    Mar 22, 2010 at 5:54 pm
  • My current setup in EC2 is a Hadoop Map Reduce cluster and HBase cluster sharing the same HDFS. That is, I have a batch of nodes that run datanode and tasktracker and a bunch of nodes that run ...
    Kevin PetersonKevin Peterson
    Mar 3, 2010 at 7:28 pm
    Mar 15, 2010 at 9:56 pm
  • Bringing the discussion in hbase-user That usually happens after a DNS hiccup. There's a fix for that in J-D
    Jean-Daniel CryansJean-Daniel Cryans
    Mar 11, 2010 at 6:20 pm
    Mar 12, 2010 at 12:09 am
  • Hi, Sometimes we wish to do a clean update of a row, that is: Make sure any old column values are removed that are not in the new Put. This is how we're doing this now (hbase 0.20.3): //delRow and ...
    Mar 8, 2010 at 12:54 pm
    Mar 9, 2010 at 10:35 am
  • Hi we used to run few (2-5) map reduce jobs which imports data (csv files on hdfs) into one large hbase table (one family only). After few successful runs we managed to import about 5-10 million ...
    Mar 5, 2010 at 5:39 pm
    Mar 8, 2010 at 10:23 pm
  • Hi, We have a cluster of four nodes that run hadoop 0.20.1 data nodes and hbase 0.20.2 region servers. We occasionally loose region servers with an OOME like the following. 2010-03-24 04:22:03,027 ...
    Peter FalkPeter Falk
    Mar 24, 2010 at 10:52 am
    Jun 22, 2010 at 9:56 pm
  • Dear Mr. Crockford, I'm working on a Debian package for HBase, the Database also used by Yahoo. One dependency of hbase is agile-json[1], apparently based on your org.json library. It also contains ...
    Thomas KochThomas Koch
    Mar 26, 2010 at 5:10 pm
    Mar 29, 2010 at 9:11 pm
  • Hi Hbase fans, I'm trying to prepare as stable version as it is possible of our Hbase/Hadoop stack. This involves adding some patches. As a base i want to use hbase 0.20.3 and hadoop 0.20.2. Which ...
    Michał PodsiadłowskiMichał Podsiadłowski
    Mar 24, 2010 at 3:05 pm
    Mar 25, 2010 at 4:49 pm
  • Hi, everyone. I set up a hadoop cluster with 3 nodes: one mater and two slaves. Recently I want to add a new node as a slave to it. After configuration, the HDFS and HBase work, however, I found some ...
    Mar 10, 2010 at 11:32 am
    Mar 25, 2010 at 2:59 am
  • Hi there, I want to split META table manually but I wonder how to set the optional Region Key in the webpage. (using the value like BIG_TABLE,FRPFXRD_NF61904-0 1.001.Main.0,1268309701214) ...
    Mar 12, 2010 at 3:51 am
    Mar 15, 2010 at 3:01 am
  • Hi, The strange thing was that after several quick responses I would see: hbase(main):004:0 get 'ruletable', 'com.about.acne' COLUMN CELL lpm_1.0:category timestamp=1268347483823, value= ...
    Ted YuTed Yu
    Mar 13, 2010 at 4:25 am
    Mar 13, 2010 at 10:25 pm
  • Hi, I am hoping someone can help me with a zookeeper setup. I am running Zookeeper wih export HBASE_MANAGES_ZK=false and start it using bin/hbase zookeeper. It starts up fine. However when I start up ...
    Ahmed Suhail ManzoorAhmed Suhail Manzoor
    Mar 7, 2010 at 11:50 am
    Mar 9, 2010 at 8:26 pm
  • Hi, I need to process (with a MR job) data stored in HBase. The data is added to HBase incrementally (and stored in there forever) and so I'd like this MR job to process only the new data every time ...
    Otis GospodneticOtis Gospodnetic
    Mar 5, 2010 at 9:50 pm
    Mar 9, 2010 at 5:44 pm
  • Hi, I've read the API doc of ResultScanner and Scanner. Both of them support to get "next" data record in the table. My question is, does there exist any scanner interface that supports to get ...
    Hua SuHua Su
    Mar 6, 2010 at 4:00 pm
    Mar 8, 2010 at 2:45 am
  • Hi. I'm trying to find a way to get timestamp for a specific row and column (cell). But, since the Cell class is deprecated as well as RowResult. The only way that I found is to get a list of ...
    Slava GorelikSlava Gorelik
    Mar 3, 2010 at 4:55 pm
    Mar 4, 2010 at 5:15 am
  • Previous attempt wasn't delivered.
    Ted YuTed Yu
    Mar 3, 2010 at 5:58 pm
    Mar 3, 2010 at 7:41 pm
  • Hi, I started 200 clients(spread it to 20 machines) to run NewHTableTest like following code, which took 983 seconds. META table just resides in one region that machine CPU and network traffic are ...
    Mar 1, 2010 at 8:23 am
    Mar 2, 2010 at 3:17 am
  • Hi folks, I've seen some people around the list that recommend rolling one's own indexes. Others say to just go with the org.apache.hadoop.hbase.client.tableindexed package. A quick scan of the wiki ...
    George StathisGeorge Stathis
    Mar 29, 2010 at 7:52 pm
    Mar 31, 2010 at 3:09 pm
Group Navigation
period‹ prev | Mar 2010 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

126 users for March 2010

Jean-Daniel Cryans: 92 posts Stack: 67 posts Ryan Rawson: 57 posts Andrew Purtell: 41 posts Michael Segel: 30 posts Ted Yu: 30 posts Y_823910: 26 posts Jonathan Gray: 21 posts Slava Gorelik: 20 posts Steven zhuang: 19 posts William Kang: 16 posts George Stathis: 15 posts Zheng Lv: 15 posts Dan Washusen: 14 posts Patrick Hunt: 14 posts Tim Robertson: 14 posts TuX RaceR: 12 posts Alex Baranov: 11 posts Jonathan Gray: 11 posts Erik Holstad: 10 posts
show more