Grokbase Groups HBase user June 2012

Search Discussions

111 discussions - 586 posts

  • I watched Lars George's video about HBase and read the documentation and it's saying that it's not a good idea to have the timestamp as a key because that will always load the same region until the ...
    Jean-Marc SpaggiariJean-Marc Spaggiari
    Jun 13, 2012 at 4:16 pm
    Jul 9, 2012 at 1:11 am
  • The 'hbase.hregion.max.filesize' are set to 100G (The recommed value to act as auto-split turn off). And there is a table, we keep put datas into it. When the storefileUncompressedSizeMB reached ...
    Jun 6, 2012 at 8:42 am
    Jun 7, 2012 at 6:07 am
  • Hi All, We are running a mapreduce job in a fully distributed cluster.The output of the job is writing to HBase. While running this job we are getting an error: *Caused by ...
    Manu SManu S
    Jun 6, 2012 at 2:25 pm
    Jul 3, 2012 at 6:46 am
  • Hi, I have a small piece of code, for testing, which is putting 1B lines in an existing table, getting 3000 lines and scanning 10000. The table is one family, one column. Everything is done ...
    Jean-Marc SpaggiariJean-Marc Spaggiari
    Jun 27, 2012 at 11:34 pm
    Jun 28, 2012 at 4:49 pm
  • Hi , I need to delete rows from hbase table by criteria. For example I need to delete all rows started with "12345". I didn't find a way to set a row prefix for delete operation. What is the best way ...
    Oleg RuchovetsOleg Ruchovets
    Jun 18, 2012 at 10:09 pm
    Jun 20, 2012 at 2:11 pm
  • Hello, I have to bulkload 6 tables which contain the same information but with a different order to cover all possible access patterns. Would it be a good idea to do only one load and use ...
    Sever FundatureanuSever Fundatureanu
    Jun 26, 2012 at 4:56 pm
    Jul 14, 2012 at 7:21 pm
  • Hello list, let's say I have to fetch a lot of rows for a page-request (say 1.000-2.000). The row-keys are a composition of a fixed id of an object and a sequential ever-increasing id. Salting those ...
    Jun 4, 2012 at 8:55 pm
    Jun 7, 2012 at 6:26 am
  • Hi I'm getting some unexpected results with a pre-split table where some of the regions are not getting any data. The table keys are UUID (generated using Java's UUID.randomUUID() ) which I'm storing ...
    Simon KellySimon Kelly
    Jun 12, 2012 at 8:18 am
    Jun 13, 2012 at 7:02 am
  • Hello list, Is it possible to use Coprocessors on some specific regionservers instead of a per-region basis??As per my understanding a coprocessor allows us to run the code directly on each region ...
    Mohammad TariqMohammad Tariq
    Jun 26, 2012 at 3:45 pm
    Jun 28, 2012 at 11:32 am
  • Hi I've run into some performance issues with my hadoop MapReduce Job. Basically what I'm doing with it is: - read data from HDFS file - the output goes also to HDFS file (multiple ones in my ...
    Marcin CylkeMarcin Cylke
    Jun 19, 2012 at 8:37 am
    Jun 27, 2012 at 3:49 pm
  • What is the best practice to remove a node and add the same node back for hbase/hadoop ? Currently in our 10 node cluster; 2 nodes went down (bad disk, so node is down as its the root volume+data) ...
    David CharleDavid Charle
    Jun 22, 2012 at 3:17 am
    Jul 9, 2012 at 12:38 pm
  • Hello all, In an AWS outtage we lost about a 5th of our regionservers, and about an 8th of our total datanodes. Despite a replication factor of 3, it appears we may have lost some data from corrupt ...
    Bryan BeaudreaultBryan Beaudreault
    Jun 30, 2012 at 6:38 am
    Jul 1, 2012 at 5:08 am
  • Hi, I have read all the documentation here and I now have few questions. I currently have a mysql table with millions of lines (4 for now, but it's growing by 4 ...
    Jean-Marc SpaggiariJean-Marc Spaggiari
    Jun 12, 2012 at 9:43 pm
    Jun 16, 2012 at 10:41 am
  • Hi..I am new to Hbase. Can anyone please suggest that one HRegionServer means one DataNode? Can there be multiple data nodes in one HRegionServer??:confused: -- View this message in context ...
    Jun 12, 2012 at 9:48 am
    Jun 13, 2012 at 5:17 am
  • Hi Users I was making a Table on Hadoop on the base of Hbase and I wrote a mapReduce Program for this Table But I can't get Answer from this program. I want to read from my file and write on the ...
    Mohamad hosein jafariMohamad hosein jafari
    Jun 6, 2012 at 11:55 am
    Jun 8, 2012 at 12:40 pm
  • (this is a question for the user mailing list, I put the dev@ in bcc) HBase just needs to know the address of the Namenode given via hbase.rootdir and that's it. How do you propose we improve the ...
    Jean-Daniel CryansJean-Daniel Cryans
    Jun 29, 2012 at 5:20 pm
    Aug 15, 2012 at 10:16 pm
  • I'm struggling to understand why my deletes are taking longer than my inserts. My understanding is that a delete is just an insertion of a tombstone. And I'm deleting the entire row. I do a simple ...
    Jeff WhitingJeff Whiting
    Jun 27, 2012 at 9:04 pm
    Jun 28, 2012 at 2:38 pm
  • Hello All, As this problem has both a Hadoop and HBase component, rather than posting the same message to both groups, I'm posting the datanode portion of this problem under the title of "Single disk ...
    Peter NaudusPeter Naudus
    Jun 21, 2012 at 3:58 pm
    Jun 26, 2012 at 4:31 pm
  • Hello! I have started studied HBase and I'm having a problem when tried execute some command on Hbase shell. I configured my HBase on Standalone mode. I have started HBase without troubles, any ...
    Guilherme VanzGuilherme Vanz
    Jun 21, 2012 at 1:21 pm
    Jun 22, 2012 at 10:05 pm
  • Hi, I am using HBase client API to access HBase. My HBase version is 0.92.1 and I have three nodes in my Hadoop cluster. Two nodes are in US and one node in India. HBase master is in one of the node ...
    AnandaVelMurugan Chandra MohanAnandaVelMurugan Chandra Mohan
    Jun 29, 2012 at 6:21 am
    Jul 3, 2012 at 5:39 am
  • Hi, I've been posting questions in the mailing-list quiet often lately, and here goes another one about data locality I read the excellent blog post about data locality that Lars George wrote at ...
    Ben KimBen Kim
    Jun 15, 2012 at 4:57 am
    Jun 27, 2012 at 6:39 am
  • Hi everyone, I have a use case in HBase that I was wondering if someone may have stumbled upon. I am maintaining an ad impressions table with columns that are counters for certain metrics. I started ...
    Sid KumarSid Kumar
    Jun 19, 2012 at 12:49 am
    Jun 24, 2012 at 11:19 pm
  • Hello list, I was going through the Hbase Replication documentation(at to get myself clear with the concepts..One thing which I could not find is that ...
    Mohammad TariqMohammad Tariq
    Jun 22, 2012 at 9:19 pm
    Jun 23, 2012 at 5:17 pm
  • Hi folks, Here is how I understand the scan flow (A regular sequential scan from key A to key B): - Zookeeper is contacted for the RegionServer that has the -ROOT- regions. - The -ROOT- RS is ...
    IGZ NickIGZ Nick
    Jun 18, 2012 at 6:17 pm
    Jun 19, 2012 at 1:53 am
  • Hi all, I'm trying to better understand what's going on in the region server during write to HBase. As I understand the process: 1. Data is written to memstore. 2. Once the memstore has reached ...
    Amit SelaAmit Sela
    Jun 10, 2012 at 5:03 pm
    Jun 17, 2012 at 4:53 pm
  • Hi, I have a table with details: hbase(main):024:0 scan 'test' ROW COLUMN+CELL row1 column=cf:a, timestamp=1339581548508, value=value1 row2 column=cf:b, timestamp=1339581557585, value=value2 row3 ...
    Jun 13, 2012 at 11:18 am
    Jun 14, 2012 at 3:36 pm
  • During a recent Cloudera course we were told that it is "Best practice" to isolate a MapReduce/HDFS cluster from an HBase/HDFS cluster as the two when sharing the same HDFS cluster could lead to ...
    Atif KhanAtif Khan
    Jun 6, 2012 at 12:00 am
    Jun 7, 2012 at 5:38 am
  • I have a table that holds rotating data. It has a TTL of 3600. For some reason, when I scan the table I still get old cells that are much older than that TTL. I have tried issuing a compaction ...
    Tom BrownTom Brown
    Jun 1, 2012 at 11:00 pm
    Jun 6, 2012 at 9:06 am
  • What're the current best practices for making custom Filter implementation classes available to the region servers? My cluster is running 0.90.4 from the CDH3U3 distribution, FWIW. I searched around ...
    Evan PollanEvan Pollan
    Jun 27, 2012 at 5:47 pm
    Jul 30, 2012 at 10:03 am
  • Hi I'm doing some evaluations with HBase. The workload I'm facing is mainly insert-only. Currently I'm inserting 1KB rows, where 100Bytes go into one column. I have the following cluster machines at ...
    Martin AligMartin Alig
    Jun 20, 2012 at 1:40 pm
    Jul 12, 2012 at 9:44 am
  • Hi Hbase Users, I have seen API's supporting HFile direct reads and write. I Do understand it would create Hfiles in the location specified and it should be much faster since we would skip all the ...
    Samar kumarSamar kumar
    Jun 27, 2012 at 10:50 am
    Jun 28, 2012 at 10:39 pm
  • Dear all I am trying to optimize the retrieval code in Java for HBase. The following are the timings without cache enabled: The time taken to get 175347 columns of a row key is 677 ms The time taken ...
    Prakrati AgrawalPrakrati Agrawal
    Jun 25, 2012 at 6:13 am
    Jun 25, 2012 at 7:24 pm
  • Hi, I have read that Hbase has read committed as isolation level, but I have some doubts. Is it possible to chage this level, for instance to read uncommitted? How could I do this? Another question, ...
    Jun 15, 2012 at 12:05 pm
    Jun 22, 2012 at 9:31 am
  • Hello list, I have Hbase (CDH4) setup of a master and 4 region servers (8 core 2.0GHz, 6GB of RAM and 6 100GB disks per data node). All of them are virtual machines. I'm writing into a table using ...
    Giorgi JvaridzeGiorgi Jvaridze
    Jun 18, 2012 at 1:58 pm
    Jun 19, 2012 at 3:17 pm
  • Hi, I've been using my cluster last week for some tests. After that I've formatted the namenode with "hdfs namenode -format" and remove data from datanodes. However I can't recreate an old table ...
    Cyril ScetbonCyril Scetbon
    Jun 18, 2012 at 7:52 am
    Jun 18, 2012 at 1:12 pm
  • Hi, I have a three node Hadoop fully distributed cluster. I have HBase installed, also in fully distributed mode. I am interested in running rowcounter map reduce job bundled with HBase. I am doing ...
    AnandaVelMurugan Chandra MohanAnandaVelMurugan Chandra Mohan
    Jun 13, 2012 at 5:23 am
    Jun 13, 2012 at 8:57 am
  • Hi, I have a table with 2 column families. The column 'family1' has no qualifiers, and the row 'x' has the value \xFF. If I do the following HTable htable = new HTable(config, TABLE_NAME); Get get = ...
    Desert R.Desert R.
    Jun 10, 2012 at 3:51 pm
    Jun 11, 2012 at 9:38 pm
  • Is there any way to control introduce a different ordering scheme from the base comparable bytes? My use case is that I am using UTF-8 data for my keys, and I would like to have scans use UTF-8 ...
    Tom BrownTom Brown
    Jun 8, 2012 at 4:35 pm
    Jun 8, 2012 at 6:59 pm
  • Hi everyone, Does anyone have experience with full text search on HBase? I was reading about hbasene, but last update was 2 years ago. I also read about lily, I was planning to try out lily. But ...
    Jack chrispooJack chrispoo
    Jun 5, 2012 at 9:40 pm
    Jun 8, 2012 at 8:29 am
  • Hi all, I can't seem to understand if there is a way to dynamically load coprocessors ? The best way I found so far is using the shell: *alter 'URLS', METHOD = 'table_att', 'coprocessor'= ...
    Amit SelaAmit Sela
    Jun 7, 2012 at 1:13 pm
    Jun 8, 2012 at 2:20 am
  • (reference: A row consists of a key, and column families, along with a timestamp. So for example: key = ...
    S AhmedS Ahmed
    Jun 1, 2012 at 8:27 pm
    Jun 4, 2012 at 6:36 pm
  • Hi, I am getting heap space error while running Hbase on certain nodes of my cluster. I can't increase the Heapspace allocated to Hbase, is there some way in which I can prevent this from happening? ...
    Prakrati AgrawalPrakrati Agrawal
    Jun 28, 2012 at 8:13 am
    Jul 3, 2012 at 10:04 am
  • Hello all, Tonight in an AWS outtage we lost 11 out of 51 regionservers. All HMasters were unaffected, but the current active master continually spammed messages like this: 12/06/30 00:07:22 INFO ...
    Bryan BeaudreaultBryan Beaudreault
    Jun 30, 2012 at 5:05 am
    Jul 3, 2012 at 12:17 am
  • I "somewhat" have HBase up and running in a distributed mode. It starts fine, I can use "hbase shell" to create, disable, and drop tables; however, after a short period of time HMaster and the ...
    Jay WilsonJay Wilson
    Jun 30, 2012 at 8:03 am
    Jun 30, 2012 at 10:08 pm
  • Hi, I want to change my RDBMS to HBASE schema, to be used with Hadoop platform. I have changed two RDBMS tables into HBASE tables. I have ignored constraints, indexes and foreign key relationship ...
    Jun 28, 2012 at 6:59 pm
    Jun 29, 2012 at 9:19 pm
  • Hi, I have HBase master running on 3 nodes and region server on 4 other nodes on a Mapr hadoop cluster. We have been using it for a while, and it was working fine. Yesterday, we had a disk crash on ...
    Jun 28, 2012 at 6:30 pm
    Jun 29, 2012 at 5:57 pm
  • Hi :) I have a hbase table with rowkeys "1" ~ "10000000" with non-meaningful cells (each row has one cell that's about 10KB) I figured that all data was in one region, so I ran a command to split row ...
    Ben KimBen Kim
    Jun 28, 2012 at 7:08 am
    Jun 28, 2012 at 3:07 pm
  • I am starting out with a new application where I need to store users clickstream data. I'll have Visitor Id, session id along with other page related data. I am wondering if I should just key off ...
    Mohit AnchliaMohit Anchlia
    Jun 26, 2012 at 5:34 pm
    Jun 27, 2012 at 6:21 pm
  • Hi, In HBASE-1512 ( there is the implementation of co-processor for count and others. Is there anywhere an example of the way to use them? Because the ...
    Jean-Marc SpaggiariJean-Marc Spaggiari
    Jun 24, 2012 at 11:22 pm
    Jun 25, 2012 at 4:57 pm
  • Hi, Can I stop Hbase from any node in the cluster? We have a three node cluster and owner of two nodes is out of office. I have access to only one node now. Zookeeper is running in one of the two ...
    AnandaVelMurugan Chandra MohanAnandaVelMurugan Chandra Mohan
    Jun 22, 2012 at 6:17 am
    Jun 22, 2012 at 7:25 am
Group Navigation
period‹ prev | Jun 2012 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

130 users for June 2012

Jean-Marc Spaggiari: 33 posts Michel Segel: 28 posts NNever: 25 posts Mohammad Tariq: 24 posts Michael Stack: 22 posts Jean-Daniel Cryans: 21 posts AnandaVelMurugan Chandra Mohan: 15 posts Ramkrishna.S.Vasudevan: 15 posts Ted Yu: 13 posts Amandeep Khurana: 12 posts Anoop Sam John: 11 posts Cyril Scetbon: 11 posts Doug Meil: 11 posts Harsh J: 11 posts Infolinks: 11 posts Andrew Purtell: 10 posts Shashwat shriparv: 10 posts Lars George: 9 posts Ted Tuttle: 9 posts Ben Kim: 8 posts
show more