Grokbase Groups HBase user March 2011

Search Discussions

191 discussions - 1,095 posts

  • 1. How do I count rows fast in hbase? First I tired count 'test' , takes ages. Saw that I could use RowCounter, but looks like it is deprecated. When I try to use it, I get ...
    Vivek KrishnaVivek Krishna
    Mar 16, 2011 at 8:36 pm
    Mar 17, 2011 at 6:49 pm
  • Hi, I'm running some performance tests on a cluster with 5 member servers (not counting the masters of all kinds), each node running a data node, a region server and a thrift server. Each server has ...
    Eran KutnerEran Kutner
    Mar 28, 2011 at 10:17 am
    May 9, 2011 at 8:42 pm
  • Hi, To help avoid hotspots, I'm planning to use hashed keys in some tables. 1. I wonder if this strategy is adviced for range queries (from/to key) use case, because the rows will be randomly ...
    Eric CharlesEric Charles
    Mar 16, 2011 at 8:53 am
    Apr 21, 2011 at 3:18 pm
  • Hi, I'm trying to use replication between two HBase clusters and I'm encountering all kinds of crashes and weird behavior. First, it seems that starting a region server when the peer ZKs are not ...
    Eran KutnerEran Kutner
    Mar 22, 2011 at 5:40 pm
    Mar 29, 2011 at 11:30 am
  • Hi everyone, We are having this problem for a while and would really appreciate any suggestions. We have a 5 node cluster, 4 of them being region servers. I am running a custom workload with YCSB and ...
    M.Deniz OKTARM.Deniz OKTAR
    Mar 7, 2011 at 1:44 pm
    Mar 11, 2011 at 6:09 pm
  • All was well, until this happen: and all regionservers went down, is this xciever issue? <property <name dfs.datanode.max.xcievers</name <value 12047</value </property ...
    Jack LevinJack Levin
    Mar 10, 2011 at 6:31 pm
    Mar 31, 2011 at 6:21 am
  • I just had to upgrade our second cluster CDH3B4 (the 2GB log file problem, same as the reason for upgrading another cluster) and now the master is not coming up, it dies with this error: 2011-03-17 ...
    Chris TarnasChris Tarnas
    Mar 17, 2011 at 11:21 pm
    Mar 18, 2011 at 2:56 am
  • Hi Using hbase-0.20.6..This has happened quite often..Is this a known issue in 0.20.6 that we would n't see in 0.90.1 (or) see less of? ..Attempt to fix/avoid this earlier times by truncating table, ...
    Mar 29, 2011 at 2:39 pm
    Mar 31, 2011 at 4:17 pm
  • Hi, I recently set up a 2-node Hadoop and HBase cluster and am trying to load data into my HBase table using HBase client. The issue bothers me is that the data are always written into one node of ...
    Weiwei XiongWeiwei Xiong
    Mar 14, 2011 at 5:50 pm
    Mar 15, 2011 at 6:33 am
  • How to get the first or last row in the HBase table? like the min(), max() in mysql? Thank you.
    Weishung ChungWeishung Chung
    Mar 1, 2011 at 10:04 pm
    Mar 4, 2011 at 2:55 pm
  • Hi, We had a troubling experience today that I wanted to share. Our dev cluster got completely shut down by a developer by mistake, without said developer even realizing it. Here's how... We have ...
    Bill GrahamBill Graham
    Mar 3, 2011 at 1:23 am
    Mar 5, 2011 at 3:50 am
  • Hey folks, What would be the best approach for migrating away from a given region server implementation back to the default out-of-the box one? My goal here is to upgrade our cluster to 0.90 and ...
    George P. StathisGeorge P. Stathis
    Mar 24, 2011 at 7:15 pm
    Mar 25, 2011 at 4:22 am
  • Hi, What's the best place to learn about HBase replication? I found , but note how there is only a link there, and that link points to a 404. ...
    Otis GospodneticOtis Gospodnetic
    Mar 2, 2011 at 4:51 pm
    Mar 3, 2011 at 10:04 pm
  • How to copy HTable from one cluster to another cluster ? The table is very big . -- Thanks & Best regards jiajun
    Mar 9, 2011 at 10:28 am
    Mar 31, 2011 at 3:48 pm
  • I am using the Java client API to write 10,000 rows with about 6000 columns each, via 8 threads making multiple calls to the HTable.put(List<Put ) method. I start with an empty table with one column ...
    Bryan KellerBryan Keller
    Mar 13, 2011 at 8:15 am
    Mar 15, 2011 at 9:04 pm
  • Hello, does anybody use MapReduce streaming over HBase? When I use TableInputFormat, I get this line on std input: 72 6f 77 31 keyvalues={row1/family1:a/1298037737154/Put/vlen=1, ...
    Ondrej HolecekOndrej Holecek
    Mar 8, 2011 at 10:51 am
    Mar 9, 2011 at 11:41 pm
  • Hi all, I have just had a new requirement to store blob data. I recall seeing this discussed in the past, but my search only turned up references to Lily and the hbase hackathon: ...
    Buttler, DavidButtler, David
    Mar 8, 2011 at 6:34 pm
    Mar 9, 2011 at 10:14 pm
  • Hi, I tried to use my hbase-default.xml from 0.89 with my new 0.90.1 installation. I get a message stating "hbase-default.xml seems to be from an old version of hbase(null), this version is 0.90.1. ...
    Geoff HendreyGeoff Hendrey
    Mar 4, 2011 at 10:45 pm
    Mar 8, 2011 at 12:28 am
  • Hi, I am experiencing severe connection leak in my MR client that uses Hbase as input/output . Every job that uses TableInputFormat leaks 1 zookeeper connection per run as evidenced by netstat. I ...
    Dmitriy LyubimovDmitriy Lyubimov
    Mar 23, 2011 at 8:54 pm
    Apr 16, 2011 at 1:44 pm
  • I scan the table ,It just has 29000 rows and each row only has not reached 1 k . I save it to files which has 18M. But I used /app/cloud/hadoop/bin/hadoop fs -copyFromLocal , it has 99G . Why ? -- ...
    Mar 31, 2011 at 4:35 am
    Apr 1, 2011 at 4:52 pm
  • Some body execute the command : /app/cloud/hadoop/bin/hadoop fs -rmr /hbase/.META. So the regions of all tables is lost ,Can I rebuild the tables by hdfs files ? -- Thanks & Best regards jiajun
    Mar 30, 2011 at 7:07 am
    Mar 30, 2011 at 9:59 am
  • Hi, I am trying to create table in hbase v0.90.1 and I get the following error: 11/03/28 18:39:52 INFO zookeeper.ClientCnxn: Opening socket connection to server hadoop2/ 11/03/28 ...
    Hari SreekumarHari Sreekumar
    Mar 28, 2011 at 1:29 pm
    Mar 30, 2011 at 5:46 am
  • Is it possible using stargate interface to hbase, fetch all rows where more than one column family:<qualifier must be present? like :select rows which contains keyword:a and keyword:b ? Thanks
    sreejith P. K.sreejith P. K.
    Mar 24, 2011 at 1:19 pm
    Mar 25, 2011 at 5:38 pm
  • I am browsing through the package and was wondering what other file formats are available in hadoop other than SequenceFile and TFile? Is all data written through hadoop including those ...
    Weishung ChungWeishung Chung
    Mar 19, 2011 at 4:02 pm
    Mar 23, 2011 at 1:43 pm
  • Hi, What is the API or configuration for changing the default hash function for a specific htable. thanks, Lior
    Lior SchachterLior Schachter
    Mar 20, 2011 at 5:07 pm
    Mar 22, 2011 at 1:31 am
  • Friends, how do I best achieve intersection of sets of row ids suppose I have two tables with similar row ids how can I get the row ids present in one and not in the other? does things get better if ...
    Vishal KapoorVishal Kapoor
    Mar 11, 2011 at 4:09 am
    Mar 13, 2011 at 11:48 pm
  • Hi, I'm trying to figure out the root cause of the crush on a small HBase cluster and I need some help from the experts here. I tried to post my question earlier but it seems the message was blocked ...
    Tatsuya KawanoTatsuya Kawano
    Mar 5, 2011 at 2:27 am
    Mar 10, 2011 at 11:40 pm
  • Has anything changed with the way compression is handled in 0.90? I'm in the process of testing 0.90 CDH3B4 on my development machine (OS X). Up until 0.89 SNAPSHOTS, I have been able to re-use the ...
    George P. StathisGeorge P. Stathis
    Mar 28, 2011 at 3:04 pm
    Apr 1, 2011 at 12:52 pm
  • Trying to understand why out test program was generating so many threads (HBase 0.90.0), I discover that every time we instantiate HTable we get a new thread pool (ThreadPoolExecutor). This seems a ...
    Joe PallasJoe Pallas
    Mar 29, 2011 at 10:50 pm
    Mar 30, 2011 at 4:30 pm
  • I am trying to estimate the cost of hosting own HBase cluster vs using EC2. Could anyone give me some guidance? Cluster size ~ 6 to 8 nodes Usage ~ at least 12 hours/day with lot of read/write ...
    Weishung ChungWeishung Chung
    Mar 10, 2011 at 5:13 pm
    Mar 11, 2011 at 5:46 pm
  • Hi, I'm setting a 4-region-nodes hbase clusters, with the master running outside the data clusters. It works 'well'. The problem is after some sort of stress tests, say launching 20 threads, putting ...
    Mar 9, 2011 at 10:06 am
    Mar 10, 2011 at 10:46 pm
  • My cluster (10 nodes, hbase-0.20.6 + hadoop 0.20.2) is very very slow for any operation like disable table or delete. Master's thread dump says they are blocked by the metaScanner thread. When I ...
    Nanheng WuNanheng Wu
    Mar 2, 2011 at 1:08 am
    Mar 2, 2011 at 4:37 am
  • We had an issue a day ago with some OOME's on the region servers. The master shutdown ok, but most of the RegionServers didn't and so eventually had to kill -9 them. Brought it all back up and ran a ...
    Marc LimotteMarc Limotte
    Mar 6, 2011 at 12:23 am
    Apr 16, 2011 at 3:53 am
  • Dear Buddies, I need to re-calculate the entries in a hbase everyday, like let x = 0.9x everyday, to make the time has impact on the entry values. So I write a TableMapper to get the Entry, and ...
    Stanley XuStanley Xu
    Mar 25, 2011 at 2:37 am
    Mar 29, 2011 at 9:58 am
  • Hi all, We're experiencing a problem where a map-only job using TableInputFormat and TableOutputFormat to export data from one table into another is not reading all of the rows in the source table. ...
    Sean SechristSean Sechrist
    Mar 18, 2011 at 1:51 pm
    Mar 22, 2011 at 7:33 pm
  • Hi, Can anyone help please? How stable is Hbase when inserting data? I have seen a few emails regarding issues with this. The reason I ask is that we have built a 12 data-node cluster, with region ...
    Stuart ScottStuart Scott
    Mar 21, 2011 at 8:17 pm
    Mar 22, 2011 at 12:56 am
  • Hello experts, I have a scenario as follows, I need to maintain a huge table for a 'web crawler' project in HBASE. Basically it contains thousands of keywords and for each keyword i need to maintain ...
    sreejith P. K.sreejith P. K.
    Mar 15, 2011 at 5:19 pm
    Mar 17, 2011 at 4:56 am
  • Hi All, I am working on benchmarking different data stores to find the best fit for our use case. I would like to know views and suggestions of the HBase user and developer community on some of my ...
    Aditya SharmaAditya Sharma
    Mar 4, 2011 at 6:20 am
    Mar 7, 2011 at 11:54 pm
  • Hi, I have few basic question related to HBase shell. Please help me out in these issues. 1. When I start the HBase Shell I am not getting the HBase prompt. 2. When I enter the command wrongly shell ...
    James RamJames Ram
    Mar 1, 2011 at 9:37 am
    Mar 3, 2011 at 5:50 am
  • Hello guys, we are getting those errors: 2011-03-28 15:08:33,485 INFO org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace: src: /, dest: /, bytes: 66564, op: ...
    Jack LevinJack Levin
    Mar 28, 2011 at 11:19 pm
    Mar 30, 2011 at 5:29 pm
  • Hi, Unable to use the Increment function, can anybody suggest what am I doing wrong... I enter data by :- theput.add(Bytes.toBytes("uid"),Bytes.toBytes("1"), 1301087829999L + t, Bytes.toBytes(10)) ...
    Sulabh choudhurySulabh choudhury
    Mar 29, 2011 at 3:23 pm
    Mar 29, 2011 at 7:35 pm
  • Hello, I've set up a test HBase+Hadoop cluster yesterday and got the following error in logs during running MR job (which internally creates HTable for Reducer): KeeperErrorCode = ConnectionLoss for ...
    Alex BaranauAlex Baranau
    Mar 25, 2011 at 7:37 pm
    Mar 26, 2011 at 8:44 pm
  • Hi, In our tests, we've accumulated lots of WAL logs, in .logs, which leads to quite long time pause or even OOME when restarting either master or region server. We're doing sort of bulk import and ...
    Mar 17, 2011 at 7:12 am
    Mar 22, 2011 at 4:54 am
  • Hi, My Q is around the suggested or maximum number of CFs per table (see ) Consider the following use-case. * A multi-tenant system. * All ...
    Otis GospodneticOtis Gospodnetic
    Mar 17, 2011 at 6:30 am
    Mar 17, 2011 at 9:45 pm
  • Hi, We are trying a small hbase environment, including 2-node masters, 4-node regionservers, and 3-node zookeeper cluster. Now we're doing stress tests, using cells of bigger size(4mb - 15mb). What ...
    Mar 14, 2011 at 1:43 pm
    Mar 16, 2011 at 3:43 pm
  • I've subclassed RegionObserver and am overriding postPut. How does one obtain the row byte[] of the Put that generated the call? Is it available via from the familyMap? What is the purpose of the ...
    Jason RutherglenJason Rutherglen
    Mar 14, 2011 at 7:54 pm
    Mar 15, 2011 at 2:36 am
  • ~resending to Is there a best-practice for modeling multi-valued fields (fields that are repeated or collections of fields)? Our current data model allows for a User to ...
    Cameron LeachCameron Leach
    Mar 10, 2011 at 1:47 am
    Mar 12, 2011 at 12:47 am
  • Hi, Since HBase has a mechanism to replicate edit logs to another HBase cluster, I was wondering if people think it would be possible to implement HBase= Hive replication? (and really make the ...
    Otis GospodneticOtis Gospodnetic
    Mar 11, 2011 at 6:43 am
    Mar 11, 2011 at 7:52 pm
  • Hi, I have set up replication, and it is working. Now i am interested in the performance implications of it. What is the best way to approach this? Should I use the "verifyrep" mentioned at the ...
    Mark KerznerMark Kerzner
    Mar 7, 2011 at 2:22 pm
    Mar 7, 2011 at 11:51 pm
  • Hi, Last week I consulted he forum about hbase insertion optimization when the key format is : date_key. This key format is very good for efficient scans but creates hotspot a single region when ...
    Lior SchachterLior Schachter
    Mar 27, 2011 at 5:01 pm
    Mar 30, 2011 at 3:58 pm
Group Navigation
period‹ prev | Mar 2011 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

127 users for March 2011

Stack: 132 posts Jean-Daniel Cryans: 126 posts Ted Yu: 43 posts Ted Dunning: 32 posts 陈加俊: 29 posts Wei Shung Chung: 24 posts Chris Tarnas: 23 posts Otis Gospodnetic: 23 posts Ryan Rawson: 23 posts Vivek Krishna: 19 posts Buttler, David: 18 posts Suraj Varma: 18 posts Eran Kutner: 17 posts Lars George: 17 posts Venkatesh: 17 posts 茅旭峰: 17 posts Geoff Hendrey: 16 posts Oleg Ruchovets: 16 posts Jack Levin: 15 posts Peter Haidinyak: 15 posts
show more