Search Discussions

142 discussions - 802 posts

  • You've checked out the perf section of the refguide? http://hbase.apache.org/book.html#performance And have you read the postings by the GBIF lads starting with this one ...
    Michael StackMichael Stack
    Aug 12, 2012 at 8:18 am
    Aug 24, 2012 at 5:28 pm
  • Hi there I have a few tables which show up in a "list" in the shell, but produce "table not found" when performing any operation on them. There is no reference of them in the .META. table. It seems ...
    Marco GallottaMarco Gallotta
    Aug 10, 2012 at 9:22 pm
    Aug 11, 2012 at 4:38 am
  • Hello guys, In your experience, is it practical to use HBase directly for serving? Saying handle directly user traffic (tens of thousands QPS scale) behind Apache, and replace the role of memcached? ...
    Lin MaLin Ma
    Aug 18, 2012 at 7:12 am
    Aug 23, 2012 at 5:06 am
  • Hi All, I recently updated my cluster from HBase 0.90 to HBase 0.92. One replica of one table used to take 90 GB in 0.90 but the same table takes 45 GB in 0.92(HFilev2). The table has 1 column family ...
    Anil guptaAnil gupta
    Aug 14, 2012 at 6:16 pm
    Aug 28, 2012 at 7:08 pm
  • Hello guys, According to the notes by Werner*, "*He presented the CAP theorem, which states that of three properties of shared-data systems—data consistency, system availability, and tolerance to ...
    Lin MaLin Ma
    Aug 8, 2012 at 1:29 am
    Aug 9, 2012 at 8:44 am
  • We have a small Hbase cluster on EC2 with 6 region servers. Lately we found that the data in one of the column families is really not that useful for us and decided to chuck it. This particular ...
    Prem yadavPrem yadav
    Aug 7, 2012 at 6:51 am
    Aug 18, 2012 at 4:20 pm
  • Hi Hbase users, If all the rows of an region are deleted what happens to the region. Would it still exist . It can be removed by writing some code but can it be removed automatically. Also what is ...
    Samar kumarSamar kumar
    Aug 7, 2012 at 10:24 am
    Aug 13, 2012 at 8:51 am
  • If I use md5 hash + timestamp rowkey would hbase automatically detect the difference in ranges and peforms split? How does split work in such cases or is it still advisable to manually split the ...
    Mohit AnchliaMohit Anchlia
    Aug 29, 2012 at 10:57 pm
    Aug 31, 2012 at 3:32 pm
  • Hi all, iam new user of Hbase and i need help with secondary indexes. For example i have messages and users. Each user has many messages. Data structure will be like this: Message: - String id - Long ...
    Lukáš DrbalLukáš Drbal
    Aug 12, 2012 at 12:46 pm
    Aug 15, 2012 at 3:57 am
  • Just read this article, "Solving Big Data Challenges for Enterprise Application Performance Management." published this month @ Volume 5, No.12 of Proceedings of the VLDB Endowment, where they ...
    Cristofer WeberCristofer Weber
    Aug 30, 2012 at 1:27 pm
    Aug 31, 2012 at 1:48 pm
  • Hello guys, I am referencing the Big Table paper about how a client locates a tablet. In section 5.1 Tablet location, it is mentioned that client will cache all tablet locations, I think it means ...
    Lin MaLin Ma
    Aug 18, 2012 at 9:14 am
    Aug 24, 2012 at 1:20 am
  • The prefix encoding applies to blocks in the HFiles and in the block cache, but not to the memstore. #1 Yes. Each column family is its own store. All stores are flushed together, so have many add ...
    Lars hofhanslLars hofhansl
    Aug 25, 2012 at 9:57 pm
    Aug 28, 2012 at 7:31 pm
  • Probably what you read was in context of Column Families. HBase has concept of column family similar to Google's bigtable. And the store files on disk is per column family. All columns of a given ...
    Mohit AnchliaMohit Anchlia
    Aug 5, 2012 at 4:08 pm
    Aug 12, 2012 at 10:17 am
  • Dear all, According to my experiences, it is very slow for HBase to save data? Am I right? For example, today I need to save data in a HashMap to HBase. It took about more than three hours. However ...
    Bing LiBing Li
    Aug 29, 2012 at 2:05 pm
    Aug 30, 2012 at 8:45 am
  • Hi HBase community: I tried to use setTimeRange and setMaxVersions to limit the number of KVs return per column. The behaviour is as I would expect that is setTimeRange(0, T + 1) and ...
    Jerry LamJerry Lam
    Aug 27, 2012 at 9:41 pm
    Aug 29, 2012 at 6:37 pm
  • In a nutshell: - Puts are collected in memory (in a sorted data structure) - When the collected data reaches a certain size it is flushed to a new file (which is sorted) - Gets do a merge sort ...
    Lars hofhanslLars hofhansl
    Aug 21, 2012 at 11:07 pm
    Aug 23, 2012 at 2:13 pm
  • My hbase version is 0.92. I tried something as follows: 1.Created a table 'test' with 'course' in which ttl=5. 2. inserted one row into the table. 5 seconds later, the row was deleted. Later when I ...
    Aug 14, 2012 at 10:05 am
    Aug 22, 2012 at 10:02 am
  • Hi How can I enable encryption on hbase ? if I want to use encryption in a cluster with hbase running on top of hadoop,what can I do? Can't hadoop be configured to automatically encrypt each file ...
    Farrokh ShahriariFarrokh Shahriari
    Aug 7, 2012 at 11:07 am
    Aug 9, 2012 at 6:00 am
  • There is no such functionality currently, and there is no good way to simulate that. Currently that cannot even be done with a coprocessor endpoint, because region coprocessors have no way to create ...
    Lars hofhanslLars hofhansl
    Aug 7, 2012 at 3:55 pm
    Aug 8, 2012 at 2:05 am
  • Hi there! After I started studying HBase, I've searched for open source projects backed by HBase and I found Titan distributed graph database (you probably heard about it). As soon as I read in their ...
    Cristofer WeberCristofer Weber
    Aug 30, 2012 at 11:47 pm
    Aug 31, 2012 at 12:58 pm
  • I think for the refresh case, client first uses the older region server derived from its cache it then connects to that older region server which responds with a failure code. and then client talks ...
    Pamecha, AbhishekPamecha, Abhishek
    Aug 23, 2012 at 6:48 am
    Aug 28, 2012 at 11:04 am
  • Hello David, You could take the String.valueOf(yourInt) and store the Bytes.toBytes(yourString) representation of this amount as column qualifier. Regards, Ioakim
    Ioakim PerrosIoakim Perros
    Aug 11, 2012 at 3:41 pm
    Aug 15, 2012 at 10:40 pm
  • Are clients local to slave DC able to read data from HBase slave when replicating data from one DC to remote DC? If not then is there a way to design such a thing where clients are able to actively ...
    Mohit AnchliaMohit Anchlia
    Aug 18, 2012 at 12:36 am
    Sep 11, 2013 at 9:26 am
  • We ran into this situation today on our production instance, I'm wondering if anyone has seen or knows a way around it. We were repeatedly updating the same 60,000 column wide row. With each update, ...
    Aug 6, 2012 at 9:39 pm
    Aug 16, 2012 at 7:14 pm
  • Hi HBase users, I need to pull data from 2 HBase tables in a mapreduce job. For 1 table input, I use TableMapReduceUtil.initTableMapperJob. Is there another method for multitable inputs ? Thank you, ...
    Weishung ChungWeishung Chung
    Aug 10, 2012 at 1:11 pm
    Aug 10, 2012 at 2:53 pm
  • Hello Amlan, Issue is still unresolved...Will get fixed in 0.96.0. Regards, Mohammad Tariq
    Mohammad TariqMohammad Tariq
    Aug 6, 2012 at 11:36 am
    Aug 6, 2012 at 4:05 pm
  • We are running Hbase 0.94 with Hadoop 1.0.3. We use Python thrift to talk to Hbase. We are experiencing strange behavior when scanning specific rows from Hbase (Caching is always set to 1 in the ...
    Jay TJay T
    Aug 28, 2012 at 8:35 pm
    Aug 31, 2012 at 4:53 am
  • Hi there, While I'm performing read-intensive benchmarks, I'm seeing storm of "CallerDisconnectedException" in certain RegionServers. As the documentation says, my client received a ...
    Adrien MogenetAdrien Mogenet
    Aug 23, 2012 at 5:25 pm
    Aug 28, 2012 at 8:03 am
  • There was a question [1] in https://issues.apache.org/jira/browse/HBASE-6509JIRA comment, it makes more sense to answer it here. With the current FuzzyRowFilter I believe the only way to approach the ...
    Alex BaranauAlex Baranau
    Aug 17, 2012 at 8:42 pm
    Aug 22, 2012 at 10:42 pm
  • Hi, Please use the user mailing list (added at dest) for this type of questions instead of the dev list (now in bcc). It's a little bit strange to use the full distributed mode with a single region ...
    N KeywalN Keywal
    Aug 22, 2012 at 7:00 am
    Aug 22, 2012 at 6:46 pm
  • Hey David, Are you running your HBase instance in pseudo-distributed/fully-distributed mode or the standalone one? Does "hbase hbck" report any inconsistencies? -- Harsh J
    Harsh JHarsh J
    Aug 14, 2012 at 3:49 pm
    Aug 17, 2012 at 12:32 pm
  • I have a table on a 4 node test cluster. I also have some other tables on the cluster. The table in question has a total of 12 regions. I noticed that 1 node has 6 regions, another has zero, and the ...
    Bryan KellerBryan Keller
    Aug 2, 2012 at 6:08 am
    Aug 2, 2012 at 5:43 pm
  • Hi HBase guru: From Lars George talk, he mentions that filter has no state. What if I need to scan rows in which the decision to filter one row or not is based on the previous row's column values? ...
    Jerry LamJerry Lam
    Aug 1, 2012 at 9:45 pm
    Aug 2, 2012 at 4:28 pm
  • Hi, We wrote an HBase Refcard and published it via DZone. Here is our very brief announcement: http://blog.sematext.com/2012/08/06/announcing-hbase-refcard/ . The PDF refcard can be had from ...
    Otis GospodneticOtis Gospodnetic
    Aug 8, 2012 at 11:14 pm
    Apr 9, 2013 at 6:20 pm
  • Hi, We have some memory intensive endpoint coprocessors running on our RegionServers. As a result, we want to allocate more heap for the coprocessors, but there doesn't seem to be much documentation ...
    Young KimYoung Kim
    Aug 30, 2012 at 8:46 pm
    Sep 1, 2012 at 7:55 am
  • Hi Sonal, Thanks for your reply. How to add a new column to the existing columnFamily?The method I want to try is using 3 steps, first get the record, construct a new put, using the reocrd's( getted ...
    Jing wangJing wang
    Aug 24, 2012 at 4:34 am
    Aug 24, 2012 at 5:30 am
  • Hi, Can you go to HDFS and check if you have the file:blk_-7841650651979512601_ 775949 and its size ? Its location is probably somewhere in /hbase/.logs/<regionserver , ...
    Khang PhamKhang Pham
    Aug 20, 2012 at 2:52 pm
    Aug 23, 2012 at 3:56 pm
  • Currently I'm about to design HBase tables. In my case there is table1 with CF1 holding millions/billions of rows and CF2 with hundreds of rows. Read use cases include reading both CF data by key or ...
    Christian SchäferChristian Schäfer
    Aug 20, 2012 at 11:47 am
    Aug 22, 2012 at 8:06 am
  • Hello, I have a problem where 'put' with timestamp does not succeed. I did the following at the HBase shell. (1) Do 'put' with timestamp. # 'scan' shows 1 row. (2) Delete the row by 'deleteall'. # ...
    Takahiko KawasakiTakahiko Kawasaki
    Aug 14, 2012 at 2:54 pm
    Aug 15, 2012 at 8:47 pm
  • hi, hbase users. I got a problem when I am testing coprocessors which is released at base 0.92.1. here is the hbase shell outputs ---------------- hbase(main):001:0 truncate 'blog' Truncating 'blog' ...
    Aug 9, 2012 at 10:36 am
    Aug 11, 2012 at 1:11 am
  • I wrote a tutorial around HBase, JRuby and Pig that I thought would be of interest to the HBase users list: http://hortonworks.com/blog/pig-as-hadoop-connector-part-two-hbase-jruby-and-sinatra/ -- ...
    Russell JurneyRussell Jurney
    Aug 27, 2012 at 1:32 pm
    Aug 27, 2012 at 6:26 pm
  • Hi all, Something has changed in how OSX and java handles IPv6, and now you will get a log like: 2012-07-31 18:21:39,824 INFO org.apache.hadoop.hbase.master.HMaster: Server active/primary master ...
    Ryan RawsonRyan Rawson
    Aug 1, 2012 at 1:24 am
    Aug 13, 2012 at 7:52 am
  • Hello sir, Going for a single table with 30+ rows would be a better choice, if the data from all the sources is not very different. Since, you are considering Hbase as your data store, it wouldn't be ...
    Mohammad TariqMohammad Tariq
    Aug 5, 2012 at 9:57 pm
    Aug 9, 2012 at 8:22 am
  • I have an 8 node cluster and a table that is pretty well balanced with on average 36 regions/node. When I run a mapreduce job on the cluster against this table, the data locality of the mappers is ...
    Bryan KellerBryan Keller
    Aug 2, 2012 at 6:32 am
    Aug 7, 2012 at 3:48 pm
  • Well, if the file that you have contains TSV, you can directly use the ImportTSV utility of HBase to do a bulk load. More details about that can be found here ...
    Bijeet SinghBijeet Singh
    Aug 4, 2012 at 4:55 am
    Aug 5, 2012 at 5:32 am
  • Hi I probably know the usual answer but are there any tricks to do some sort of sort by value in HBase. The only option I know is to somehow embed value in the key part. The value is not a timestamp ...
    Pamecha, AbhishekPamecha, Abhishek
    Aug 28, 2012 at 11:11 pm
    Aug 31, 2012 at 7:42 am
  • Hi, You need to complete the bulk load. Check out http://hbase.apache.org/book/arch.bulk.load.html 9.8.2 Igal.
    Igal ShilmanIgal Shilman
    Aug 28, 2012 at 5:08 pm
    Aug 29, 2012 at 2:22 pm
  • In timeseries type data how do people deal with scenarios where one might get multiple events in a millisecond? Using nano second approach seems tricky. Other option is to take advantage of versions ...
    Mohit AnchliaMohit Anchlia
    Aug 29, 2012 at 12:21 am
    Aug 29, 2012 at 7:39 am
  • Thanks for your quick reply. The co-processor looks like: public void postGet(final ObserverContext<RegionCoprocessorEnvironment e, final Get get, final List<KeyValue results) { if table is X get ...
    Aug 28, 2012 at 3:50 am
    Aug 29, 2012 at 2:16 am
  • Anyone out there actively using the thrift2 interface in 0.94? Thrift bindings for C++ don’t seem to handle optional arguments too well (that is to say, it seems that optional arguments are not ...
    Joe PallasJoe Pallas
    Aug 21, 2012 at 3:29 pm
    Aug 28, 2012 at 7:24 pm
Group Navigation
period‹ prev | Aug 2012 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

148 users for August 2012

Michael Stack: 62 posts Lars hofhansl: 54 posts Lin Ma: 36 posts Mohamed Zahoor: 35 posts Anil Gupta: 34 posts Mohit Anchlia: 26 posts Mohammad Tariq: 25 posts Harsh J: 21 posts Jerry Lam: 19 posts Jean-Daniel Cryans: 18 posts Michel Segel: 18 posts Gurjeet Singh: 17 posts Marco Gallotta: 16 posts Alex Baranau: 15 posts Pamecha, Abhishek: 15 posts N Keywal: 14 posts Christian Schäfer: 13 posts Andrew Purtell: 11 posts Ioakim Perros: 11 posts Yonghu: 10 posts
show more