Grokbase Groups HBase user June 2011
FAQ

Search Discussions

148 discussions - 830 posts

  • Hi, Not able to see my email in the mail archive..So sending it again...!!! Guys.. need your feedback..!! Thanks, Praveenesh ---------- Forwarded message ---------- From: praveenesh kumar ...
    Praveenesh kumarPraveenesh kumar
    Jun 6, 2011 at 8:49 am
    Jun 11, 2011 at 9:20 am
  • Hi, I have a cluster of 5 nodes with one large table that currently has around 12000 regions. Everything was working fine for relatively long time, until now. Yesterday I significantly reduced the ...
    Eran KutnerEran Kutner
    Jun 30, 2011 at 10:59 am
    Jul 7, 2011 at 6:44 pm
  • hello everybody i'm trying to scan my hbase table for reporting purposes the cluster has 4 servers: - server1: namenode, secondary namenode, jobtracker, hbase master, zookeeper1 - server2: datanode, ...
    Andreas ReiterAndreas Reiter
    Jun 6, 2011 at 8:50 am
    Jun 21, 2011 at 3:02 pm
  • Hi All, I am trying to build HBase-0.90.3 on my machine. I am using Hadoop 0.20 append version described in the page: ...
    Vandana AyyalasomayajulaVandana Ayyalasomayajula
    Jun 16, 2011 at 10:56 pm
    Jun 23, 2011 at 6:06 pm
  • I was reading through the HBase book and came across the following in *6.2. On the number of column families.<http://hbase.apache.org/book.html#number.of.cfs * * * *"HBase currently does not do well ...
    Leif WicklandLeif Wickland
    Jun 2, 2011 at 9:42 pm
    Jun 22, 2011 at 6:25 pm
  • I have a table with a hundred or so regions. When I look in the hbase web ui, I see that all the regions are on one server. Of course we have many other tables and lots of data. Some tables seem to ...
    Geoff HendreyGeoff Hendrey
    Jun 7, 2011 at 6:34 pm
    Jun 9, 2011 at 1:44 am
  • Hi, We have been using Hadoop in our project as a DFS cluster to store some critical information. This critical information is stored as zip files of about 3-5 MB in size each. The number of these ...
    Aditya Karanth AAditya Karanth A
    Jun 28, 2011 at 6:38 am
    Jul 2, 2011 at 5:32 am
  • Hi, We are using HBase 0.20.3 (hbase-0.20-0.20.3-1.cloudera.noarch.rpm) cluster in distributed mode with Hadoop 0.20.2 (hadoop-0.20-0.20.2+320-1.noarch). We are using pretty much default ...
    Srikanth P. ShreenivasSrikanth P. Shreenivas
    Jun 29, 2011 at 5:24 am
    Jul 19, 2011 at 5:58 pm
  • Hi HBase community, What are the current best-practices with respect to starting up an HBase cluster in EC2? I don't see any public AMI's newer than 0.89.xxx, and starting up that one it's, clear ...
    Jim R. WilsonJim R. Wilson
    Jun 4, 2011 at 6:28 pm
    Jun 23, 2011 at 9:33 pm
  • Hi, I need to store, say, 10M-100M documents, with each document having say 100 fields, like author, creation date, access date, etc., and then I want to ask questions like give me all documents ...
    Mark KerznerMark Kerzner
    Jun 4, 2011 at 12:57 am
    Jun 18, 2011 at 6:28 am
  • We have certain tables with under 10 rows, one under 200 rows and one with 1,000,000 rows. We have found out that having a copy/cache on each node is EXTREMELY fast for our batch processing since ...
    Hiller, Dean x66079Hiller, Dean x66079
    Jun 8, 2011 at 4:01 pm
    Jun 12, 2011 at 3:21 pm
  • Hey Guys, I plan to do a tech talk here at ImageShack, on how we store and serve about 200ml images from HBASE. The stats of our are: 60 Region Servers running HBASE Configured Capacity : 517.44 TB ...
    Jack LevinJack Levin
    Jun 8, 2011 at 12:08 am
    Jul 1, 2011 at 9:10 pm
  • Hi everybody, it was not an easy way to run a map reduce job at all, ie if a third party jars are involved... a good help is the article by cloudera: ...
    Andre ReiterAndre Reiter
    Jun 22, 2011 at 9:05 am
    Jun 24, 2011 at 10:12 pm
  • I am changing the subject to reflect the discussion... If we only load data in bulk (that is, via doBulkLoad(), not using TableOutputFormat), do we still risk data loss? My understanding is that ...
    Andreas NeumannAndreas Neumann
    Jun 22, 2011 at 9:59 pm
    Jun 24, 2011 at 1:08 am
  • Hi Lars, I've given your hbase-book link on github [1] to Ioan (GSoC2011, see previous mail I just sent) to help him dig into the HBase API. I've also checked-out your repo to learn more, the basic ...
    Eric CharlesEric Charles
    Jun 11, 2011 at 3:30 pm
    Jun 15, 2011 at 4:00 pm
  • Hello, does anyone have any tools you could share that would take a table, and dump the contents as TSV text format? We want it in tsv for quick HIVE processing that we have in the another datamining ...
    Jack LevinJack Levin
    Jun 6, 2011 at 10:58 pm
    Jun 8, 2011 at 2:36 am
  • Hello there, We have a number of different groups within our organization who will soon be working within the same HBase cluster and we're trying to set up some best practices to keep thinks ...
    Bill GrahamBill Graham
    Jun 13, 2011 at 10:32 pm
    Jun 20, 2011 at 9:16 pm
  • Hello list, I was a few days ago at SIGMOD and was happy to attend Facebook's talk on HBase. As I could understand their workflow makes heavy use of incremental couters for analytics and so is mine. ...
    Claudio MartellaClaudio Martella
    Jun 18, 2011 at 4:01 pm
    Jun 20, 2011 at 6:29 pm
  • Hi, I am wondering if anybody let me know that how Hbase redirects the input row to particular region server? What is the exact algorithm which is used to distribute the incoming rows to particular ...
    Shuja RehmanShuja Rehman
    Jun 15, 2011 at 11:25 am
    Jun 15, 2011 at 10:41 pm
  • Hi, I am getting following errors while trying to transfer data from hdfs to hbase. Table at hbase: hbase(main):007:0 describe 'movies' DESCRIPTION ENABLED {NAME = 'movies', FAMILIES = [{NAME = ...
    Prashant SharmaPrashant Sharma
    Jun 14, 2011 at 10:14 am
    Jun 15, 2011 at 5:29 pm
  • Hi, I am not able to find information regarding the algorithm that decides which region a particular row belongs to in an HBase cluster. Does the algorithm take into account the number of physical ...
    Sam SeigalSam Seigal
    Jun 3, 2011 at 7:36 am
    Jun 10, 2011 at 6:12 am
  • Dear friends, Please suggest a standard hardware configuration for hbase cluster which is going to be used to pull and store a lot of data. -- Thanks, Shah
    Shahnawaz SaifiShahnawaz Saifi
    Jun 7, 2011 at 8:11 am
    Jun 10, 2011 at 5:32 am
  • Hi, We're trying to come up with right strategy for backing up HBase tables. Assumption is that sizes of tables will not grow beyond few hundred GB. Currently, we're employing exports (writing onto ...
    Manoj MurumkarManoj Murumkar
    Jun 8, 2011 at 5:22 pm
    Jun 9, 2011 at 9:19 am
  • Hi, Given the tableName, startKey and endKey for a region how do I get hold of the encodedName? We have code for identifying overlapping regions that outputs triples of the form tableName, startKey ...
    James HammertonJames Hammerton
    Jun 8, 2011 at 4:22 pm
    Jun 8, 2011 at 5:11 pm
  • Hi, Can anyone let me know how to get the size of complete table and w.r.t region servers? e.g Table 1 Total Size = 100MB Table1 RegionServer1 Size =30 MB Table 1 RegionServer2 Size = 70 MB Thanks -- ...
    Shuja RehmanShuja Rehman
    Jun 14, 2011 at 9:26 am
    Jun 30, 2011 at 5:05 pm
  • Hello! I have the following scenario: 1. A temporary HBase table with small number of rows (aprox 100) 2. A cluster with 2 machines that I would like to crunch the data contained in the rows I would ...
    Florin PFlorin P
    Jun 27, 2011 at 8:53 am
    Jun 30, 2011 at 12:24 pm
  • We have been testing random reads and from a 6 node cluster (1NN, 5DN, 1HM, 5RS each with 48G, 5 disks) right now seeing a throughput of 1100 per sec per node. Most of the configs are default, except ...
    Sateesh LakkarsuSateesh Lakkarsu
    Jun 24, 2011 at 1:08 am
    Jun 24, 2011 at 11:47 pm
  • Is it possible to insert lucene indexes to hbase table.I am very new to this hbase. Provide me some suggestions. If we store them in hbase can we run in multi node environment -- View this message in ...
    RsriramtceRsriramtce
    Jun 18, 2011 at 5:20 pm
    Jun 24, 2011 at 2:49 pm
  • Hi all, I'm new in HBase. I want to insert 4'000'000 rows in HBase (each row has 4 columns). I have already looked the HBase wiki to insert data, but i've a problem : i loss data. When i do a COUNT ...
    Laurent HatierLaurent Hatier
    Jun 20, 2011 at 5:36 pm
    Jun 21, 2011 at 2:25 am
  • We are on hbase 0.90 and using hbase for a while to perform high volume data lookup using hbase client (no map-reduce involved). Recently we observed that our "get" latencies keep increasing over the ...
    Abhijit PolAbhijit Pol
    Jun 8, 2011 at 11:40 pm
    Jun 13, 2011 at 8:52 pm
  • [Re-sending as I'm not sure this got through] Hi, Before trying to merge regions on a table in our live database we decided to copy the table and merge the regions on the copy first to test the ...
    James HammertonJames Hammerton
    Jun 10, 2011 at 10:19 am
    Jun 13, 2011 at 9:58 am
  • Hi, Where can I find the targeted release date of 0.92.0? Thanks. Ming
    Ma, MingMa, Ming
    Jun 8, 2011 at 11:43 pm
    Jun 10, 2011 at 9:44 am
  • Hello, I am trying to autogen some code off of 90.3. I made some custom additions to our thrift server, however the code that gets generated uses ByteBuffers as opposed to byte[]. How can I get ...
    Matthew WardMatthew Ward
    Jun 1, 2011 at 12:14 am
    Jun 1, 2011 at 1:47 am
  • HFileOutputFormat supports only one family in official distribution, and HBASE-1861 added multi-family support. Take no account of HBASE-1861, is that a right way to use one RecordWriter for each ...
    Gan, XiyunGan, Xiyun
    Jun 27, 2011 at 3:43 am
    Jun 28, 2011 at 2:18 am
  • Hi all, I know it's maybe ridiculous but i have problems with HBase installation in cluster. I will have follow 2-3 tutorials on it but it doesn't work. Well, first i would like to know if it ...
    Laurent HatierLaurent Hatier
    Jun 22, 2011 at 8:24 am
    Jun 23, 2011 at 1:15 am
  • hi folks, at the moment our architecture looks like this: the cluster has 4 servers: - server1: namenode, secondary namenode, jobtracker, hbase master - server2: datanode, tasktracker, hbase ...
    Andre ReiterAndre Reiter
    Jun 21, 2011 at 8:44 am
    Jun 22, 2011 at 3:45 am
  • I tried to use TableMapper and TableOutputFormat in from org.apache.hadoop.hbase.mapreduce to write a map-reduce which incremented some columns. I noticed that TableOutputFormat.write() doesn't ...
    Leif WicklandLeif Wickland
    Jun 17, 2011 at 8:43 pm
    Jun 21, 2011 at 6:36 pm
  • Hi, I'm having trouble with using the importtsv tool. I ran the following command: hadoop jar hadoop_sws/hbase-0.90.0/hbase-0.90.0.jar importtsv -Dimporttsv.columns=HBASE_ROW_KEY ,b_info:name, ...
    James RamJames Ram
    Jun 15, 2011 at 6:30 am
    Jun 16, 2011 at 4:35 pm
  • Hello everybody! I tried to implement my first HBase test application using Eclipse... I made it work and I have my table into my Hbase database with a single row... My problem is when I execute: ...
    HbaserHbaser
    Jun 16, 2011 at 9:41 am
    Jun 16, 2011 at 4:16 pm
  • I see a lot of resolved issues under HBASE-1295, however I'm not sure what the state of replication is. Eg, can one implement live MySQL'ish streaming master - slave replication today?
    Jason RutherglenJason Rutherglen
    Jun 11, 2011 at 6:06 pm
    Jun 13, 2011 at 4:31 pm
  • Hi, Is it possible to read a file inside HDFS using HBase. If yes please help me with way. What are all the class required to do it? Do i need to use MapReduce? -- With Regards, Karthik
    Karthik KumarKarthik Kumar
    Jun 6, 2011 at 3:15 pm
    Jun 8, 2011 at 4:28 am
  • I have a feature request: There should be a native function called 'count', that produces count of rows based on specific family filter, that is internal to HBASE and won't be required to read CELLs ...
    Jack LevinJack Levin
    Jun 3, 2011 at 10:20 pm
    Jun 6, 2011 at 8:15 pm
  • Hey, Could anyone give me suggestion for Hadoop/HBase upgrade? We're currently using apache hadoop 0.20.2 + hbase 0.20.3 + zookeeper-3.2.2. Has anyone done with latest stable version of ...
    Zhong, ShengZhong, Sheng
    Jun 8, 2011 at 4:26 pm
    Jul 19, 2011 at 4:17 pm
  • I've implemented my own coprocessor client, protocol and implementation that returns back to the user a List of KeyValues with values that match some criteria. I've tested this on a small table with ...
    Nichole TreadwayNichole Treadway
    Jun 29, 2011 at 10:25 pm
    Jul 1, 2011 at 6:12 am
  • Hi, I want to add a column family to a existing table. I used the following code but it shows that descriptor cannot be modified. try { HTableDescriptor descriptor = new ...
    Eranda SooriyabandaraEranda Sooriyabandara
    Jun 17, 2011 at 6:11 pm
    Jun 30, 2011 at 5:30 pm
  • Current version .20.6 Upgrade target 90.3 So Hbase .20.6 uses Thrift .2 and Hbase .90.3 makes use of Thrift .5. We currently use the supplied ThriftServer in Hbase. In my initial testing for the ...
    Karl KuntzKarl Kuntz
    Jun 22, 2011 at 6:59 pm
    Jun 27, 2011 at 10:41 pm
  • I would like to use MultipleTextOutputFormat, which is only available with the old Hadoop API (mapred). The mapred version of TableMapReduceUtil does not seem to support the use of a Scan object. Is ...
    Chan, TimChan, Tim
    Jun 27, 2011 at 8:26 pm
    Jun 27, 2011 at 9:40 pm
  • Hello, I have started to test HBase and Hive-HBase integration in our test Hadoop cluster last week. Everything was working fine until both HDFS and HBase crashed because of a space issue. We fixed ...
    N.N. GesliN.N. Gesli
    Jun 16, 2011 at 1:06 am
    Jun 17, 2011 at 12:30 am
  • Hi All. I have a question about logical division of hbase cluster, means dividing the region servers w.r.t tables. This means that If I have let say 10 computers in cluster then table t1,t2 should be ...
    Shuja RehmanShuja Rehman
    Jun 13, 2011 at 1:29 pm
    Jun 14, 2011 at 7:53 pm
Group Navigation
period‹ prev | Jun 2011 | next ›
Group Overview
groupuser @
categorieshbase, hadoop
discussions148
posts830
users147
websitehbase.apache.org

147 users for June 2011

Stack: 119 posts Jean-Daniel Cryans: 51 posts Doug Meil: 34 posts Andrew Purtell: 31 posts Joey Echeverria: 20 posts Ted Yu: 20 posts Praveenesh kumar: 19 posts Andre Reiter: 18 posts Ted Dunning: 18 posts Buttler, David: 15 posts Eric Charles: 15 posts Robert Gonzalez: 15 posts Bijieshan: 14 posts Hiller, Dean x66079: 14 posts James Ram: 14 posts Sam Seigal: 14 posts Jason Rutherglen: 13 posts Michel Segel: 13 posts Bill Graham: 12 posts Zhong, Sheng: 12 posts
show more
Archives