83 discussions - 561 posts

  • Hello, I am curious if there is a potential bottleneck in .META. ownership by a single region server. Is it possible (safe) to split meta region into several? -Jack
    Jack Levin
    Nov 23, 2010 at 5:31 pm
    Nov 27, 2010 at 3:43 pm
  • Hi I am trying to investigate the bulk load option as described in the following link. http://hbase.apache.org/docs/r0.89.20100621/bulk-loads.html Does anybody have sample code or have used it ...
    Shuja Rehman
    Nov 9, 2010 at 8:02 pm
    Nov 22, 2010 at 5:42 pm
  • Hi, Data seems to be taking up too much space when I put into HBase. e.g, I have a 2 GB text file which seems to be taking up ~70 GB when I dump into HBase. I have block size set to 64 MB and ...
    Hari Sreekumar
    Nov 10, 2010 at 5:21 am
    Nov 18, 2010 at 11:39 am
  • Hi, Is it necessary that all RegionServers must also be part of the ZK Quorum? I have a 4 node cluster, with node hadoop1 being master and hadoop2, hadoop3 and ejabber being the slaves (Both in case ...
    Hari Sreekumar
    Nov 21, 2010 at 6:22 pm
    Nov 23, 2010 at 5:41 pm
  • Hi, I am trying to explore HBasene for using HBase as a backend for lucene index store. But it seems that the current code in github is not in working stage, and there is no active development either ...
    Amit jaiswal
    Nov 30, 2010 at 4:49 am
    Jan 4, 2011 at 1:06 am
  • We are in the process of analyzing our options for the future purchases of our Hadoop/HBase DN/RS servers. Currently, we purchase Dell PowerEdge R710's which work well for us. However, we know that ...
    Jason Lotz
    Nov 3, 2010 at 1:21 pm
    Nov 7, 2010 at 6:31 pm
  • Hi I am reading data from raw xml files and inserting data into hbase using TableOutputFormat in a map reduce job. but due to heavy put statements, it takes many hours to process the data. here is my ...
    Shuja Rehman
    Nov 5, 2010 at 11:13 am
    Nov 9, 2010 at 4:25 pm
  • Hi all, After testing HBase for a few months with very light configurations (5 machines, 2 TB disk, 8 GB RAM), we are now planning for production. Our Load - 1) 50GB log files to process per day by ...
    Oleg Ruchovets
    Nov 21, 2010 at 1:53 pm
    Nov 22, 2010 at 5:03 pm
  • Hello All: I'm using YCSB to execute tests on the effects of region splits and I'm seeing some confusing results. The aim of the experiment was to determine how much of an impact on ...
    Suraj Varma
    Nov 12, 2010 at 6:52 pm
    Nov 19, 2010 at 5:48 am
  • Hello list, I'm kind of new to HBase, so I'll post this email with a request for comment. Very briefly, I do a lot of text processing with mapreduce, so it's very useful for me to convert string to ...
    Claudio Martella
    Nov 29, 2010 at 3:13 pm
    Dec 2, 2010 at 9:46 pm
  • Hi, I am running HBase configured in Pseudo-distributed mode (Hadoop also in Pseudo Distributed mode) Hadoop version: 0.20.2 HBase version: 0.20.6 On running any command (for instance, list) in HBase ...
    Alaukik Aggarwal
    Nov 17, 2010 at 4:51 am
    Nov 18, 2010 at 9:31 am
  • Hi All, I am new to hbase. I have been reading up on the documentation and studying how hbase suits our requirements. We want to be able to store a key and corresponding values. However, while reading, I ...
    Gayatri Rao
    Nov 12, 2010 at 1:30 pm
    Nov 18, 2010 at 9:55 am
  • Hey guys, I just want to get an idea about how everyone avoids these long GC pauses that cause regionservers to die. What kind of java heap and garbage collection settings do you use? What do you do ...
    Sean Sechrist
    Nov 24, 2010 at 3:02 pm
    Nov 29, 2010 at 5:30 pm
  • Hi, This problem is widely known, but I'm not able to come up with a decent solution for it. I'm scanning 1.000.000+ rows from one table in order to index their content. Each row has around 100 KB. ...
    Lucas Nazário dos Santos
    Nov 17, 2010 at 2:05 pm
    Nov 24, 2010 at 3:41 am
  • Hi, are there any features that let me control client access to my HBase, such as some authority, user, or password? Right now, one way to control my servers is to use iptables to control the access, ...
    梁景明
    Nov 3, 2010 at 8:56 am
    Nov 8, 2010 at 10:47 am
  • Hi, I am getting this exception while running m/r jobs on HBase: 10/11/21 02:53:01 INFO input.FileInputFormat: Total input paths to process : 1 10/11/21 02:53:01 INFO mapred.JobClient: Running job: ...
    Hari Sreekumar
    Nov 20, 2010 at 9:33 pm
    Nov 21, 2010 at 4:51 pm
  • Hi all, We are running 0.20.6 in production. On one of our nodes, we are seeing CPU (all 8 CPUS) hovering near 60%. But the node has many tables and many regions on it. Is there an easy way to find ...
    Vaibhav Puranik
    Nov 16, 2010 at 1:17 am
    Nov 19, 2010 at 12:23 am
  • Hi, To improve client performance I changed hbase.client.scanner.caching from 1 to 50. After running the client with the new value (hbase.client.scanner.caching = 50) it didn't improve execution time ...
    Oleg Ruchovets
    Nov 11, 2010 at 11:15 am
    Nov 11, 2010 at 7:03 pm
  • We have a Hadoop 0.20.2 + HBase 0.20.6 setup with three data nodes (12GB, 1.5TB each) and one master node (24GB, 1.5TB). We store a relatively simple table in HBase (1 column family, 5 columns, ...
    Henning Blohm
    Nov 19, 2010 at 1:09 pm
    Nov 23, 2010 at 7:45 am
  • Hello, Both packages have HBase 0.89.20100726 installed, the former is c1.xlarge and the latter is medium). I'm trying to install some extra packages (see [1]) By the time I've come to install R, I'm ...
    Saptarshi Guha
    Nov 19, 2010 at 10:12 pm
    Nov 19, 2010 at 10:44 pm
  • Hi, is there any option to make valueFilter check only the latest version's value? I want to compare just the latest version's value, but valueFilter compares against all the versions. If I use ...
    Shuja Rehman
    Nov 4, 2010 at 5:51 pm
    Nov 5, 2010 at 10:12 am
  • I have read comments on modeling one-to-many relationships in HBase and wanted to get some feedback. I have millions of customers, and each customer can make zero to thousands of orders. I want to ...
    Bryan Keller
    Nov 29, 2010 at 9:41 pm
    Nov 30, 2010 at 5:24 pm
  • Hi, How much of a guru do you have to be to keep say 5-10 servers humming? I'm a 1-man shop, and I dream of developing a web application, and scaling will be a core part of the application. Is it ...
    S Ahmed
    Nov 23, 2010 at 9:14 pm
    Nov 24, 2010 at 6:35 pm
  • When is it preferable to use HTablePool over HTable and vice-versa? If I am working on just one table, will using HTablePool potentially give me any performance improvements? hari
    Hari Sreekumar
    Nov 9, 2010 at 3:29 pm
    Nov 11, 2010 at 12:14 am
  • Hi, What is the cause of this exception? Is there a timeout value that can be modified to avoid this error? Does this error mean any problem with my setup, or is it normal to get these errors? In ...
    Hari Sreekumar
    Nov 24, 2010 at 4:38 am
    Nov 24, 2010 at 4:12 pm
  • Hello, Thanks to apurtell's GitHub repo of hbase-ec2, I managed to start an HBase cluster. Everything works nicely, I can check the UIs of the JT/NN and HBase Master. What I can't see are the ...
    Saptarshi Guha
    Nov 19, 2010 at 7:55 am
    Nov 22, 2010 at 6:49 pm
  • Hi, I just wanted to know if it is possible to copy an HBase table into another HDFS by simply copying the <tablename> directory from the old HDFS to the local system and dumping it into the new HDFS? ...
    Hari Sreekumar
    Nov 18, 2010 at 4:25 am
    Nov 18, 2010 at 8:02 pm
  • Hi, I am getting a lot of these RetriesExhaustedExceptions when I run my m/r job. This happens with the 116 server only. What could be the issue? I have checked that RS is running on that server, and ...
    Hari Sreekumar
    Nov 11, 2010 at 8:43 am
    Nov 12, 2010 at 6:49 pm
  • Hello, Let me introduce a new effort around HBase: HBaseHUT. It suggests a solution to a problem mentioned many times on this mailing list: "do Get on every Put operation to update record" (which causes bad ...
    Alex Baranau
    Nov 30, 2010 at 1:41 pm
    Dec 1, 2010 at 6:18 pm
  • Hi, From which version of HBase is this available. I have v0.20.6, but couldn't find this class when I did a jar tvf on the hbase-0.20.6.jar file I have. Thanks, Hari
    Hari Sreekumar
    Nov 26, 2010 at 7:13 am
    Nov 26, 2010 at 3:12 pm
  • Hello, I am very new to HBase and I hope to get some feedback from the community on this: I want to use HBase to store some data with pretty simple structure: each key has ~50 attributes. These data ...
    Nanheng Wu
    Nov 25, 2010 at 4:32 pm
    Nov 26, 2010 at 1:03 am
  • Hi all, I am running an MR job that is loading an HBase table in the reduce, and I am seeing hopeless performance - 10 million records of <1Kb in 2 hours so far. Please bear in mind I am software ...
    Tim Robertson
    Nov 24, 2010 at 7:00 pm
    Nov 24, 2010 at 8:15 pm
  • Hi all, I recently filled an hbase table with many millions of columns in each row (!). The problem that now occurred was that I always get a Heap Space Error from the JVM with a subsequent shutdown ...
    Fnord 99
    Nov 22, 2010 at 10:01 am
    Nov 23, 2010 at 2:28 pm
  • Hi, I am using the hbase shell to verify some things that I read in the post "Understanding HBase and BigTable". That post says: "If an application asks for a given row at a given timestamp, ...
    Pan W
    Nov 19, 2010 at 5:17 am
    Nov 22, 2010 at 9:58 am
  • When I use svn plugin within Eclipse to checkout the source code of HBase, it comes up with "Operation failed" message, which indicates: svn: Processing REPORT request response failed: Premature end ...
    Marcus Chou
    Nov 18, 2010 at 10:20 am
    Nov 19, 2010 at 4:39 am
  • Hi all, As far as I know, one user account could have only one HBase instance, either HMaster or HRegionServer, run on a single machine. My question is, how many cores could HBase utilize, or rather ...
    Marcus Chou
    Nov 28, 2010 at 12:44 pm
    Nov 30, 2010 at 4:06 pm
  • Hi All, I'm new to HBASE. I understand that HBASE keeps its data sorted in the filesystem. So when we insert randomly, it takes time to sort. Whereas when we insert sequentially, there is no need ...
    Rajgopalv
    Nov 29, 2010 at 12:31 pm
    Nov 30, 2010 at 4:46 am
  • Hello, I'm fairly new to HBase and would appreciate your comments. [1] One way to compute across an HBase dataset would be to run as many maps as regions, for each map, run a scan across the region row ...
    Saptarshi Guha
    Nov 18, 2010 at 4:31 am
    Nov 18, 2010 at 5:45 pm
  • I'm unit testing some of our internal framework that interacts with hbase (version cdh3b3 = 0.89.20100924+28). The unit test is small inserting and updating ~100 rows. My unit test expects the hbase ...
    Jeff Whiting
    Nov 12, 2010 at 12:11 am
    Nov 12, 2010 at 9:11 pm
  • Hi, I run a map/reduce job which writes to hbase from the reducers. I have to write 8-9 million rows. My environment: 5 servers dual core , 8G ram. HDFS -- 9TB hadoop has 1000 heap_size , hbase has 2000 ...
    Oleg Ruchovets
    Nov 8, 2010 at 3:52 pm
    Nov 9, 2010 at 8:30 pm
  • Hi, I have seen that the *stop-hbase.sh* script does not execute *hbase-daemons.sh stop regionservers*, although it does stop the master and zookeeper. Is this intentional? Why is it? Won't it create ...
    Hari Sreekumar
    Nov 8, 2010 at 11:04 am
    Nov 9, 2010 at 4:12 am
  • Hi, HBaser I'm currently trying to run HBase, but some errors occur. Running environment: CentOS release 5.5 hadoop-0.20.2 hbase-0.20.6 I use two machines to run hbase (just for illustrate this ...
    Pan.W
    Nov 4, 2010 at 8:13 am
    Nov 9, 2010 at 2:46 am
  • Hi Does anybody know how to increase the heap size of region servers? -- Regards Shuja-ur-Rehman Baig <http://pk.linkedin.com/in/shujamughal
    Shuja Rehman
    Nov 8, 2010 at 9:15 am
    Jul 9, 2014 at 9:20 pm
  • We are populating some HBase tables from daily data streams that are stored in Hive. When we see a row key that's already in the table, the data should be appended to that row's record. What is the ...
    Leo Alekseyev
    Nov 1, 2010 at 9:28 am
    Dec 24, 2010 at 5:40 pm
  • What is the relationship between the following configuration parameters? hfile.block.cache.size hbase.regionserver.global.memstore.upperLimit Is having the sum of the two values = 1 a problem? That ...
    Rod MacDonald
    Nov 25, 2010 at 6:07 pm
    Nov 26, 2010 at 12:40 am
  • Hi all, Please forgive this rather naive question - I have a cluster and want to decommission nodes (including the RS that hold the -ROOT- and .META.). Could someone please advise me the best way to ...
    Tim Robertson
    Nov 25, 2010 at 5:25 pm
    Nov 25, 2010 at 6:43 pm
  • I found some posts that were using hbase in production but I am really curious what data sizes they are currently at and how it is working out for them. Any comments/experiences on that environment? ...
    Hiller, Dean (Contractor)
    Nov 24, 2010 at 5:49 pm
    Nov 24, 2010 at 9:24 pm
  • Hello, I just wanted to make sure that I'm interpreting a series of common issues correctly. I saw ZK expirations causing regionserver failures, and this in a GC log of one of the regionservers: ...
    Stuart Smith
    Nov 9, 2010 at 6:53 pm
    Nov 14, 2010 at 2:19 am
  • Hi, I am quite new to hadoop and hbase, and I am having a hard time here figuring out some issues with my cluster, and I am pretty sure many of you have gone through many of the problems I am facing ...
    Hari Sreekumar
    Nov 12, 2010 at 5:05 am
    Nov 12, 2010 at 8:50 pm
  • Hello, I tried to set up HBase on 3 nodes. One of them has two network interfaces, one has a public IP address and the other has a private address. The other two nodes are inside a private network, so they ...
    Da Zheng
    Nov 11, 2010 at 6:24 am
    Nov 11, 2010 at 6:50 pm
Group Overview
group: user @
categories: hbase, hadoop
discussions: 83
posts: 561
users: 90
website: hbase.apache.org

90 users for November 2010

  • Hari Sreekumar: 50 posts
  • Lars George: 47 posts
  • Stack: 31 posts
  • Jean-Daniel Cryans: 30 posts
  • Michael Segel: 29 posts
  • Shuja Rehman: 27 posts
  • Todd Lipcon: 17 posts
  • Ted Yu: 16 posts
  • Andrew Purtell: 15 posts
  • Jack Levin: 15 posts
  • Oleg Ruchovets: 15 posts
  • Friso van Vollenhoven: 14 posts
  • Ryan Rawson: 13 posts
  • Saptarshi Guha: 13 posts
  • Buttler, David: 11 posts
  • Jonathan Gray: 10 posts
  • Henning Blohm: 8 posts
  • Alaukik Aggarwal: 7 posts
  • Erdem Agaoglu: 7 posts
  • Jonathan Gray: 7 posts