90 discussions - 363 posts

  • Hi, Does anyone have any tips to share regarding optimization for random read performance? For writes I've found that setting a large write buffer and setting auto-flush to false on the client side ...
    James Baldassari
    Feb 15, 2010 at 11:46 pm
    Feb 18, 2010 at 8:31 pm
  • Hi, I use org.apache.hadoop.hbase.filter.PrefixFilter in my export utility. I like the flexibility of RegExpRowFilter but it cannot be used in Scan.setFilter(org.apache.hadoop.hbase.filter.Filter) ...
    Ted Yu
    Feb 27, 2010 at 5:06 pm
    Mar 4, 2010 at 8:20 pm
  • Hi, I got http://issues.apache.org/jira/browse/HBASE-2037 that can create a new table with index, but can I add index in an existing table? Any code examples? Thanks. Shen
    Feb 26, 2010 at 2:18 am
    Mar 18, 2010 at 9:36 pm
  • Hello, I did some testing to figure out which compression algo I should use for my HBase tables. I thought that LZO was a good candidate, but it appears that it is the worst one. I use one table ...
    Vincent Barat
    Feb 23, 2010 at 6:39 pm
    Mar 2, 2010 at 10:43 am
  • Hi, When trying to restart HBase, I'm getting the following in the regionservers: http://pastebin.com/GPw6yt2G and cannot get HBase fully restarted. I'm on the latest version 0.20.3. Where would I ...
    Bluemetrix Development
    Feb 23, 2010 at 3:35 pm
    Mar 1, 2010 at 9:18 pm
  • Hi, I'm currently trying to run a count in hbase shell and it crashes right towards the end. This in turn seems to crash hbase or at least causes the regionservers to become unavailable. Here's the ...
    Bluemetrix Development
    Feb 16, 2010 at 7:08 pm
    Mar 3, 2010 at 11:30 pm
  • Hi all, Trend Micro would like to host HUG9 at our offices in Cupertino: ...
    Andrew Purtell
    Feb 11, 2010 at 8:44 pm
    Feb 23, 2010 at 12:46 am
  • Hello all, we're looking at using HBase for the backend datastore for a large-scale site where many Tomcat app servers would access HBase in realtime. Our data access pattern is not completely ...
    Brad McCarty
    Feb 17, 2010 at 3:29 am
    Feb 18, 2010 at 7:54 pm
  • Hi, I came across this problem recently. I tried to query a table with rowkey '3ec1aa5a50307aed20a222af92a53ad1'. The query hits on a region with startkey '3d9d1175a7f8bf861bf75638bb1eb231', and ...
    Zhenyu Zhong
    Feb 17, 2010 at 4:33 pm
    Mar 6, 2010 at 9:12 pm
  • I've been loading some large data sets over the last week or so, but keep running into failures between 4 and 15 hours into the process. I've wiped HBase and/or HDFS a few times hoping that would ...
    Rod Cope
    Feb 20, 2010 at 4:14 pm
    Feb 22, 2010 at 5:25 pm
  • I'm looking for some examples for reading data out of hbase for use with mapreduce and for inserting data into hbase from a mapreduce job. I've seen the example shipped with hbase, and, well, it ...
    David Hawthorne
    Feb 11, 2010 at 8:14 pm
    Feb 11, 2010 at 11:46 pm
  • Hi, I have noticed that the performance of the full table scan (table contains about 5M rows) is extremely slow in our case. We are running 0.20.2, r834515 and it takes about 3 min / 5000 rows to ...
    Boris Aleksandrovsky
    Feb 8, 2010 at 10:43 pm
    Feb 9, 2010 at 12:19 am
  • Hi all, I wrote yesterday evening (of my time :)) about missing file and today i did a restart of whole hbase and it looks like problem disappeared. According to my taste it looks like either client ...
    Michał Podsiadłowski
    Feb 4, 2010 at 9:08 am
    Feb 5, 2010 at 10:30 am
  • Hello, I have a program loading formatted data into HTable, but it crashed when connecting to the quorum server because of wrong configuration; it seems that it loads configuration from hbase-default.xml, ...
    Steven zhuang
    Feb 25, 2010 at 2:46 am
    Feb 25, 2010 at 4:18 am
  • Wondering if there is a compelling reason to go one way or another for a Hadoop/HBase cluster on EC2 EBS volume. Host OS: Ubuntu 9.04 x64. Thanks, Sujee http://sujee.net
    Sujee Maniyam
    Feb 20, 2010 at 7:13 am
    Feb 20, 2010 at 10:48 pm
  • Quick question about data local vs rack local tasks when running map reduce jobs against hbase. I've just run a job against a table that was split into 1,645 tasks. Looking at the job page it's ...
    Bryan McCormick
    Feb 17, 2010 at 8:11 am
    Feb 19, 2010 at 5:09 am
  • Hi Guys, I am having another problem with HBase that is probably related to the problems I was emailing you about earlier this year. I have finally had a chance to at least try one of the suggestions ...
    Seraph Imalia
    Feb 8, 2010 at 1:36 pm
    Feb 17, 2010 at 9:09 am
  • I have a MapReduce function whose job is to process a MySQL database and give us some output. For this purpose, I have created the MapReduce function and have used the DBInputFormat, but I'm ...
    Gaurav Vashishth
    Feb 12, 2010 at 12:32 pm
    Feb 13, 2010 at 7:38 am
  • Hi, I'm wondering if it's possible to export all data from one HBase cluster and import it into another. We have a lot of data that we've imported into our staging HBase environment, and rather than ...
    James Baldassari
    Feb 10, 2010 at 2:48 am
    Feb 10, 2010 at 5:27 pm
  • Ok, so I posted this to the wrong list. (Hadoop vs HBase) So I apologize for any duplication... Here's the skinny. I've got secondary indexes up and running on our Sandbox machines running Cloudera's ...
    Michael Segel
    Feb 16, 2010 at 9:54 pm
    Apr 23, 2010 at 6:15 pm
  • Hi, I'm pretty new to Hbase so bear with me - I am porting over a system for storage of timeseries data to both HBase and Cassandra. This is pretty straightforward but it opens a ton of questions ...
    Eric Maland
    Feb 15, 2010 at 5:32 am
    Mar 4, 2010 at 6:15 am
  • Greetings, While browsing a table, I noticed a strange thing in a couple of regions. I have two regions with the same start_key, and two others with the same end_key. Here is an extract of my regions list ...
    Manuel de Ferran
    Feb 25, 2010 at 5:02 pm
    Mar 2, 2010 at 8:45 pm
  • Setting up a development cluster. Using Cloudera's latest release which has HBase-20.3. We have 3 nodes running ZooKeeper which is managed by HBase. We have a quorum set up. One of the developers ran ...
    Michael Segel
    Feb 26, 2010 at 10:15 pm
    Feb 27, 2010 at 12:35 am
  • Hello everyone, Does anyone know if HBase + Hadoop perform well on Solaris 10 ? Also are there any benchmarking results for HBase 0.19 available ? On the wiki page on Performance Evaluation I found ...
    Adrian Popescu
    Feb 17, 2010 at 4:34 pm
    Feb 27, 2010 at 12:22 am
  • Hi, We have 20 1U servers (4 core, 12G ram) as a cluster, 3 zookeepers, 10 region servers. My program is to read hbase table data one by one. Read - hbaseTable1 hbaseTable2 hbaseTable3 . . . ...
    Feb 25, 2010 at 10:00 am
    Feb 26, 2010 at 9:55 am
  • Hello Currently what is the best way to communicate with hbase outside of java? I think thrift appears to be the best , is it still very much part of Hbase 0.20.3? Thank you saptarshi
    Saptarshi Guha
    Feb 9, 2010 at 4:30 pm
    Feb 9, 2010 at 7:00 pm
  • Hello, I have used Hbase before, but many months back. So this is basic beginner's questions. The HBase thrift api for get getRow is list<TCell get( /** name of table */ 1:Text tableName, /** row key ...
    Saptarshi Guha
    Feb 9, 2010 at 2:57 pm
    Feb 9, 2010 at 4:45 pm
  • Hi, I was reading http://www.slideshare.net/schubertzhang/hbase-0200-performance-evaluation. Could someone explain what has changed to improve the random reads to sequential reads ratio from 1:1 to ...
    Gabriel Ki
    Feb 3, 2010 at 11:58 pm
    Feb 6, 2010 at 10:35 pm
  • Hello, for example, if we have a table whose rows have many columns (10000 or more), how will this data be partitioned? Will a row be split across several region servers?
    Ruslan usifov
    Feb 20, 2010 at 7:53 pm
    Feb 21, 2010 at 8:31 pm
  • Greetings, we're running a small HBase 0.20.2 cluster composed of 3 Region Servers. We would like to merge a couple of regions (PIG prefers fewer regions). So we're trying to use the merge tool the ...
    Manuel de Ferran
    Feb 18, 2010 at 4:12 pm
    Feb 19, 2010 at 8:51 am
  • Hi, I have a table whose rowkey is composed of userid + timestamp. I need to figure out the 'top-100' users. One approach is running a scanner and keeping a hashmap of user-count in memory. Wondering if ...
    Sujee Maniyam
    Feb 14, 2010 at 8:57 am
    Feb 15, 2010 at 3:59 am
  • Hi, I would like a filter that accepts rows as long as the first X bytes of the row key are less than or equal to a certain byte array. The RowFilter combined with the BinaryComparator comes close, ...
    Bruno Dumon
    Feb 10, 2010 at 3:16 pm
    Feb 12, 2010 at 8:11 pm
  • I noticed that the wiki page for the Stargate contrib stuff at: http://wiki.apache.org/hadoop/Hbase/Stargate might need some updating. I cranked up the server (after moving the libs) and I couldn't ...
    Patterson, Josh
    Feb 4, 2010 at 9:51 pm
    Feb 6, 2010 at 10:38 pm
  • Hi, I'm running MapReduce with TableOutputFormat, and seem to have similar trouble to HBASE-1603. After digging into the log, I realized that the time when my reducer failed might possibly be related to ...
    Victor Hsieh
    Feb 4, 2010 at 2:38 am
    Feb 5, 2010 at 4:52 pm
  • Hi, Our cluster with 3 zookeepers, 10 region servers, 19 data nodes. Each machine has 4 core cpu, 12G ram. There are 1322 regions in our cluster now. We fired up to 3000 hbase client in parallel to ...
    Feb 3, 2010 at 8:27 am
    Feb 4, 2010 at 2:06 am
  • Dear all, I have been experiencing an issue that one of my HBase table, which contains 1800+ regions, sometimes cannot be enabled. Sometimes I tried to restart the HBase in order to let this big ...
    Zhenyu Zhong
    Feb 1, 2010 at 9:45 pm
    Feb 2, 2010 at 6:00 pm
  • We generated a python Thrift client and wrote a simple wrapper around it, but as we move this into production, we need a couple of enhancements: 1. Intelligently handle all the different types of ...
    Daniel Einspanjer
    Feb 25, 2010 at 6:59 pm
    Mar 1, 2010 at 10:39 am
  • Hbase version : 0.20.3, r902334 EC2 c1.xlarge, 5 machine cluster (1 + 4 I have a couple of tables with 300M rows. Truncate command hangs... hbase shell truncate 'tablename' Truncating ...
    Sujee Maniyam
    Feb 26, 2010 at 12:05 am
    Feb 27, 2010 at 12:07 am
  • Hello, everyone. I found that we can leave hbase.master unset in hbase-site.xml and still have an HBase cluster running OK. Is there any mechanism like auto-election that makes one node the master? -- ...
    Steven zhuang
    Feb 24, 2010 at 6:07 am
    Feb 24, 2010 at 6:45 am
  • Hi, I wrote a test code as below: (TableInputFormat using "new Scan(scan)") String tablename = "test8-1"; HBaseConfiguration config = new HBaseConfiguration(); HTable table = new HTable(config, ...
    Feb 22, 2010 at 7:43 am
    Feb 23, 2010 at 2:30 am
  • Have the slides from January's HBase User Group at StumbleUpon been posted anywhere? There were some really good talks, and I'd definitely like to grab Ryan's slides (and everyone else's) if ...
    Michael Dalton
    Feb 22, 2010 at 9:19 pm
    Feb 22, 2010 at 9:50 pm
  • Sorry if this is a dup for those of you following me on twitter (http://twitter.com/phunt) but I wanted to let you know that twitter (the company) has contributed a Ruby client binding for ZooKeeper. ...
    Patrick Hunt
    Feb 19, 2010 at 6:24 pm
    Feb 20, 2010 at 1:22 am
  • We are running HBase 20.2 and Hadoop 20.1 It looks like the hadoop cluster crashed. When I brought it back up, hbase was missing a table, but I could still see it in HDFS.... ...
    Ananth Sarathy
    Feb 19, 2010 at 8:57 pm
    Feb 19, 2010 at 9:38 pm
  • Quick question about the location of the PID files. I recently went to upgrade my cluster to the latest 0.20.3 and found that when I issued the ./hbase-stop.sh command that it was reporting that no ...
    Bryan McCormick
    Feb 17, 2010 at 8:29 am
    Feb 17, 2010 at 3:51 pm
  • Hello, I've seen the following in a few HBase presentations now: * What to store in HBase? * Maybe not your raw log data... * ...but the results of processing it with Hadoop e.g. slides 26 & 27: ...
    Otis Gospodnetic
    Feb 16, 2010 at 12:45 am
    Feb 16, 2010 at 6:18 am
  • I can't seem to find the Stargate REST server log4j.properties file. How would I find or set that up? Josh Patterson TVA
    Patterson, Josh
    Feb 12, 2010 at 4:01 pm
    Feb 12, 2010 at 10:53 pm
  • Hi, I have an HBase table with 3 column families and some number of rows stored in it. I want to ask how I can search values from the table (like select name from employee where age='35': query ...
    Muhammad Mudassar
    Feb 12, 2010 at 1:38 pm
    Feb 12, 2010 at 8:43 pm
  • Hi, Does HBase 0.20.3 support multiget? How can I use it? Any sample code would be great! We have over 1000 regions' data in HBase 0.20.2; We want to upgrade it to 0.20.3. Any quick ways? or we need ...
    Feb 10, 2010 at 1:32 am
    Feb 11, 2010 at 8:05 pm
  • Hi guys, We have a table which stored previously uncompressed data which we changed to store GZ-compressed data. We performed a compaction on that table which shrank its size three-fold. However, I ...
    Boris Aleksandrovsky
    Feb 11, 2010 at 5:59 pm
    Feb 11, 2010 at 6:24 pm
  • Hi, If I alter the table and change the IN_MEMORY option from false to true, do I need to restart HBase for the change to take effect? I am presuming yes, please let me know if not. -- Thanks, Boris ...
    Boris Aleksandrovsky
    Feb 10, 2010 at 11:05 pm
    Feb 10, 2010 at 11:07 pm
Group Overview
group: user @
categories: hbase, hadoop

80 users for February 2010

  • Stack: 49 posts
  • Jean-Daniel Cryans: 41 posts
  • Ryan Rawson: 19 posts
  • Andrew Purtell: 16 posts
  • Dan Washusen: 16 posts
  • James Baldassari: 16 posts
  • Bluemetrix Development: 12 posts
  • Michał Podsiadłowski: 12 posts
  • Y_823910: 10 posts
  • Boris Aleksandrovsky: 9 posts
  • Sujee Maniyam: 8 posts
  • Michael Segel: 7 posts
  • Steven zhuang: 6 posts
  • Ted Yu: 6 posts
  • Bradford Stephens: 5 posts
  • Patrick Hunt: 5 posts
  • Saptarshi Guha: 5 posts
  • Vincent Barat: 5 posts
  • Zhenyu Zhong: 5 posts
  • Adrian Popescu: 4 posts