Grokbase Groups HBase user April 2009

Search Discussions

60 discussions - 357 posts

  • Hi, I have a 20 node cluster on ec2(small instance).... i have a set of tables which store huge amount of data (tried wid 10,000 rows... more to be added).... but during my map reduce jobs, some of ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 7, 2009 at 9:46 am
    Apr 14, 2009 at 5:50 am
  • Hi, i have a map reduce program with which i read from a hbase table. In my map program i check if the column value of a is xxx, if yes then continue with processing else skip it. however if my table ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 7, 2009 at 9:27 am
    Jul 3, 2009 at 5:27 am
  • Hi, I have been trying crawling data using MapReduce on HBase. Here is the scenario: 1) I have a Fetch list which has all the permalinks to be fetched .They are stored in a PermalinkTable 2) A ...
    Ninad RautNinad Raut
    Apr 20, 2009 at 9:49 am
    Apr 27, 2009 at 12:15 am
  • Hi I am new HBase user. I have some doubts regards functionality of HBase. I am working on HBase, things are going fine but I am not clear how are things happening. Please help me by answering these ...
    Puri, AseemPuri, Aseem
    Apr 13, 2009 at 2:12 pm
    Apr 14, 2009 at 8:56 am
  • Hi When I am implementing rowcount program it is throwing a exception: 09/04/23 17:35:50 INFO mapred.TableInputFormatBase: split: 0- slave:, 09/04/23 17:35:55 INFO mapred.JobClient: Running job: ...
    Puri, AseemPuri, Aseem
    Apr 23, 2009 at 12:30 pm
    Aug 19, 2009 at 3:11 pm
  • Hello folks, I have just tried using Ryan's doCommit() method for my bulk upload into one Hbase table. No luck. I still start to get errors around row 160,000. On-screen, the program starts to ...
    Taylor, Ronald CTaylor, Ronald C
    Apr 3, 2009 at 11:36 pm
    Apr 9, 2009 at 8:42 pm
  • Greetings, On a fresh installation of Hbase .20-dev, I go into the shell and type "create 'metadata'", just trying to make an empty table. This line pops up: 09/04/30 11:05:29 INFO ...
    Bradford StephensBradford Stephens
    Apr 30, 2009 at 6:18 pm
    May 1, 2009 at 8:54 pm
  • Greetings, I am prototyping a financial time series database on top of HBase and trying to head my head around what a good design would look like. As I understand it, I have rows, column families, ...
    Bradford CrossBradford Cross
    Apr 1, 2009 at 2:25 am
    Apr 5, 2009 at 9:39 am
  • Hi I am a new HBase user. I have a table in which I am storing data. Can anybody tell me how should I search values in table with help of map reduce. If somebody have some code snippet then please ...
    Puri, AseemPuri, Aseem
    Apr 22, 2009 at 3:22 pm
    May 12, 2009 at 3:21 pm
  • Hi, I have a table with N records, now i want to run a map reduce job with 4 maps and 0 reduces. is there a way i can create my own custom input split so that i can send 'n' records to each map?? if ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 22, 2009 at 9:20 am
    Apr 23, 2009 at 4:36 pm
  • Hi, I wanted to find do some calculation on a huge table which will have millions of rows in it.... Whats the optimal way to do it? should i just pass this to a map reduce job and it will take care ...
    Apr 27, 2009 at 8:19 pm
    Apr 29, 2009 at 1:48 am
  • Hi, As there is no easy way to query HBase, can pig be used to query HBase tables? If so, can any one give me an example as to how to use it.... Regards, Ninad.
    Ninad RautNinad Raut
    Apr 21, 2009 at 3:18 am
    Apr 22, 2009 at 4:38 pm
  • hi coleagues, We are doing data inserting on 32 nodes hbase cluster using mapreduce framework recently, but the operation always gets failed because of regionserver exceptions. We issued 4 map task ...
    11 Nov.11 Nov.
    Apr 13, 2009 at 12:29 pm
    Apr 16, 2009 at 5:02 am
  • Say I want to scan down a table that looks like this: Col A Col B row1 x x row2 x row3 x x Normally a scanner would return all three rows, but what's the best way to scan so that only row1 and row3 ...
    Vincent Poon (vinpoon)Vincent Poon (vinpoon)
    Apr 9, 2009 at 9:45 pm
    Apr 12, 2009 at 8:32 am
  • I, like many others, am having difficulty getting a mapred job that bulk imports data into an HBase table to run successfully to completion. At this time, rather than get into specifics of my ...
    Stuart WhiteStuart White
    Apr 2, 2009 at 8:28 pm
    Apr 5, 2009 at 7:46 am
  • Hi, In our system we have some fast growing tables and some slow growing tables (as in every system). Some slow growing tables only have 100s of rows and they grow at a very slow rate - e.g. users in ...
    Vaibhav PuranikVaibhav Puranik
    Apr 2, 2009 at 11:16 pm
    Apr 3, 2009 at 11:48 pm
  • Hi, I saw that function on the web UI but it seems not implemented yet. public void modifyTable(final byte[] tableName, int op, Writable[] args) switch (op) { ... case MODIFY_TABLE_SPLIT: case ...
    Edward J. YoonEdward J. Yoon
    Apr 29, 2009 at 7:53 am
    Apr 30, 2009 at 1:11 am
  • Hi all, I'd like to replicate a large dataset from a relational database into HBase for better throughput of MapReduce jobs. Has anyone had success replicating from a relational database (in my case ...
    Brian ForneyBrian Forney
    Apr 17, 2009 at 10:30 pm
    Apr 19, 2009 at 9:36 pm
  • Hi, We are doing some feasibility study of HBase and have questions. 1. Once we have build tables on certain number node and decide to add more nodes, how it is working? I know that it is not hbase, ...
    Jae JooJae Joo
    Apr 16, 2009 at 9:31 pm
    Apr 17, 2009 at 12:47 am
  • Hi, I am using hbase-0.19 on 20 node ec2 cluster. I have a map-reduce program which performs some analysis on each row. when i process about 17k rows in ec2 cluster, after performing 65%, my job ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 8, 2009 at 8:34 am
    Apr 8, 2009 at 12:22 pm
  • So far, few if any of the schema designs I've come across have really talked about using the timestamp field and HBase's automatic deletion of old cells in a smart way. What is the timestamp ...
    Wes ChowWes Chow
    Apr 1, 2009 at 1:56 pm
    Apr 3, 2009 at 5:04 pm
  • Hello, I'm currently studying Hbase/BigTable . I kind of understand their conceptual data structure, but I don't get how it's physically implemented, especially how it's column oriented. I referenced ...
    Yamada hiroyukiYamada hiroyuki
    Apr 2, 2009 at 6:13 am
    Apr 3, 2009 at 12:00 am
  • Is Hbase' Connection manager already taking care of the connection pooling? Does it make sense to Pool HTable instances in a multi-threaded (web like) usage scenario? Thx, Bob
    Bob SchulzeBob Schulze
    Apr 23, 2009 at 7:34 am
    Apr 24, 2009 at 8:42 am
  • Hi, I have a scanario, i have a table... which has 2 be read into say 'n' maps. so now in each map... i need 2 access say 'm' records at once... so that i can spawn them using threads.. to increase ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 21, 2009 at 10:19 am
    Apr 22, 2009 at 5:05 pm
  • Hi, Its been several days since we have been trying to stabilize hadoop/hbase on ec2 cluster. but failed to do so. We still come across frequent region server fails, scanner timeout exceptions and OS ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 17, 2009 at 4:40 pm
    Apr 21, 2009 at 12:38 pm
  • Hi, I've been using HBase and now I'm looking at Cassandra. What's particularly interesting about Cassandra is its typed data model. Apparently it involves JSON, but what matters the most to me is ...
    Tom NicholsTom Nichols
    Apr 17, 2009 at 3:21 pm
    Apr 18, 2009 at 4:52 pm
  • I getting a bunch of WARNS WARN org.apache.hadoop.hdfs.DFSClient: NotReplicatedYetException sleeping This is only happening on the hlogs on the servers while under heave import 30K/sec on 7 server I ...
    Billy PearsonBilly Pearson
    Apr 12, 2009 at 12:02 am
    Apr 12, 2009 at 6:09 am
  • I made a table with 10,000 rows. BTW, It seems stored on a single node at this time, and requires some time for distributing. Is it right? Then, I would like to know whether distributed to each node ...
    Edward J. YoonEdward J. Yoon
    Apr 9, 2009 at 6:03 am
    Apr 11, 2009 at 11:04 am
  • Hi, I'm doing some experiments to import large datasets to Hbase using a Map job. Before posting some numbers, here is a summary of my test cluster: I have 7 regionservers and 1 master. I also run ...
    Jim TwenskyJim Twensky
    Apr 29, 2009 at 10:33 pm
    May 26, 2009 at 12:21 am
  • hi all, We are doing data inserting on hbase, and the table table only have one column family with one qualifier, which is about 20 bytes' lenth. There is another table carrying 5TB data in the same ...
    11 Nov.11 Nov.
    Apr 29, 2009 at 6:31 am
    Apr 29, 2009 at 2:15 pm
  • Hi, Is there any way to rebuild the .META. Table with information in HDFS? Long story short, the namenode ran out of disk and crashed and we couldn't have it load the file at startup (the ...
    Cosmin LeheneCosmin Lehene
    Apr 23, 2009 at 9:53 am
    Apr 28, 2009 at 8:22 am
  • Hi, I am trying to use HBase to store and process communication relation data, for example, communications between cell phones, in fact that is fit for RDBMS to store and process. I have referred the ...
    Apr 11, 2009 at 5:05 am
    Apr 15, 2009 at 2:52 pm
  • can we get someone to post the all the correct options for 'table_att' in the shell in the wiki faq or somewhere? I know there is these below but I thank their is a major compaction setting also but ...
    Billy PearsonBilly Pearson
    Apr 11, 2009 at 9:38 pm
    Apr 13, 2009 at 2:28 am
  • Hi, New question: I just tried clearing a table, going row by row, using deleteAll(rowID) on each row. This works, but takes a really long amount of time when used on ~250,000 rows (at least in ...
    Taylor, Ronald CTaylor, Ronald C
    Apr 8, 2009 at 10:10 pm
    Apr 9, 2009 at 8:11 am
  • When running MapReduce processes with HBase, is it possible to have Hadoop move the job to the machine that contains the relevant HStore? I thought I read that it does do this at some point, but I'm ...
    Wes ChowWes Chow
    Apr 1, 2009 at 2:19 pm
    Apr 1, 2009 at 5:37 pm
  • Hey list, Just a small tip for those who uses the scanners in HBase and that their processing time takes more than 2-3 seconds per row : lower the hbase.client.scanner.caching. When I wrote that ...
    Jean-Daniel CryansJean-Daniel Cryans
    Apr 17, 2009 at 1:40 am
    Apr 20, 2009 at 4:18 pm
  • I just installed another hbase system on Ubuntu 8.04 and as far as I can tell its an exact same deployment as I have on an existing system that works fine. But on this new one when I try to access ...
    Robert J BergerRobert J Berger
    Apr 17, 2009 at 11:38 pm
    Apr 18, 2009 at 12:10 am
  • Hi, I tried to list all the tables on hbase and i get the following exception: hbase(main):001:0 list NativeException: org.apache.hadoop.hbase.client.RetriesExhaustedException: Trying to contact ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 16, 2009 at 10:14 am
    Apr 17, 2009 at 12:48 pm
  • Hi, Incase we migrate from hadoop 0.19.0 and hbase 0.19.0 to hadoop 0.20.0 and hbase 0.20.0 respectively, how would it affect the existing data on hadoop dfs and hbase tables? can we migrate the data ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 16, 2009 at 10:25 am
    Apr 16, 2009 at 3:48 pm
  • Hi I am a new HBase user. I have a small table in which I am inserting some data. My row key increases as I insert data. Data is stored properly in HBase in lexicographical order. But if I stop the ...
    Puri, AseemPuri, Aseem
    Apr 10, 2009 at 10:14 am
    Apr 11, 2009 at 1:26 pm
  • Does anyone help me to run HQL? I ran bin/hbase shell, but it looks like tha it is not HQL. Jae Joo
    Jae JooJae Joo
    Apr 3, 2009 at 7:50 pm
    Apr 3, 2009 at 10:08 pm
  • Hi, all I heard from somebody on the list that the xceivers number is actually the number of worker threads of the data node process. Assuming Java uses native threads we can monitor the number of ...
    Michael DagaevMichael Dagaev
    Apr 2, 2009 at 1:30 pm
    Apr 2, 2009 at 11:43 pm
  • Hey all, I've checked out the latest trunk and built it successfully. HBase seems to have started up normally, but when I go to masternode:60010, it just 'sits and spins'. No 404 errors or anything. ...
    Bradford StephensBradford Stephens
    Apr 28, 2009 at 11:15 pm
    Apr 29, 2009 at 12:04 am
  • /////////////////////////////////////// Sorry for cross posting. ////////////////////////////////////// Hi,all Hadoop in China Salon is a free discussion forum on Hadoop related technologies and ...
    He YongqiangHe Yongqiang
    Apr 24, 2009 at 12:00 am
    Apr 24, 2009 at 1:12 am
  • Here is the MapReduce I use to randomize the lines of a file. I've omitted the imports for brevity - your IDE can fix that. Enjoy! -ryan public class Randomize { // technically text/text could be ...
    Ryan RawsonRyan Rawson
    Apr 22, 2009 at 4:53 am
    Apr 22, 2009 at 4:42 pm
  • Dear all, - I have two jobs (JobA & JobB), one JobControlerX to add JobA & JobB. - Now, I will be run 10 JobControllerX as follow: for(int i = 0; i < 10; i++) { new JobControllerX[i] start; } - ...
    Trong Tran BinhTrong Tran Binh
    Apr 21, 2009 at 1:47 pm
    Apr 21, 2009 at 3:56 pm
  • Hi, all I see a lot of errors like "ERROR org.apache.hadoop.dfs.LeaseManager: /hbase/<tablle name /compaction.dir/943239101/<column data /info/6358397599897724359 not found in lease.paths" in the ...
    Michael DagaevMichael Dagaev
    Apr 19, 2009 at 6:54 pm
    Apr 19, 2009 at 7:58 pm
  • Hi, My hbase suddenly goes down, when i check the logs, i get the following exception at master node's region server: 2009-04-15 08:37:09,158 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 15, 2009 at 10:31 am
    Apr 17, 2009 at 11:28 am
  • Hi, I was running a mapreduce job which takes data from table ContentTable, processes it, and store the results into another table. my mapreduce program had 20 maps out of which 19 maps completed ...
    Rakhi KhatwaniRakhi Khatwani
    Apr 15, 2009 at 7:45 am
    Apr 15, 2009 at 2:20 pm
  • I am looking for the way to put the values and commit later. Ex. Put 10000 rows and commoit one. It should improve the performance, but it may not working properly on Hadoop. Jae -- Sent from my ...
    Jae JooJae Joo
    Apr 13, 2009 at 10:47 pm
    Apr 13, 2009 at 10:50 pm
Group Navigation
period‹ prev | Apr 2009 | next ›
Group Overview
groupuser @
categorieshbase, hadoop

57 users for April 2009

Rakhi Khatwani: 43 posts Stack: 37 posts Ryan Rawson: 29 posts Andrew Purtell: 19 posts Puri, Aseem: 19 posts Jean-Daniel Cryans: 17 posts Lars George: 14 posts Erik Holstad: 12 posts Taylor, Ronald C: 12 posts Billy Pearson: 11 posts Amandeep Khurana: 10 posts Edward J. Yoon: 10 posts Jonathan Gray: 10 posts Bradford Stephens: 6 posts Jae Joo: 6 posts Vaibhav Puranik: 6 posts 11 Nov.: 5 posts Bradford Cross: 5 posts Derek Pappas: 5 posts Ninad Raut: 5 posts
show more