I just started hive today. Following instructions I did set it up, and
made it work to play with my web server log files.

I created two tables:

CREATE TABLE item_view(view_time BIGINT, ip_number STRING, session_id
STRING, session_cookie STRING, referrer_url STRING, eser_sid INT,
sale_status INT, maker_name STRING, title STRING) CLUSTERED
BY(eser_sid) SORTED BY(view_time) INTO 32 BUCKETS row format delimited
fields terminated by '\t' lines terminated BY '\n' STORED AS


CREATE TABLE item_view_raw(view_time BIGINT,log_level STRING, log_tag
STRING, ip_number STRING, session_id STRING, session_cookie STRING,
referrer_url STRING, agent_string STRING, eser_sid INT, sale_status
INT, maker_name STRING, title STRING) row format delimited fields
terminated by '\t' lines terminated BY '\n' STORED AS TEXTFILE;

then I load data into item_view_raw table:

LOAD DATA LOCAL INPATH '/tmp/prod.log' INTO TABLE item_view_raw;

whıch puts the data item_view_raw table.

this is the staging table, and i want to insert rows from it into my
item_view table so I:

FROM item_view_raw ivr INSERT OVERWRITE TABLE item_view SELECT
ivr.view_time, ivr.ip_number, ivr.session_id, ivr.session_cookie,
ivr.referrer_url, ivr.eser_sid, ivr.sale_status, ivr.maker_name,

but I get:

Total MapReduce jobs = 2
Launching Job 1 out of 2
Number of reduce tasks is set to 0 since there's no reduce operator
java.io.IOException: Call to / failed on local
exception: java.io.EOFException
at org.apache.hadoop.ipc.Client.wrapException(Client.java:775)
at org.apache.hadoop.ipc.Client.call(Client.java:743)
at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
at org.apache.hadoop.mapred.$Proxy5.getProtocolVersion(Unknown Source)
at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:429)
at org.apache.hadoop.mapred.JobClient.init(JobClient.java:423)
at org.apache.hadoop.mapred.JobClient.(ExecDriver.java:683)
at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:107)
at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55)
at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:633)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:506)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:384)
at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:138)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:197)
at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:302)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: java.io.EOFException
at java.io.DataInputStream.readInt(DataInputStream.java:375)
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:508)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446)
Job Submission failed with exception 'java.io.IOException(Call to
/ failed on local exception: java.io.EOFException)'
FAILED: Execution Error, return code 1 from

I did fiddle with it for couple of hours, then I decided to write to
this list. no matter what I tried i could not solve the problem.

Any ideas, recomendations and help greatly appreciated,

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
posts ‹ prev | 1 of 1 | next ›
Discussion Overview
groupuser @
categorieshive, hadoop
postedJan 31, '11 at 2:19p
activeJan 31, '11 at 2:19p

1 user in discussion

Cam Bazz: 1 post



site design / logo © 2022 Grokbase