|
Alex Minnaar |
at Jun 3, 2013 at 7:40 pm
|
⇧ |
| |
The original text file was uploaded to hdfs via the Hue file browser. Then
I used the 'create a new table from a file' link in the Metastore Manager
and followed the appropriate steps. When I look at the sample for the
table, it looks correct (all rows are separated correctly). I don't know
how I would upload the original text file via command line because I am
running Hadoop ontop of Amazon ec2 so I would first have to get the file
into ec2 somehow.
On Monday, June 3, 2013 3:25:52 PM UTC-4, Ricky Saltzer wrote:Could you provide a little more information on this table? For example,
how was this table created and populated? Usually when you see all your
data in one column, it's because the field delimiter is incorrectly set and
so Impala (or Hive) is not able to correctly parse the columns.
On Mon, Jun 3, 2013 at 3:16 PM, Alex Minnaar <minna...@gmail.com<javascript:>
wrote:
Yeah, its taking forever, I don't think I can run this query. Any other
suggestions?
On Monday, June 3, 2013 2:57:31 PM UTC-4, Ricky Saltzer wrote:Hey Alex -
I should have also probably asked if you can query this table in Hive
okay...
Try:
$ hive
$ select * from <table_name> LIMIT 10;
How does the output look?
On Mon, Jun 3, 2013 at 2:55 PM, Alex Minnaar wrote:The result of that is
# col_name data_type comment
one string None
two string None
three string None
four string None
five string None
# Detailed Table Information
Database: default
Owner: hdfs
CreateTime: Mon Jun 03 15:38:28 UTC 2013
LastAccessTime: UNKNOWN
Protect Mode: None
Retention: 0
Location: hdfs://ip-10-245-112-238.us-**
west-2.compute.internal:8020/**user/hive/warehouse/text_try
Table Type: MANAGED_TABLE
Table Parameters:
numFiles 1
numPartitions 0
numRows 0
rawDataSize 0
totalSize 48047371
transient_lastDdlTime 1370273911
# Storage Information
SerDe Library: org.apache.hadoop.hive.**
serde2.lazy.LazySimpleSerDe
InputFormat: org.apache.hadoop.mapred.**TextInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.**
HiveIgnoreKeyTextOutputFormat
Compressed: No
Num Buckets: -1
Bucket Columns: []
Sort Columns: []
Storage Desc Params:
field.delim \u0001
serialization.format \u0001
Time taken: 0.409 seconds
On Monday, June 3, 2013 2:42:02 PM UTC-4, Ricky Saltzer wrote:Hey Alex -
Can you open up a Hive shell (*$ hive) *and perform an extended
describe on the table?
$ hive
$ describe formatted <table_name>
or
$ describe extended <table_name>
The formatted table description is nicer to read...
Thanks!
Ricky
On Mon, Jun 3, 2013 at 2:40 PM, Alex Minnaar wrote:Ok disregard my last response. I was able to run it in the impala
shell and got the same result. All data was in the first column, the rest
were empty.
On Monday, June 3, 2013 2:16:57 PM UTC-4, Udai wrote:Hi Alex,
What do you see when you run the query on Impala shell?
Thanks,
Udai
On Mon, Jun 3, 2013 at 11:01 AM, Alex Minnaar wrote:
I have a table in my default database with 5 columns. When I look
at the sample in the Hue Metastore Manager it looks fine. However, when I
do a "select * from table" in the Impala query editor, the result has all
of the data squished into the first column and nothing in the remaining 4.
Can anyone tell me how to fix this?
Thanks,
Alex
--
Ricky Saltzer
Tools Developer
http://www.cloudera.com --
Ricky Saltzer
Tools Developer
http://www.cloudera.com --
Ricky Saltzer
Tools Developer
http://www.cloudera.com