Hi,
I have a table which is partitioned per hour. I wanted to get a list of partitions which are less than a certain time limit. The issue that I am facing is: the result of the query is unpredictable between continuous run - sometimes it give expected result, but sometimes, the same query is giving list of all partitions without applying the date filter.
Table schema:
CREATE TABLE blah ( EVENT_TIME STRING, SEQUENCE_NUMBER string, MACHINE_NAME STRING, APP_INSTANCE_NAME STRING, LOG_TYPE STRING, MESSAGE STRING)
PARTITIONED BY (TIME_IN_HOUR STRING)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe' WITH SERDEPROPERTIES
("input.regex" = "(?m)(?s)^(\\d{4}-\\d{2}-\\d{2}T\\d{2}:\\d{2}:\\d{2}\\.\\d{3}Z) (\\d+)? ?([^ ]+) ([^ ]+) ([^ ]+) (.*$)",
"output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s")
STORED AS TEXTFILE;
Query:
hive -S -e "add jar /usr/lib/hive/lib/hive-serde-0.5.0.jar; add jar /usr/lib/hive/lib/hive_contrib.jar; select distinct time_in_hour from blah where time_in_hour < \"2010_10_17T01\";"
Note: I noticed same unpredictable behavior when using hive cli.
Any pointers or thoughts are greatly appreciated.
Thanks,
Ankita
The information contained in this email message and its attachments is intended only for the private and confidential use of the recipient(s) named above, unless the sender expressly agrees otherwise. Transmission of email over the Internet is not a secure communications medium. If you are requesting or have requested the transmittal of personal data, as defined in applicable privacy laws by means of email or in an attachment to email, you must select a more secure alternate means of transmittal that supports your obligations to protect such personal data. If the reader of this message is not the intended recipient and/or you have received this email in error, you must take no action based on the information in this email and you are hereby notified that any dissemination, misuse or copying or disclosure of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by email and delete the original message.