so I do not mistakenly grab too much data.
Suhail
On Wed, Mar 25, 2009 at 6:43 PM, Zheng Shao wrote:
There is a SORT BY.
You can do:
SELECT * FROM tableA SORT BY c1 DESC;
Then each partition will be sorted.
However, to get the global top 10, we will need to apply LIMIT 10 on top
of that. LIMIT 10 and SORT BY do not yet work together exactly as the user wants.
Zheng
--
Yours,
Zheng
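To illustrate the point about needing a LIMIT on top of the per-partition sort, here is a minimal Python sketch of the merge step. It does not call Hive or Thrift; the in-memory lists, the function name top_k_global, and the sample values are all hypothetical stand-ins for the sorted output of each partition. The idea is only that the first k rows of each sorted partition are enough to compute the global top k.

```python
import heapq


def top_k_global(partitions, k=10):
    """Return the k largest values across all partitions, in descending order.

    Assumption: each inner list is one partition's c1 values, already sorted
    in descending order (as SORT BY c1 DESC would leave them).
    """
    # Only the first k rows of each sorted partition can reach the global top k.
    candidates = (row for part in partitions for row in part[:k])
    return heapq.nlargest(k, candidates)


# Hypothetical per-partition results, each sorted descending.
partitions = [
    [98, 75, 40, 12],
    [99, 97, 3],
    [60, 59, 58, 57, 56],
]

top3 = top_k_global(partitions, k=3)
print(top3)
```

With this approach a client would need at most k rows from each partition rather than the full table, which is the data-volume concern raised in the original question.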
On Wed, Mar 25, 2009 at 3:23 PM, Suhail Doshi wrote:
Since Hive does not have an ORDER BY yet, what is the solution for
getting the top 10 rows based on a field without your Thrift client
getting too much data back? It seems that, unfortunately, you have to
fetch all the rows and sort them yourself, which can return too much data.
Suhail
--
http://mixpanel.com
Blog: http://blog.mixpanel.com