we are currently exploring Impala as engine for our near-real-time
analytical backend. And Impala really looks great.
There are, however, two things where I am not entirely sure if Impala
is a good fit for us:
1) We are expecting a lot of new data each day (max 200GB)
2) We expect some tiny amounts of historical data to change from time to
Is Impala able to cope with that amount of changing data well?
Many thanks in advance!