Hi all -
Impala's documentation states that "For clusters running production
workloads, you might load-balance between the nodes by submitting each
query to a different Impala daemon in round-robin style, using the JDBC or
ODBC interfaces."

Have any of you successfully implemented such a load balancer?

I'm asking this because most of our Impala queries are run using Hue, and
Hue needs to be configured to connect to a specific Impala daemon. We worry
that this daemon may become overloaded as more and more users concurrently
run queries.

Thanks!
Noam

To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].

  • Alan Choi at Jan 15, 2014 at 6:00 pm
    Hi Noam,

    Have you tried using HAProxy as the load balancer? We've documented how to
    configure it:

    http://www.cloudera.com/content/cloudera-content/cloudera-docs/Impala/latest/Installing-and-Using-Impala/ciiu_admin.html?scroll=tut_proxy_unique_2

    Thanks,
    Alan

  • Noam Cohen at Jan 15, 2014 at 7:26 pm
    Thanks Alan! I'll give it a try.


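For readers who control their own client code, the round-robin idea quoted from the Impala documentation in the original question can be sketched roughly as follows. This is only an illustration, not the method from the linked Cloudera page: the hostnames are hypothetical, the port 21050 is assumed to be the daemons' JDBC-facing port, and `connect` stands in for whatever driver call you actually use. It does not help Hue, which still points at a single address; that case is what a proxy such as HAProxy in front of the daemons is meant to cover.

```python
import itertools
import threading


class RoundRobinHosts:
    """Hand out (host, port) pairs in strict rotation; safe to share across threads."""

    def __init__(self, hosts, port=21050):  # 21050 is an assumed port, adjust to your cluster
        hosts = list(hosts)
        if not hosts:
            raise ValueError("at least one host is required")
        self._port = port
        self._cycle = itertools.cycle(hosts)
        self._lock = threading.Lock()

    def next_endpoint(self):
        # The lock keeps two threads from advancing the iterator at the same time.
        with self._lock:
            return next(self._cycle), self._port


def run_query(pool, connect, sql):
    """Run one query against the next daemon in the rotation.

    `connect` is a caller-supplied callable taking (host, port) and returning
    a DB-API style connection, so the sketch stays independent of any driver.
    """
    host, port = pool.next_endpoint()
    conn = connect(host, port)
    try:
        cur = conn.cursor()
        cur.execute(sql)
        return cur.fetchall()
    finally:
        conn.close()


# Hypothetical daemon hostnames, for illustration only.
IMPALAD_HOSTS = [
    "impalad-1.example.com",
    "impalad-2.example.com",
    "impalad-3.example.com",
]
pool = RoundRobinHosts(IMPALAD_HOSTS)
```

Each call to run_query(pool, connect, sql) opens a connection to the next host in the list, so consecutive queries land on different daemons. A plain rotation like this does not notice a daemon that is down; a proxy with health checks handles that case better.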


Discussion Overview
group: impala-user
categories: hadoop
posted: Jan 15, '14 at 5:56p
active: Jan 15, '14 at 7:26p
posts: 3
users: 2
website: cloudera.com
irc: #hadoop