FAQ
Hi all,

I'm another just getting started. I've created an HBase Tap (with maple
0.2.0) to a table. The table has a column qualifier of a string giving a
language (e.g. "en", "nl", "fr" etc).

I want to find the number of distinct language groups so my query is:
(?<- (stdout) [?count] (hbase-tap ?language)
(c/distinct-count ?language :> ?count))

However, I'm getting an error of: local step failed
[Thrown class cascading.flow.FlowException]
Backtrace:
0: FlowStepJob.java:191
cascading.flow.planner.FlowStepJob.blockOnJob
1: FlowStepJob.java:137
cascading.flow.planner.FlowStepJob.start
2: FlowStepJob.java:122
cascading.flow.planner.FlowStepJob.call
...

I'm stumped. Has anyone got any ideas?

Thanks,

Simon

Search Discussions

  • Sam Ritchie at Jun 20, 2012 at 11:52 pm
    Hey Simon, can you please send the entire stack trace? The good stuff is
    usually closer to the bottom.
    On Wed, Jun 20, 2012 at 7:51 AM, Simon Holgate wrote:

    Hi all,

    I'm another just getting started. I've created an HBase Tap (with maple
    0.2.0) to a table. The table has a column qualifier of a string giving a
    language (e.g. "en", "nl", "fr" etc).

    I want to find the number of distinct language groups so my query is:
    (?<- (stdout) [?count] (hbase-tap ?language)
    (c/distinct-count ?language :> ?count))

    However, I'm getting an error of: local step failed
    [Thrown class cascading.flow.FlowException]
    Backtrace:
    0: FlowStepJob.java:191
    cascading.flow.planner.FlowStepJob.blockOnJob
    1: FlowStepJob.java:137
    cascading.flow.planner.FlowStepJob.start
    2: FlowStepJob.java:122
    cascading.flow.planner.FlowStepJob.call
    ...

    I'm stumped. Has anyone got any ideas?

    Thanks,

    Simon


    --
    Sam Ritchie, Twitter Inc
    703.662.1337
    @sritchie09

    (Too brief? Here's why! http://emailcharter.org)
  • Simon Holgate at Jun 21, 2012 at 2:23 pm
    Hi Sam and all,

    I thought I should back up a little and see if I could get something more
    basic to work i.e. the cascalog examples.

    I created an HBase table:

    create 'age-table', 'cf'

    Then populated it:
    put 'age-table', 'alice', 'cf:age', 28
    put 'age-table', 'bob', 'cf:age', 33 etc...

    hbase(main):014:0> scan 'age-table'
    ROW COLUMN+CELL

    alice column=cf:age, timestamp=1340286719365, value=28

    bob column=cf:age, timestamp=1340286724687, value=33

    ...

    I then ran the code that's in this gist: https://gist.github.com/2965839
    and I get a job that runs forever and produces an immense amount of
    repetitive [INFO] lines. I've attached as much of my screen buffer as I
    could manage but there was much, much more.

    I'm probably being exceptionally dumb but any pointers would be gratefully
    received.

    I'm running cascalog 1.9.0, maple 0.2.0 and (possibly an issue)
    HBase 0.92.1 and Hadoop 1.0.3.

    Thanks,

    Simon
  • Sam Ritchie at Jun 25, 2012 at 8:44 pm
    Simon, this is the issue I referenced in an earlier post with recursive
    require calls. Try wrapping this in a function and calling that function to
    exec the job:

    (let [age-table (hbase-tap "age-table" "?person" "cf" "?age")]
    (?<- (stdout) [?p ?a] (age-table ?p ?age) (as-string ?age :> ?a)))

    On Thu, Jun 21, 2012 at 7:23 AM, Simon Holgate wrote:

    Hi Sam and all,

    I thought I should back up a little and see if I could get something more
    basic to work i.e. the cascalog examples.

    I created an HBase table:

    create 'age-table', 'cf'

    Then populated it:
    put 'age-table', 'alice', 'cf:age', 28
    put 'age-table', 'bob', 'cf:age', 33 etc...

    hbase(main):014:0> scan 'age-table'
    ROW COLUMN+CELL

    alice column=cf:age, timestamp=1340286719365, value=28

    bob column=cf:age, timestamp=1340286724687, value=33

    ...

    I then ran the code that's in this gist: https://gist.github.com/2965839
    and I get a job that runs forever and produces an immense amount of
    repetitive [INFO] lines. I've attached as much of my screen buffer as I
    could manage but there was much, much more.

    I'm probably being exceptionally dumb but any pointers would be gratefully
    received.

    I'm running cascalog 1.9.0, maple 0.2.0 and (possibly an issue)
    HBase 0.92.1 and Hadoop 1.0.3.

    Thanks,

    Simon


    --
    Sam Ritchie, Twitter Inc
    703.662.1337
    @sritchie09

    (Too brief? Here's why! http://emailcharter.org)
  • Simon Holgate at Jun 26, 2012 at 8:43 am
    Hi Sam,

    yes, thanks. I'd realised that after reading your other post. All is
    working well now.

    Cheers,

    S.
    On Monday, 25 June 2012 21:44:32 UTC+1, Sam Ritchie wrote:

    Simon, this is the issue I referenced in an earlier post with recursive
    require calls. Try wrapping this in a function and calling that function to
    exec the job:

    (let [age-table (hbase-tap "age-table" "?person" "cf" "?age")]

    (?<- (stdout) [?p ?a] (age-table ?p ?age) (as-string ?age :> ?a)))

    On Thu, Jun 21, 2012 at 7:23 AM, Simon Holgate wrote:

    Hi Sam and all,

    I thought I should back up a little and see if I could get something more
    basic to work i.e. the cascalog examples.

    I created an HBase table:

    create 'age-table', 'cf'

    Then populated it:
    put 'age-table', 'alice', 'cf:age', 28
    put 'age-table', 'bob', 'cf:age', 33 etc...

    hbase(main):014:0> scan 'age-table'
    ROW COLUMN+CELL

    alice column=cf:age, timestamp=1340286719365, value=28

    bob column=cf:age, timestamp=1340286724687, value=33

    ...

    I then ran the code that's in this gist: https://gist.github.com/2965839
    and I get a job that runs forever and produces an immense amount of
    repetitive [INFO] lines. I've attached as much of my screen buffer as I
    could manage but there was much, much more.

    I'm probably being exceptionally dumb but any pointers would be
    gratefully received.

    I'm running cascalog 1.9.0, maple 0.2.0 and (possibly an issue)
    HBase 0.92.1 and Hadoop 1.0.3.

    Thanks,

    Simon


    --
    Sam Ritchie, Twitter Inc
    703.662.1337
    @sritchie09

    (Too brief? Here's why! http://emailcharter.org)
  • Simon Holgate at Jun 22, 2012 at 2:20 pm
    Answering myself, a reply to a related question solves this issue:
    https://groups.google.com/forum/?fromgroups#!topic/cascalog-user/CiUpCk4075M

    Simon
    On Wednesday, 20 June 2012 15:51:46 UTC+1, Simon Holgate wrote:

    Hi all,

    I'm another just getting started. I've created an HBase Tap (with maple
    0.2.0) to a table. The table has a column qualifier of a string giving a
    language (e.g. "en", "nl", "fr" etc).

    I want to find the number of distinct language groups so my query is:
    (?<- (stdout) [?count] (hbase-tap ?language)
    (c/distinct-count ?language :> ?count))

    However, I'm getting an error of: local step failed
    [Thrown class cascading.flow.FlowException]
    Backtrace:
    0: FlowStepJob.java:191
    cascading.flow.planner.FlowStepJob.blockOnJob
    1: FlowStepJob.java:137
    cascading.flow.planner.FlowStepJob.start
    2: FlowStepJob.java:122
    cascading.flow.planner.FlowStepJob.call
    ...

    I'm stumped. Has anyone got any ideas?

    Thanks,

    Simon

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcascalog-user @
categoriesclojure, hadoop
postedJun 20, '12 at 10:35p
activeJun 26, '12 at 8:43a
posts6
users2
websiteclojure.org
irc#clojure

2 users in discussion

Simon Holgate: 4 posts Sam Ritchie: 2 posts

People

Translate

site design / logo © 2022 Grokbase