FAQ

Search Discussions

9 discussions - 40 posts

  • Hi, Does anyone has an example that uses Avro files? Thanks, Vitaly
    VitalipVitalip
    Dec 21, 2012 at 7:15 pm
    Dec 21, 2012 at 7:15 pm
  • My company has just started using Crunch and I'm interested in using Scrunch. I'm puzzled by the distribution, though and noticed the recent pom refactoring. Is the intention that scrunch will be ...
    Chad Urso McDanielChad Urso McDaniel
    Jul 25, 2012 at 12:31 am
    Jul 27, 2012 at 10:36 pm
  • Hi, I recently discovered crunch and it looks very promising. We use plain MR jobs heavily at work but I really like the idea of raising the abstraction level a bit while still working with Java and ...
    Matthias FriedrichMatthias Friedrich
    Jul 8, 2012 at 8:57 am
    Jul 9, 2012 at 2:06 am
  • Well, this looks like it will be pretty cool. https://github.com/cloudera/crunch/pull/42 -- Director of Data Science Cloudera Twitter: @josh_wills
    Josh WillsJosh Wills
    Jun 28, 2012 at 10:27 pm
    Jun 28, 2012 at 10:27 pm
  • Hey folks, I am meeting w/Robert this afternoon to discuss this further, but I thought it would be good to get these thoughts out on the list. I'd like to propose that we switch Crunch to always use ...
    Josh WillsJosh Wills
    Jun 18, 2012 at 5:13 pm
    Jun 18, 2012 at 5:13 pm
  • Hey Everybody, I'd like to start a discussion about using automated code review tools to improve the crunch development process. I'd I'm personally a big fan of using tools that can help improve code ...
    Robert ChuRobert Chu
    Jun 15, 2012 at 10:14 pm
    Jun 23, 2012 at 5:02 am
  • Hi everyone, Separate topic, so a separate posting. One of the functions that I find most useful in Pig is the map side join; Pig will put a file in the distributed cache, load it into memory, and do ...
    Joseph AdlerJoseph Adler
    Jun 15, 2012 at 3:47 pm
    Jun 21, 2012 at 7:09 am
  • Hi guys, I've written a CombinedFile input format for avro files in Crunch. Before sending a pull request, I was wondering if folks would find the functionality useful for other types of input ...
    Joseph AdlerJoseph Adler
    Jun 15, 2012 at 3:45 pm
    Jun 15, 2012 at 3:55 pm
  • Hi, I am writing a Distributed Collections DSL with compiler optimizations as my master thesis. I have added Crunch as a backend, since we were not happy with the performance we were getting from our ...
    Acki ChAcki Ch
    Jun 14, 2012 at 10:24 am
    Jun 18, 2012 at 12:00 pm
Group Navigation
period‹ prev | Latest | first ›
Group Overview
groupcrunch-dev @
categorieshadoop
discussions9
posts40
users11
websitecloudera.com
irc#hadoop