FAQ
I'm using hadoop streaming and currently have these properties in my command
line:
-Dstream.map.output.field.separator=' ' \
-Dstream.num.map.output.key.fields=1 \

This works for me as my test data happens to have a space at column 14. If I
want to use a fixed length split, is there a simple cut function I could use
like undefining the separator and counting 13 bytes?
-Dstream.map.output.field.separator= \
-Dstream.num.map.output.key.fields=13 \

I have searched the forum for discussions on fixed length or splitting keys
but have not found my answer. Perhaps this is not possible, at least on the
command line?

Thanks for any help!
Kevin
--
View this message in context: http://old.nabble.com/How-do-I-split-input-key-on-fixed-length--tp31176690p31176690.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupcommon-user @
categorieshadoop
postedMar 17, '11 at 9:43p
activeMar 17, '11 at 9:43p
posts1
users1
websitehadoop.apache.org...
irc#hadoop

1 user in discussion

Kevin.leach: 1 post

People

Translate

site design / logo © 2022 Grokbase