I understand that that text records in Hadoop streaming are delimited by the newline character and that there is a configurable delimiter between keys and values (defaults to tab).
1) The structure of the rawbytes format suggests that no record or key/value delimiters are necessary, but can someone confirm that this is the case?
2) In the typedbytes format, how are keys and values delimited, and how are records delimited?
3) Also, how are keys sorted in the typedbytes and rawbytes format?
See https://hadoop.apache.org/docs/current2/api/org/apache/hadoop/typedbytes/package-summary.html
Antonio