If I want to read values out of input files as binary data, is this what BytesWritable is for?

I've successfully run my first task that uses a SequenceFile for output. Are there any examples of SequenceFile usage out there? I'd like to see the full range of what SequenceFile can do. What are the trade-offs between record compression and block compression? What are the limits on the key and value sizes? How do you use the per-file metadata?

My intended use is to read files on a local filesystem into a SequenceFile, with the value of each record being the contents of each file. I hacked MultiFileWordCount to get the basic concept working... but I'd appreciate any advice from the experts. In particular, what's the most efficient way to read data from an InputStreamReader/BufferedReader into a BytesWritable object?

Thanks,
John