I wrote a Hadoop mapper-only job that uses BulkOutputFormat to load a Cassandra table. That job would consistently fail with a flurry of exceptions (primary cause looks like EOFExceptions streaming between nodes).
I restructured the job to use an identity mapper and perform the updates in the reducer. Same code other be moving to the reducer. That job runs correctly. Is this expected behavior? TIA Brian