Stephen Watt <sw...@us.ibm.com> wrote:
> Please let me know if any of assertions are incorrect. I'm going to be
> adding any feedback to the Hadoop Wiki. It seems well documented that the
> LZO Codec is the most performant codec (
> http://blog.oskarsson.nu/2009/03/hadoop-feat-lzo-save-disk-space-and.html)

Speedwise, yes.

> 1) It appears most folks are using LZO. Given that it is GPL, are you not
> worried about it virally infecting your project ?

"Viral infections" are more of a Windows concept, not so much source code or licenses. There are literally _piles_ of information available on this, and you really should go read up on it. But the upshot is that (1) GPLv2's obligations are triggered by distribution, not use, and (2) even if you're distributing in violation of the license, the worst that can happen is that you lose all privileges with respect to the GPL'd code and perhaps have to pay damages for copyright infringement.

It can't "infect" your own code, but to the extent that the combined work is legally considered a derivative work, you can be barred from distributing the combination.

Greg