Hi all,

I have written a Python UDF and it is working as expected in Hive, but I have a question about how it gets called behind the scenes. As I understand it, Hive uses the UDF as a mapper function. Does Hive create a new Python process for each row it processes? If so, won't that add significant overhead on the cluster?
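For context, a Python script of this kind usually has the shape sketched below. This is only an illustration, assuming the script is invoked through Hive's TRANSFORM clause and receives tab-separated rows on standard input. The names `transform_row` and `process_stream` and the upper-casing logic are hypothetical, not the actual UDF.

```python
import io
import os
import sys


def transform_row(line):
    """Hypothetical per-row logic: upper-case the second tab-separated column."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) > 1:
        fields[1] = fields[1].upper()
    return "\t".join(fields)


def process_stream(in_stream, out_stream):
    """Read rows until end of input and return how many rows were handled."""
    count = 0
    for line in in_stream:
        out_stream.write(transform_row(line) + "\n")
        count += 1
    return count


# Stand-in for the rows that would be piped in; a real script would call
# process_stream(sys.stdin, sys.stdout) instead of using StringIO buffers.
sample_input = io.StringIO("1\talice\n2\tbob\n")
sample_output = io.StringIO()
rows_handled = process_stream(sample_input, sample_output)

# Log the process id to stderr so it does not mix with the row output.
sys.stderr.write("pid %d handled %d rows\n" % (os.getpid(), rows_handled))
```

Because the script loops over its input until end of stream, a single process can handle many rows as long as the caller keeps the pipe open. Writing the pid to stderr, as above, is one way to check in the task logs how many processes are actually started.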
-- Sreenath S Kamath Bangalore