Have a look at how PySpark works in conjunction with Spark, as this is not
just a matter of language preference. There are several implications, and a
performance price to pay, if you go with Python.
At the end of the day, only you can answer whether that price is worth paying over
retraining your team in anot
I don't have any specific wisdom for you on that front. But I've always
been served well by the 'Try both' approach.
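For what it's worth, a 'Try both' comparison can start as a very small timing
harness. The sketch below is only an illustration: benchmark, summarize,
candidate_a and candidate_b are hypothetical names, and the two candidates are
toy stand-ins that you would replace with calls that launch your real jobs.

```python
import statistics
import time


def benchmark(job, repeats=5):
    """Run job() `repeats` times and return the wall-clock timings in seconds."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        job()
        timings.append(time.perf_counter() - start)
    return timings


def summarize(name, timings):
    """Format the median and the slowest run for one candidate."""
    return f"{name}: median {statistics.median(timings):.4f}s, max {max(timings):.4f}s"


# Hypothetical stand-ins: swap these for functions that run each real setup
# (for example, by submitting the job and waiting for it to finish).
def candidate_a():
    sum(i * i for i in range(100_000))


def candidate_b():
    sum(map(lambda i: i * i, range(100_000)))


if __name__ == "__main__":
    for name, job in [("candidate_a", candidate_a), ("candidate_b", candidate_b)]:
        print(summarize(name, benchmark(job)))
```

Repeating each run and looking at the median rather than a single timing
helps smooth out warm-up and noise, though the right number of repeats
depends on how long your jobs take.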
Set up your benchmarks, configure both setups... You don't have to go the
whole hog, but just enough to get a mostly realistic implementation
functional. Run them both with some