Maciek,

I fear that Python - or better: Jython - UDFs no longer work since C* 3.0.

Back in C* 2.2.x, there was the idea to allow the use of “all” JSR223 languages 
for UDFs - basically all languages that are listed in the lib/jsr223 directory.

UDFs in 2.2.x were not “sandboxed” - i.e. unrestricted access to files, 
network, classes etc - so users could actually execute “evil” code on the nodes 
by creating and executing a UDF. This is definitely something nobody wants to 
allow to see in production (e.g. a UDF body like Runtime.getRuntime().exec(“rm 
-rf /“) ).

Therefore we added a so called “sandbox” to C* 3.0.0, which means access to 
classes and even specific functions is restricted. Additionally, runtime quotas 
(heap usage and CPU time consumption) are checked. This is pretty straight 
forward for Java-UDFs. Unfortunately it is not straight forward for JavaScript 
UDFs - frankly speaking, it is difficult - and honestly speaking it’s annoying 
to secure all the possible runtime characteristics via JSR223.

I strongly recommend to use Java UDFs for various reasons:
* performance - Java UDFs get compiled to bytecode and are subject to Hotspot 
optimizations
* security - Java bytecode is inspected and rejected if a UDF calls an “evil” 
function. JSR223 (including JavaScript!) is not and we have to rely on the 
(limited) security checks for example in Nashorn. See also CASSANDRA-9954 - 
improving both performance and security for Java UDFs
* maintenance - Java code (or better: bytecode) is well defined. However, 
JavaScript (i.e. the Nashorn implementation) changes.

IMHO your “best” option is to switch to Java UDFs.

TL;DR Python and probably all script languages except JavaScript don’t work 
since 3.0.

Robert

PS: Honestly, looking backwards it was maybe a mistake to allow “all” JSR-223 
languages, so I’ve opened https://issues.apache.org/jira/browse/CASSANDRA-12883 
<https://issues.apache.org/jira/browse/CASSANDRA-12883>.

—
Robert Stupp
@snazy

> On 3 Nov 2016, at 07:15, Maciej Bryński <mac...@brynski.pl> wrote:
> 
> Hi,
> I have following problem with Jython UDF.
> 
> 1) I'm using Cassandra 3.9 deb packages and Ubuntu 14.04. I'm running Oracle 
> Java 1.8.0_101-b13)
> 
> 2) I added jython jar to /usr/share/cassandra/lib. (jython version 2.7.0)
> This makes creating python function possible
> 
> 3) I want to test function.
> 
> cqlsh:e> CREATE FUNCTION IF NOT EXISTS test123 (input bigint) CALLED ON NULL 
> INPUT RETURNS text LANGUAGE python AS 'return "123"';
> 
> This worked, but running select with udf returns exception:
> Traceback (most recent call last):
>   File "/usr/bin/cqlsh.py", line 1264, in perform_simple_statement
>     result = future.result()
>   File 
> "/usr/share/cassandra/lib/cassandra-driver-internal-only-3.5.0.post0-d8d0456.zip/cassandra-driver-3.5.0.post0-d8d0456/cassandra/cluster.py",
>  line 3650, in result
>     raise self._final_exception
> FunctionFailure: Error from server: code=1400 [User Defined Function failure] 
> message="execution of 'e.test123[bigint]' failed: 
> java.security.AccessControlException: access denied: 
> ("java.lang.RuntimePermission" 
> "accessClassInPackage.org.python.jline.console")
> 
> 4) I tried to modify /etc/java-8-oracle/security/java.policy and added:
> 
> grant codeBase "file:/usr/share/cassandra/lib/*" {
>         permission java.security.AllPermission;
> };
> 
> Still no improvement.
> 
> Any ideas how to run python UDFs in Cassandra ?
> 
> Regards,
> -- 
> Maciek Bryński

Reply via email to