Re: [HACKERS] pl/Ruby, deprecating plPython and Core

Dave Cramer Wed, 17 Aug 2005 13:08:19 -0700


On 17-Aug-05, at 12:40 PM, Thomas Hallgren wrote:

Andrew Dunstan wrote:
Dave Cramer wrote:
As there are two java procedural languages which are available for postgreSQL, Josh asked for an explanation as to their differences. They are quite similar in that both of them run the function in a java vm, and are pre-compiled. Neither attempts to compile the code.
The biggest difference is how they connect to the java VM.
PL/Java uses the Java Native Interface (JNI) and does a direct call into the java VM from the language handler.
PL-J uses a network protocol to connect to a java VM.


There are advantages and disadvantages to both approaches.
+ JNI is simpler, doesn't require a protocol, or an application container to manage the User Defined Functions
- JNI requires that the vm runs on the server machine, and a separate vm be instantiated for every connection that calls a function. This is mitigated somewhat in java 1.5, by sharing data, however this may or may not be a Sun only feature ( does anyone know ); either way a separate vm is required for each connection.
- startup time for the vm on the first call for the connection.
- Possible ( not as likely any more ) for the java VM to take the server down.
Using a network protocol, as PL-J does, has the following ( basically the opposite of the JNI (dis)advantages )
+ The java VM does not have to run on the server.
+ Only one vm per server
- More complex, requires a micro kernel application server to manage the UDFs, currently Loom ( http://loom.codehaus.org/ )
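
To make the two call paths in the list above concrete, here is a minimal sketch. It is not code from PL/Java or PL-J: every type and method name is hypothetical, and the "protocol" is a single UTF string over a loopback socket, not the real PL-J wire format. The in-process invoker stands in for the JNI route; the remote invoker stands in for a language handler talking to a separate VM.

```java
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.util.Locale;

/** Hypothetical interface; neither project exposes a type like this. */
interface FunctionInvoker {
    String invoke(String arg) throws IOException;
}

/** JNI-style path: the function body runs inside the caller's own process. */
final class InProcessInvoker implements FunctionInvoker {
    public String invoke(String arg) {
        return UdfDemo.upper(arg);
    }
}

/** Protocol-style path: the argument travels over a socket to another VM. */
final class RemoteInvoker implements FunctionInvoker {
    private final int port;

    RemoteInvoker(int port) {
        this.port = port;
    }

    public String invoke(String arg) throws IOException {
        try (Socket s = new Socket(InetAddress.getLoopbackAddress(), port);
             DataOutputStream out = new DataOutputStream(s.getOutputStream());
             DataInputStream in = new DataInputStream(s.getInputStream())) {
            out.writeUTF(arg);   // request: one length-prefixed string
            out.flush();
            return in.readUTF(); // response: one length-prefixed string
        }
    }
}

final class UdfDemo {
    /** The "user defined function" both paths end up calling. */
    static String upper(String s) {
        return s.toUpperCase(Locale.ROOT);
    }

    /** One-shot server on an ephemeral loopback port, standing in for the remote VM. */
    static ServerSocket startServer() throws IOException {
        ServerSocket server = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
        Thread t = new Thread(() -> {
            try (Socket s = server.accept();
                 DataInputStream in = new DataInputStream(s.getInputStream());
                 DataOutputStream out = new DataOutputStream(s.getOutputStream())) {
                out.writeUTF(upper(in.readUTF()));
                out.flush();
            } catch (IOException ignored) {
                // The sketch simply gives up if the single exchange fails.
            }
        });
        t.setDaemon(true);
        t.start();
        return server;
    }
}
```

Both invokers return the same result; what differs is where the function executes and what has to exist around it (nothing extra in the first case, a listening process and a wire format in the second).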
I think Dave misses a couple of important points.
1. Speed. One major reason for moving code from the middle tier down to the database is that you want to execute the code close to the actual persistence mechanisms in order to minimize network traffic and maximize throughput.

I think until there are actual benchmarks, there are too many variables here to suggest one is faster than the other. The overhead of having multiple java vm's is not easily estimated. Even with a connection pool, consider the memory footprint of even 10 java VM's.

2. A growing percentage of db-clients utilize some kind of connection pool (an overwhelming number of the java clients certainly do), which minimizes the problem with startup times.
3. Transaction visibility. A function that in turn issues new SQL calls must do that within the scope of the caller's transaction. A remote process must hence call back into its caller. PL/Java has its own JDBC driver that interacts directly with SPI.

PL-J maintains transaction visibility; it has its own JDBC driver as well. The protocol between the language handler and the java portion is based upon the FE/BE protocol, which made it easy to use pg's JDBC driver with some modification.
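
For readers unfamiliar with what "issuing SQL within the caller's transaction" looks like from the Java side, here is a small sketch. It assumes the function obtains its connection through the `jdbc:default:connection` URL, which to my knowledge is how PL/Java exposes its SPI-backed driver; the class and method names are made up for illustration, and nothing here is taken from the PL-J sources.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

/** Hypothetical function class; only the JDBC calls are standard API. */
final class CallerTxExample {
    /** Assumed URL for the connection supplied by the language handler. */
    static final String DEFAULT_URL = "jdbc:default:connection";

    /**
     * Counts rows in pg_class as seen from the calling session.
     * Inside a backend this is expected to run in the caller's transaction;
     * outside one, no driver accepts the URL and getConnection throws.
     */
    static long countRelations() throws SQLException {
        // The connection is left open on the assumption that it belongs
        // to the calling session rather than to this function.
        Connection conn = DriverManager.getConnection(DEFAULT_URL);
        try (Statement st = conn.createStatement();
             ResultSet rs = st.executeQuery("SELECT count(*) FROM pg_class")) {
            rs.next();
            return rs.getLong(1);
        }
    }
}
```

The point of the sketch is that the function never opens a connection of its own with a host, user and password; whichever driver answers for that URL is responsible for routing the query back into the session that called the function.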

4. Isolation. Using separate VM's, instabilities in the VM can only affect one single connection. One VM can be debugged or monitored without affecting the others. No data can be inadvertently moved between connections, etc.

Loom deals with data integrity. Debugging would have to be done over a remote debug connection, which can connect to any thread.

I try to shed more light on the pros and cons here: http://gborg.postgresql.org/project/pljava/genpage.php?jni_rationale
That's a pretty good explanation and ought to be published more widely. It's almost a pity that we couldn't have one project with a server setting saying how we want it to run.
There are a couple of reasons that make me a bit reluctant to join the projects:
PL/Java has no dependencies at all besides a Java Runtime Environment (or GCJ). PL/J requires a fair amount of other modules just to compile.

PL-J requires one other module, which the build environment will fetch automatically when compiling.

PL/Java is at release 1.1 and has a community of users. To my knowledge, PL/J has not reached its first release yet.
PL/Java and PL/J use completely different approaches and share almost no code. The code that we do share (public interfaces, mainly for trigger management) is published at the Maven repository at ibiblio.org.
I think it's better to keep the two projects separate. But I also think that it is extremely important that we ensure that the user experience is similar for both projects so that there's nothing to prevent a server setting that decides which one to use provided both are present.
Kind regards,
Thomas Hallgren
---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster



