Re: Proposed RDF FHIR syntax feedback

David Booth Sat, 07 Mar 2015 17:27:56 -0800

There are a few things going on here that I think are causing some confusion. One is discussion of RDF serializations (syntax). Another is discussion of ontologies (i.e., data models or TBox) versus instance data (i.e., ABox, or data that is expressed in terms of those data models or ontologies). A third is discussion of dereferenceable FHIR URIs. I'll try to help untangle them, but first I'd like to suggest some simple terminology to help reduce confusion in these discussions.

ONTOLOGY: I suggest we use the word "ontology" when we are talking about the definitions of classes and properties, relationships between them or restrictions on their use, such as cardinality.

INSTANCE DATA: Similarly, I suggest we use the term "instance data" when we are talking about data that is represented *using* those classes and properties. An example would be specific patient data (such as an observation) that is transmitted in a FHIR payload.

I think this "ontology" versus "instance data" dichotomy will help clarify our discussions. HOWEVER, there are several circumstances that cause this distinction to be blurred:

- RDF itself makes no distinction between ontologies and instance data (TBox and ABox) -- it's all just sets of assertions to RDF. "Triples all the way down." :)

- RDF file formats are *not* a reliable indicator of whether a file contains an ontology, instance data or a combination of both. A .rdf file (RDF/XML) can hold OWL ontology definitions, as can a .ttl (Turtle) file or any other standard RDF serialization. To add even more confusion, if you're using a tool like Protege, the tool might store everything in .owl files, regardless of whether the data is acting as ontologies or as instance data. The .owl extension does *not* necessarily mean the file contains an ontology (as defined above).

- Terms from OWL and RDFS vocabularies can be freely intermingled in an RDF document -- and they typically are, especially when that document acts as an ontology.

- FHIR profile definitions can be transmitted in a FHIR payload just as patient data can be transmitted. In that sense a FHIR profile can act like instance data, but in its use -- defining extensions and constraining the content of other FHIR resources -- it acts more like an ontology.
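
To illustrate the first point in the list above, here is a minimal sketch in plain Python, with triples modeled as tuples. The fhir: and ex: names are hypothetical, made up for illustration, and are not actual FHIR ontology terms. The classification function is only a heuristic of my own; RDF itself attaches no "ontology" or "instance" label to a triple.

```python
# Well-known vocabulary terms, written as prefixed names for readability.
RDF_TYPE = "rdf:type"
RDFS_SUBCLASSOF = "rdfs:subClassOf"
OWL_CLASS = "owl:Class"

# One graph: just a set of (subject, predicate, object) triples.
# The fhir: and ex: names below are hypothetical.
graph = {
    # Statements we would informally call "ontology" (TBox)
    ("fhir:Observation", RDF_TYPE, OWL_CLASS),
    ("fhir:Observation", RDFS_SUBCLASSOF, "fhir:Resource"),
    # Statements we would informally call "instance data" (ABox)
    ("ex:obs1", RDF_TYPE, "fhir:Observation"),
    ("ex:obs1", "fhir:status", '"final"'),
}

SCHEMA_PREDICATES = {RDFS_SUBCLASSOF, "rdfs:domain", "rdfs:range"}
SCHEMA_TYPES = {OWL_CLASS, "rdfs:Class",
                "owl:ObjectProperty", "owl:DatatypeProperty"}


def looks_like_ontology(triple):
    """Heuristic only: guess whether a triple is a schema-level statement."""
    _subject, predicate, obj = triple
    return predicate in SCHEMA_PREDICATES or (
        predicate == RDF_TYPE and obj in SCHEMA_TYPES
    )


# The split exists only because we imposed it; the graph is one set.
ontology_part = {t for t in graph if looks_like_ontology(t)}
instance_part = graph - ontology_part
```

Both parts live in the same set and could be written to the same file in any RDF serialization, which is why a file extension tells you little about which kind of content it holds.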

For FHIR, we need to define both a FHIR *ontology* -- a set of classes and properties -- and bi-directional mappings that will convert FHIR *instance* *data* from FHIR XML or FHIR JSON to FHIR RDF and vice versa.

Because RDF is independent of serialization, file formats and serializations are largely irrelevant to our FHIR RDF/ontology effort: we'll be producing a FHIR ontology, using standard RDF, RDFS and OWL vocabularies, and it can be serialized to any standard RDF format. For this reason, I don't think we should spend much time worrying about what RDF serialization to use for the FHIR ontology. It's pretty much irrelevant.

However, for FHIR RDF *mappings*, for convenience we may choose to define those mappings in terms of specific FHIR XML, FHIR JSON and/or FHIR RDF serializations. For example, the Shape Expressions (ShEx) approach that Eric Prud'hommeaux demonstrated transforms FHIR *XML* to *Turtle*. And in the JSON-LD approach that I'm investigating, the mapping from JSON-LD to RDF will simply be the standard RDF interpretation of the JSON-LD: no additional mapping definition will be required.
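
To make the JSON-LD idea a bit more concrete, here is a deliberately tiny sketch in Python. It is not a JSON-LD processor: it handles only a flat document whose @context maps each term directly to an IRI and whose values are plain strings. The document, the example.org IRIs and the function name are all assumptions for illustration, not part of any FHIR specification. A real implementation would rely on a conforming JSON-LD library instead.

```python
import json

# Hypothetical FHIR-like JSON-LD document; every IRI here is made up.
doc = json.loads("""
{
  "@context": {
    "status": "http://example.org/fhir/status",
    "code": "http://example.org/fhir/code"
  },
  "@id": "http://example.org/obs1",
  "status": "final",
  "code": "8867-4"
}
""")


def flat_jsonld_to_triples(document):
    """Expand a flat JSON-LD-style object into (s, p, o) triples.

    Handles only a term -> IRI @context, an @id, and string values.
    """
    context = document["@context"]
    subject = document["@id"]
    triples = set()
    for key, value in document.items():
        if key.startswith("@"):
            continue  # JSON-LD keywords are not properties
        # The context alone tells us which IRI each JSON key stands for.
        triples.add((subject, context[key], value))
    return triples


triples = flat_jsonld_to_triples(doc)
```

The point of the sketch is that the @context carries everything needed to read the JSON as RDF, so no separate mapping definition has to be written.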

In summary: (a) RDF serializations can hold a mixture of RDF, RDFS and/or OWL -- and they often do; and (b) the serialization format is independent of whether the document contains an ontology or instance data or both.


David Booth
