Hi Oleg, Great to hear from you. I wondered how many of the old-timers were still around.
> Why not use composite type ? For simple interval approach it's worked for us > (see attached hdate.sql). I have just begun looking at your hdate example; I see potentially useful stuff in it, but the first thing that I noticed is hat it is not fully equivalent to my problem. It looks like you only need to match intervals, while I need to match intervals and something else — ideally, in a single operation. I attempted to explain that in my reply to Craig Ringer. > If you need to specify distribution > function, Not in this case; there is no uncertainty associated with the loci; where there is uncertainty is in the existence of a feature called at a locus: is it real or is it a technogenic artifact? But that is a different problem for a later day. > than it may be > worth to see orion project http://orion.cs.purdue.edu/index.html > 6 years ago we was thinking about implementation special UNCERTAINTY data type > (http://www.sai.msu.su/~megera/postgres/talks/big_uncertain_data.pdf), but > never > started :( It'd be nice if you start this very interesting for science > project. I love uncertainty, and I’ve always wished I could make it computable. I also wish folks around me had the same appreciation for it. My job is to say yes or no where the data suggest maybe, or maybe not. Needless to say, I feel a bit exercised. I am reading the info you provided with keen interest. > btw, now you can use range data type, check > https://wiki.postgresql.org/images/7/73/Range-types-pgopen-2012.pdf > <https://wiki.postgresql.org/images/7/73/Range-types-pgopen-2012.pdf> Great stuff, I was not aware of it. I saw it in early development but did not know it made it to the core. I tried it (and will go and update a few kludgy apps where I had to use bad surrogates). It is not directly applicable to genomic loci because it will require additional constraints for intelligent matching. I want to go for compete encapsulation of constraints. Part of the reason for such a perverse desire is that I use the database as a calculator — that is, I load some data in a one-off experiment and I literally type everything in psql while I muddle through. There is a limit on how much I can type and not screw things up beyond comprehension, so I want the query language to be as easy and interactive as possible. Having to drag along a set of additional constraints is not quite interactive and is error-prone. Regards, —Gene