Hi,

I'm trying to build something like an embedded data base system around
arrow with some properties which I will list below.
The question is if someone can point me to existing solutions (or attempts)
for such kind of problem.
I'm trying to develop a data layout such that
* typically reading is not for single value but for a range of keys/fields
* this is all in-memory
* it provides low overhead for reading and writing
* multithreaded reading supported, but updates are single threaded
* 99% of updates are primarily adding a new data point to the end of column
(a-la log)
* return data in arrow tables with minimal overhead (better not to copy
data from internal storage but do slicing whenever possible)

On a high level I had in mind something like SSTables with level compaction
when the data is accessed for reading. But before doing this I wanted to
check if there is a solution for such a problem leveraging an arrow or if
there are all the bricks to build such a solution. Any input (including
papers) is welcomed.

-- 
Best regards,
Kirill Lykov

Reply via email to