I'm looking for some feedback on the design I'm using for a basic key/value
storage using postgres.

Just some quick background.  This design is for large scale games that can
get up to 10K writes per second or more.  The storage will be behind a
distributed memory cache that is built on top of Akka, and has a write
behind caching mechanism to cut down on the number of writes when you have
many updates in a short time period of the same key, which is common for a
lot of multiplayer type games.

I have been using Couchbase, but this is an open source project, and
Couchbase is basically a commercial product for all intents and purposes,
which is problematic.  I will still support Couchbase, but I don't want it
have to tell people if you really want to scale, couchbase is the only
option.

The schema is that a key is a string, and the value is a string or binary.
I am actually storing protocol buffer messages, but the library gives me
the ability to serialize to native protobuf or to json.  Json is useful at
times especially for debugging.

This is my current schema:

CREATE TABLE entities
(
  id character varying(128) NOT NULL,
  value bytea,
  datatype smallint,
  CONSTRAINT entities_pkey PRIMARY KEY (id)
);

CREATE OR REPLACE RULE entities_merge AS
    ON INSERT TO entities
   WHERE (EXISTS ( SELECT 1
           FROM entities entities_1
          WHERE entities_1.id::text = new.id::text)) DO INSTEAD  UPDATE
entities SET value = new.value, datatype = new.datatype
  WHERE entities.id::text = new.id::text;

Additional functionality I want is to do basic fuzzy searches by key.
Currently I'm using a left anchored LIKE query.  This works well because
keys are left prefixed with a scope, a delimiter, and then the actual key
for the data.  These fuzzxy searches would never be used in game logic,
they would be admin only queries for doing things like  obtaining a list of
players.  So they should be infrequent.

The scope of the query ability will not expand in the future.  I support
multiple backends for the key/value storage so I'm working with the lowest
common denominator.  Plus I have a different approach for data that you
need to do complex queries on (regular tables and an ORM).

 Chris

Reply via email to