Hello,

I have an application which is based on mysql + sphinx (for search) + PHP + caching. I am running the mysql part on a multi server master-slave and currently sphinx is over one box only.

Currently I have performance issues with writes (INSERT/DELETE/UPDATE/REPLACE) on Sphinx and I am looking for an alternative on search part. I was checking by 1+ year for some alternative solutions (when I didn't had current issues with sphinx) and on the short list is Riak and Mark Logic.

The app I am currently handing has about 130-170 search queries (SELECT with full text) at peak time (but occasionally can go to 200-500 qps) and 5-20 writes per second (INSERT/DELETE/UPDATE/REPLACE). The "bucket" size is close to 1 million. I am handling through sphinx mainly the search part, with only some non-search queries been sent to Sphinx because would take more to run on mysql.

The reads/searches are fine, average is 0.04/query. But currently I am having issues because of the way Sphinx is handling writes (it seems writes are waiting for all reads to complete), a write can even take 7 seconds to finish. Beside some momentary optimizations, as the traffic goes up I only have the solution to bring more capacity (which is not impossible but wont help so much based on my tests, the performance/box is decreasing as we add more boxes). So I am ending up checking for Riak for search part of the app.

My questions:
* what capacity/server specs could I start ? Can I start with just one box (the app will grow so definetly I will benefit from scaling features of Riak at later moment) for the above specs (200+ search queries, 5-20 write queries, 1 Million bucket size) ?

* what response times should I expect for search requests ? How about the write requests ? I don't have lined up here the queries and the data so this is impossible to know but I have no idea now how Riak works on performance. As you see for the moment scaling is not my focus. I plan to use the bench tool to do some testing but some overall insights still are helping.

* high offset (eg: list 10 items from a search located at offset 200,000) search requests how are expected to work ?

Thank you!
Cristian
_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to