Re: Index Replication / Clustering

Nader Henein Mon, 27 Jun 2005 02:14:58 -0700

I implemented a JMS-based solution about a year ago because I thought it would solve my atomicity problem and give me a centralized way of indexing. You'll have to use the pluggable persistence (if you use ActiveMQ) to be able to recover from a failure, and you'll also need some way of maintaining, in a persistent store, which records have and haven't been indexed, so you can run sanitization later on. Assuming that everything goes well every time you add a document to one of your clustered indices is a sure way of guaranteeing that you end up with n indices with no two alike, especially if you have high-volume indices with high rates of change.
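To make the "persistent store" point concrete, here is a minimal sketch of the kind of bookkeeping I mean. It is not the code I ran: the table name, the function names, and the use of SQLite are all assumptions made for illustration. Every change is recorded as pending for each index node, a node's row is flipped only once that node confirms the write, and the sanitization pass simply asks for whatever is still pending.

```python
import sqlite3

def open_ledger(path=":memory:"):
    """Open (or create) a ledger of what each index node has applied.
    Hypothetical schema; use a real file path for actual persistence."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS index_ledger ("
        " doc_id TEXT NOT NULL,"
        " node TEXT NOT NULL,"
        " indexed INTEGER NOT NULL DEFAULT 0,"
        " PRIMARY KEY (doc_id, node))"
    )
    return db

def record_change(db, doc_id, nodes):
    """Mark doc_id as needing (re)indexing on every node."""
    with db:  # commits on success, rolls back on error
        db.executemany(
            "INSERT OR REPLACE INTO index_ledger (doc_id, node, indexed) "
            "VALUES (?, ?, 0)",
            [(doc_id, node) for node in nodes],
        )

def confirm_indexed(db, doc_id, node):
    """Called only after the node reports the document was written."""
    with db:
        db.execute(
            "UPDATE index_ledger SET indexed = 1 WHERE doc_id = ? AND node = ?",
            (doc_id, node),
        )

def pending(db, node):
    """Documents the sanitization pass still has to push to this node."""
    rows = db.execute(
        "SELECT doc_id FROM index_ledger "
        "WHERE node = ? AND indexed = 0 ORDER BY doc_id",
        (node,),
    )
    return [row[0] for row in rows]
```

Note that this sketch does not version the changes, so a late confirmation for an older update could mask a newer one; a real ledger would probably need a per-document version or timestamp column.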


Nader Henein


Stephane Bailliez wrote:

Hi Paul,

Thanks for the reply. Many interesting points.

Paul Smith wrote:
Why not try using JMS messaging to send messages to the indexing server that Document X needs to be updated via a JMS queue? This gives you the flexibility to have the indexing system down but not lose the message that it needs to be indexed, and also allows the indexing server to be 'busy' without slowing down the application that is performing the updates.
Excellent idea.
If you use ActiveMQ for JMS, you can take advantage of its Composite Destination feature and have a virtual Queue/Topic that is actually several Queues/Topics. This is what we use to keep a mirror index server completely in sync. The application sends an update message to a queue named "queue://index1, queue://index2", which becomes 2 separate queues for the 2 servers, allowing them to process the same message whenever they can get around to it.
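The fan-out behaviour described above can be modelled in a few lines. This is only a toy in-memory stand-in, not the ActiveMQ or JMS API: the `ToyBroker` class and its methods are hypothetical, and the only thing taken from the message above is the comma-separated "queue://index1, queue://index2" destination name.

```python
from collections import deque

class ToyBroker:
    """In-memory stand-in for a broker (hypothetical, not ActiveMQ)."""

    def __init__(self):
        self.queues = {}  # queue name -> deque of pending messages

    def send(self, destination, message):
        # A composite destination is a comma-separated list of queues;
        # each listed queue receives its own copy of the message.
        for name in destination.split(","):
            name = name.strip()
            if name.startswith("queue://"):
                name = name[len("queue://"):]
            self.queues.setdefault(name, deque()).append(message)

    def receive(self, queue_name):
        # Each consumer drains only its own queue, at its own pace.
        queue = self.queues.get(queue_name)
        return queue.popleft() if queue else None


broker = ToyBroker()
broker.send("queue://index1, queue://index2", "update Document X")

first = broker.receive("index1")   # index1 processes the update now
backlog = len(broker.queues["index2"])  # index2 still holds its copy
```

The point of the model is that the producer sends once, and a slow or offline node keeps its own backlog without holding up the other node.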
Ah, the composite topic is indeed a nice one. But out of curiosity... did you put your 2 nodes (consumers) as embedded brokers, or is the producer the main broker?
We then place Apache in front of these 2 mirrored Index/Search nodes so the application can use web-services to query the search node without actually being aware that there are 2 of them behind the scenes, leaving Apache to do the load-balancing and fail-over as the index/search nodes come up/down without the main application knowing anything about it.
Ideally, the 2 nodes have the same state when running.
What happens when a node fails, you put it back online, and it needs to catch up with all the missing messages in its queue? Is it considered 'offline' until it catches up? If yes, how do you do it? If no, I guess you don't mind that a search request may not give the same result depending on the node it is load-balanced to, correct?
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

---


--

Nader S. Henein
Senior Applications Architect

Bayt.com

---


