Alan Connor <(E-Mail Removed)> wrote in message news:<reLDb.7344$(E-Mail Removed) link.net>...
> On Tue, 16 Dec 2003 08:14:28 -0500, Neil Horman <(E-Mail Removed)> wrote:
> >
> >
> > Alan Connor wrote:
> >> Let's say that I have a database distributed among 20 boxes, all at different
> >> locations with different IPs.
> >>
> >> The full database takes only 5 of these boxes, so there are 3 mirrors.
> >>
> >> Can someone tell me the best way to set things up so that a client can
> >> just enter a URL and access whichever group of 5 boxes currently isn't
> >> being used to capacity? How to connect those 5 boxes together so that
> >> they are effectively ONE box?
> >>
> > DNS is a cheap solution. Several name server packages provide features
> > (or at least have patches available) to allow you to specify some level
> > of load balancing policy, be it a simple round robin approach, or
> > something a little more complex. It takes a little work to set up, and
> > it can get a little askew, as once a DNS lookup is cached by a client
> > there is no rebalancing available to the system
> >
> > Alternatively there are professional hardware/software solutions to do
> > this. IBM offers websphere (I think) which enables this sort of
> > transparent load balancing feature, and it appears as one IP address to
> > the outside world, so multiple accesses from the same client can be
> > reblanaced as needed. Expect to pay corporate prices for this though.
> >
> > HTH
> > Neil
> >
>
> Indeed it does. Thanks Neil.
>
>
> The simplest solution at this point seems to be to have a seperate application
> keep track of the work sent to a particular mirror, and to temporarily
> remove that IP from the available list when it has all that it can handle.
>
>
> AC
Forgive me, but I am easily confused. Are you saying that it takes 5
separate boxes to _hold_ all the db data? Located in different
locations (thus different IP's, different nets). Does each site have
5 boxes dedicated to the db? And what's with the mirrors? Are these
replicated db's or do they need to be up to date within say 10
minutes? I'm just trying to get the spec right. Like I said, I'm
easily confused.
Depending on the nature of your data and db, the load you expect, the
nature of the traffic on the different nets and how much work you're
willing to do youself or pay for, you may want to look at ZEO. Did
not look through old mail, but I seem to recall some one or more
people using it in a fashion somewhat like what I _think_ you're
seeking. They may have used RSS to push ZEO updates out to different
sites. I recall talk of schemes trying to achieve some load
balancing, but to tell the truth I'm not clear what they or what you
really want to achieve in this regard.
Anyway, it may be worth a look:
http://zope.org/Products/ZEO/ZEOFactSheet
http://mail.zope.org/pipermail/zodb-dev/ (mailiing list archive
with some ZEO)
Note that to use ZEO you do NOT have to use Zope. Technically, you
don't _have_ to use the ZODB, but most people do because of its
design, integration with ZEO, and "ease" with which it can interface
with rdbs's if you need that kind of data storage. And if you are
willing to work with Python, there's probably a complete solution
(with hand built glue code and gap fillers) . It might give you a
piece of the puzzle, anyway.
hth,
prg
email above disabled