On Sat, 18 Mar 2006 22:03:24 -0800, David Schwartz wrote:
>
> "Andrew Gideon" <(E-Mail Removed)> wrote in message
> news
(E-Mail Removed)...
>
>> I'm seeing errors on one side of a GigE link. The link is a
>> point-to-point, just connecting two computers. As I move a lot of data,
>> the error count increases.
>
> Why do you think this is a problem?
Interesting question. I'm blaming this for poor performance I'm seeing
under certain conditions.
Specifically, I'm copying data from an iSCSI-based SAN (testing the use of
the SAN, in fact). When I have a single process doing the copy (ie. a dd
from a volume mounted from the SAN to /dev/null), I get pretty good
performance. When I run two dd processes concurrently, I get awful
performance.
I noticed that I see no (or occasionally very few) overruns in the case of
the single dd process, but more overruns in the case of the two dd
processes.
So I've been assuming that overruns are, if not the cause, at least a
common symptom.
Is that reasonable, or do you believe otherwise? I'd be most interested
in other directions in which I should be looking.
[...]
> I doubt it can handle the peak bursts 100% of the time.
I would think the same thing of my backup server. Yet asking both the
switch to which it is connected and that machine itself, no errors of any
sort (including overruns) are reported. Yet flow control is listed as off
on both switch and computer.
So something else is throttling.
I've no problem "blaming" rsync or ssh for this throttling. I seem to
recall, in fact, reading of some "problem" in one of those two where the
TCP window was too small for fully exploiting certain links (high
speed, high latency? I don't recall the details).
Of course, I run multiple rsync/ssh processes concurrently, with streams
at the backup server coming from all over my network. So I'm not sure I
entirely believe my own explanation.
So perhaps the solution is in tuning parameters like the TCP window for
iSCSI. It looks like the initiator software on Linux permits this, so
I'll be checking. I've no idea, though, how well this will impact the
SAN's choices. Unfortunately, it's more of a black box than I'd prefer.
But I welcome any and all insights into what's occurring and what I can do
to get better performance over this "point to point GigE" link.
Thanks...
Andrew