Networking Forums

Networking Forums > Computer Networking > Linux Networking > Linux NFS server reliability?

Reply
Thread Tools Display Modes

Linux NFS server reliability?

 
 
Shi Jin
Guest
Posts: n/a

 
      04-13-2004, 08:47 PM
Hi,there,

We are using two linux clusters. Both of them have a similiar NFS
problem. The fileserver has a RAID attached to it and it is exporting
the /home as NFS to many linux client nodes. Whenever the clients are
trying to write too much data to the NFS /home or when the somebody is
doing a lot of IO on the fileserver itself, the writes on the clients
to the NFS /home will frequently leads to an error. We tried to tune
the parameters but still have the same problem. The current mounting
parameters are
rw,nosuid,soft,intr,rsize=8192,wsize=8192
And the NFS server exporting parameters are simple:
rw,sync,root_squash

Could anybody tell me whether this is our own problem or it is common
to all linux NFS servers. Is there a way to solve it?
Thank you very much.

Shi Jin
 
Reply With Quote
 
 
 
 
Joshua Baker-LePain
Guest
Posts: n/a

 
      04-14-2004, 02:07 PM
In article <(E-Mail Removed) >, Shi Jin wrote:
>
> We are using two linux clusters. Both of them have a similiar NFS
> problem. The fileserver has a RAID attached to it and it is exporting
> the /home as NFS to many linux client nodes. Whenever the clients are
> trying to write too much data to the NFS /home or when the somebody is
> doing a lot of IO on the fileserver itself, the writes on the clients
> to the NFS /home will frequently leads to an error. We tried to tune


What is the error you're getting? What hardware do you have on
the server side? What is your network setup?

> the parameters but still have the same problem. The current mounting
> parameters are
> rw,nosuid,soft,intr,rsize=8192,wsize=8192


The 'hard' mount option is preferred over 'soft', but that's a bit of
a religous issue.

> And the NFS server exporting parameters are simple:
> rw,sync,root_squash
>
> Could anybody tell me whether this is our own problem or it is common
> to all linux NFS servers. Is there a way to solve it?


The Linux NFS HOWTO has a whole chapter on tuning:

http://nfs.sourceforge.net/nfs-howto/

Not knowing what exact error you're getting, I can can only guess. One
thing to check is to make sure you have enough server threads. By default,
e.g., Red Hat only launches 8 nfsd threads, which is definitely not enough
for a busy server. Check the output of 'cat /proc/net/rpc/nfsd'. Look
at the output of the 4th line, which should look something like this:

th 256 846 2422.580 993.510 436.910 195.680 123.210 54.990 36.530 27.830 19.010 32.410

The first number is the number of server threads (256 in this case). The
second number is the most important one. That's the number of times
requests have had to be queued because all the threads were busy. In
this case, I had been using 128 threads, saw that 846, and then upped the
threads to 256 (without rebooting and thus without resetting the statistics).
The number hasn't increased since then. The rest of the numbers represent
the amount of time a particular percentage of the threads have been busy
at the same time. I.E., 10% of the threads were busy for 2422.58 units (I'm
not sure of the units), 20% were busy for 993.51, etc. Having
the last few numbers be low indicates that you've got enough threads.

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
 
Reply With Quote
 
Shi Jin
Guest
Posts: n/a

 
      04-15-2004, 04:08 AM
Thank you very much.
I think my server threads are not enough. But how can I increase this
number?
I am using Redhat, unfortunately.
Thanks.

Shi Jin

On Wed, 14 Apr 2004 14:07:25 +0000, Joshua Baker-LePain wrote:

> In article <(E-Mail Removed) >, Shi Jin wrote:
>>
>> We are using two linux clusters. Both of them have a similiar NFS
>> problem. The fileserver has a RAID attached to it and it is exporting
>> the /home as NFS to many linux client nodes. Whenever the clients are
>> trying to write too much data to the NFS /home or when the somebody is
>> doing a lot of IO on the fileserver itself, the writes on the clients
>> to the NFS /home will frequently leads to an error. We tried to tune

>
> What is the error you're getting? What hardware do you have on
> the server side? What is your network setup?
>
>> the parameters but still have the same problem. The current mounting
>> parameters are
>> rw,nosuid,soft,intr,rsize=8192,wsize=8192

>
> The 'hard' mount option is preferred over 'soft', but that's a bit of
> a religous issue.
>
>> And the NFS server exporting parameters are simple:
>> rw,sync,root_squash
>>
>> Could anybody tell me whether this is our own problem or it is common
>> to all linux NFS servers. Is there a way to solve it?

>
> The Linux NFS HOWTO has a whole chapter on tuning:
>
> http://nfs.sourceforge.net/nfs-howto/
>
> Not knowing what exact error you're getting, I can can only guess. One
> thing to check is to make sure you have enough server threads. By default,
> e.g., Red Hat only launches 8 nfsd threads, which is definitely not enough
> for a busy server. Check the output of 'cat /proc/net/rpc/nfsd'. Look
> at the output of the 4th line, which should look something like this:
>
> th 256 846 2422.580 993.510 436.910 195.680 123.210 54.990 36.530 27.830 19.010 32.410
>
> The first number is the number of server threads (256 in this case). The
> second number is the most important one. That's the number of times
> requests have had to be queued because all the threads were busy. In
> this case, I had been using 128 threads, saw that 846, and then upped the
> threads to 256 (without rebooting and thus without resetting the statistics).
> The number hasn't increased since then. The rest of the numbers represent
> the amount of time a particular percentage of the threads have been busy
> at the same time. I.E., 10% of the threads were busy for 2422.58 units (I'm
> not sure of the units), 20% were busy for 993.51, etc. Having
> the last few numbers be low indicates that you've got enough threads.


 
Reply With Quote
 
Joshua Baker-LePain
Guest
Posts: n/a

 
      04-15-2004, 01:11 PM
In article <(E-Mail Removed)>, Shi Jin wrote:
> Thank you very much.
> I think my server threads are not enough. But how can I increase this
> number?
> I am using Redhat, unfortunately.


That's all we use here (of course, Red Hat itself is about 15min away...).
With a recent version of RedHat, put the following line in
/etc/sysconfig/nfs:

RPCNFSDCOUNT=128

replacing 128 with however many threads you want to run. Then do
'/etc/init.d/nfs restart'.

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
 
Reply With Quote
 
Rex Dieter
Guest
Posts: n/a

 
      04-15-2004, 01:48 PM
Shi Jin wrote:

> I think my server threads are not enough. But how can I increase this
> number?
> I am using Redhat


Create/edit
/etc/sysconfig/nfs
to contain
RPCNFSDCOUNT=8

(The default # threads is 8, so edit to suit your needs).

And restart the nfs service,
/sbin/service nfs restart

-- Rex
 
Reply With Quote
 
 
 
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
wireless reliability n6trf Wireless Networks 1 04-17-2008 05:01 PM
wireless reliability n6trf Wireless Networks 4 04-08-2008 02:09 AM
GRE reliability Jean-Francois Smigielski Linux Networking 1 04-16-2005 12:13 PM
Sipgate reliability Brian A Broadband 9 02-26-2005 03:16 AM
Who else is good for reliability, reliability, and reliability ? Medallion Man Broadband 10 01-30-2005 08:36 AM



1 2 3 4 5 6 7 8 9 10 11