Bull GNU/Linux NFSv4 project

2.6.25-rc6-CITI_NFS4_ALL-1
Robustness regression tests
and Performances
June 2008

Introduction

Here are the last results for the Robustness and Performances testing  of the kernel and CITI patch:

linux-2.6.25-rc6-CITI_NFS4_ALL-1

Tests performed

The following tools have been used:

Performances Results


We retrieve logically a degradation following the security flavor used.
For any flavors but krb5p, we get a drastic degradation performance between 256MB and  512 Mbytes files and over even having 2G RAM memory. If it was able to see that in linux-2.6.22-rc5-CITI_NFS4_ALL-1.diff, we didn't see that in the previous 2.6.19-rc6-CITI_NFS4_ALL-1

Robustness Results

Remarks:

We get randomly in a such overloaded environment the following error:
Second check for lost reply on non-idempotent requests
testing 50 idempotencies in directory "testdir"
rmdir 1: Directory not empty
special tests failed

See BUGZILLA
#162


We get randomly nfsd looping
 PID USER      PR  NI  VIRT  RES  SHR S   %CPU %MEM    TIME+        COMMAND
10826 root       20    0     0        0       0   R     100     0.0         769:34.99      nfsd
 5      root       RT   -5     0        0       0   S       5      0.0          20:42.48      watchdog/0

A similar bug was found by Miklos Szeredi about infinite loop in generic_file_splice_read() got with fsx one the benchs we are using

When using the patch found at http://lkml.org/lkml/2008/2/29/443, the infinite loop no longer appear and we were able to run benchmarks during 60 hours and more.




Kerberos Robustness Results


We get randomly in a such overloaded environment the following error:
Second check for lost reply on non-idempotent requests
testing 50 idempotencies in directory "testdir"
rmdir 1: Directory not empty
special tests failed

See BUGZILLA
#162


We get randomly nfsd looping
 PID USER      PR   NI  VIRT  RES  SHR S  %CPU %MEM     TIME+       COMMAND
10826 root       20     0     0       0        0   R   100       0.0         769:34.99       nfsd
 5       root       RT   -5     0       0        0   S     5         0.0          20:42.48       watchdog/0

A similar bug was found by Miklos Szeredi about infinite loop in generic_file_splice_read() got with fsx one the benchs we are using

When using the patch found at http://lkml.org/lkml/2008/2/29/443, the infinite loop no longer appear and we were able to run benchmarks during 60 hours and more.

kernel BUG at net/sunrpc/auth_gss/svcauth_gss.c:1243!
See BUGZILLA #165


Loopback Robustness Results:

Testing   Tool

NFSV4
sec=sys
2hours

NFSV4 
sec=sys
  15 hours
NFSV4
sec=krb5
2Hours

NFSV4
sec=krb5
15 Hours

NFSV4
sec=krb5i
2 Hours

NFSV4
sec=krb5p
2 hours

NFSV4
sec=krb5p
15 Hours

Simultaneously in loopback
fsstress
+fsx
+iozone (no -U)
+ffsb
+cthon04 -t





PASSED
(1)





PASSED
(1)





PASSED
(1)





FAILED
(3)





FAILED
(1)(2)





PASSED
(1)



FAILED
(1)(3)
iozone -U
PASSED PASSED

PASSED


locktests
PASSED



PASSED


dbenchs
PASSED









(1) special tests from connectathon suite randomly  fail
(2)
kernel bug See BUGZILLA #165
(3) Hang

testing
 tool

NFSV4
sec=sys
15 hours

NFSV4
sec=krb5
2 hours

NFSV4
sec=krb5
15 hours
NFSV4
sec=krb5i
2 hours

NFSV4
sec=krb5p
2 hours

Simultaneously in loopback
fsstress
+fsx
+iozone (no -U)
+ffsb
+cthon04-t (ext)



PASSED
(1)



PASSED
(1)



FAILED
(3)



FAILED
(1) (2)



PASSED
(1)

(1) special tests from connectathon suite randomly  fail
(2) kernel bug See BUGZILLA #165
(3) Hang

Remarks:
We get randomly in a such overloaded environment the following error:
Second check for lost reply on non-idempotent requests
testing 50 idempotencies in directory "testdir"
rmdir 1: Directory not empty
special tests failed

See BUGZILLA
#162
See BUGZILLA #165

Software versions

Linux linux-2.6.25-rc6-CITI_NFS4_ALL-1
Client userland package util-linux-2.12
+ util-linux-2.12-CITI_NFS4_ALL-3.dif
Linux nfs-utils version nfs-utils-1.1.2
+nfs-utils-1.1.2-CITI_NFS4_ALL-1.dif
gssapi library
libgssglue-0.1
rpcsecgss
library
librpcsecgss-0.17
nfsidmap
library
libnfsidmap-0.20
acl
library
acl_2.2.29-1
+acl-2.2.29-CITI_NFS4_ALL-3
Linux TI-RPC 1.0.8

Hardware configuration

 Client

Server

Conclusion

Core linux-2.6.25-rc6-CITI_NFS4_ALL-1 functions can be considered as stable for the security flavors sys, krb5 ,krb5i and krb5p when the "splice patch" is applied. Note that is the first release we have with all the benchmarks running simultaneously more than  2 hours.

There is nevertheless the random issue about the special tests of the connectathon suite.
About write performances we also have  a big performance degradation when using iozone with file sizes from 512Mbytes.

There is no longer the mount issue
BUGZILLA#145 which could be troubling in many scripts.