nfs client hanging
Guillaume Rousse
Guillaume.Rousse at inria.fr
Wed Jul 2 09:44:50 EDT 2008
Guillaume Rousse a écrit :
> Given this problem only occurs for home directories for a host with a
> very specific usage (transient ssh logins for svn commits, or web home
> page access), I'm more and more considering an autofs issue with
> repeated mount/umount events, rather than a pure nfs issue. With a
> similar setup, we have no problem for long-lived mount points, such as
> homedirs on workstations. And I just found report on autofs ML of such
> kind of issues:
> http://linux.kernel.org/pipermail/autofs/2008-June/004814.html
With a static NFS mount, the problem still appears, but a bit
differently, as the filer is still connected
tcp 0 0 node3.cluster.msr-inria:755
msr-nas1.msr-inria.inri:nfs ESTABLISHED
And this time the log output is a bit different:
Jul 2 15:43:35 localhost kernel: RPC: new task initialized,
procpid 22288
Jul 2 15:43:35 localhost kernel: RPC: allocated task ffff81042bc77900
Jul 2 15:43:35 localhost kernel: RPC: 0 looking up UNIX cred
Jul 2 15:43:35 localhost kernel: RPC: 491 __rpc_execute flags=0x80
Jul 2 15:43:35 localhost kernel: RPC: 491 call_start nfs4 proc 1 (sync)
Jul 2 15:43:35 localhost kernel: RPC: 491 call_reserve (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 reserved req
ffff810429d60e70 xid c2586af8
Jul 2 15:43:35 localhost kernel: RPC: 491 call_reserveresult (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 call_allocate (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 allocated buffer of size
972 at ffff81042509d000
Jul 2 15:43:35 localhost kernel: RPC: 491 call_bind (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 call_connect xprt
ffff81042bd17800 is not connected
Jul 2 15:43:35 localhost kernel: RPC: 491 xprt_connect xprt
ffff81042bd17800 is not connected
Jul 2 15:43:35 localhost kernel: RPC: 491 sleep_on(queue
"xprt_pending" time 4311285843)
Jul 2 15:43:35 localhost kernel: RPC: 491 added to queue
ffff81042bd179d0 "xprt_pending"
Jul 2 15:43:35 localhost kernel: RPC: 491 setting alarm for 60000 ms
Jul 2 15:43:35 localhost kernel: RPC: xs_connect scheduled xprt
ffff81042bd17800
Jul 2 15:43:35 localhost kernel: RPC: 491 sync task going to sleep
Jul 2 15:43:35 localhost kernel: RPC: xs_bind4 0.0.0.0:755: ok (0)
Jul 2 15:43:35 localhost kernel: RPC: worker connecting xprt
ffff81042bd17800 to address: addr=193.55.250.11 port=2049 proto=tcp
Jul 2 15:43:35 localhost kernel: RPC: ffff81042bd17800 connect
status 115 connected 0 sock state 2
Jul 2 15:43:35 localhost kernel: RPC: xs_tcp_state_change client
ffff81042bd17800...
Jul 2 15:43:35 localhost kernel: RPC: state 1 conn 0 dead 0 zapped 1
Jul 2 15:43:35 localhost kernel: RPC: 491 __rpc_wake_up_task (now
4311285843)
Jul 2 15:43:35 localhost kernel: RPC: 491 disabling timer
Jul 2 15:43:35 localhost kernel: RPC: 491 removed from queue
ffff81042bd179d0 "xprt_pending"
Jul 2 15:43:35 localhost kernel: RPC: __rpc_wake_up_task done
Jul 2 15:43:35 localhost kernel: RPC: 491 sync task resuming
Jul 2 15:43:35 localhost kernel: RPC: 491 deleting timer
Jul 2 15:43:35 localhost kernel: RPC: 491 xprt_connect_status:
connection established
Jul 2 15:43:35 localhost kernel: RPC: 491 call_connect_status (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 call_transmit (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 xprt_prepare_transmit
Jul 2 15:43:35 localhost kernel: RPC: 491 call_encode (status 0)
Jul 2 15:43:35 localhost kernel: RPC: 491 marshaling UNIX cred
ffff81042ed96f00
Jul 2 15:43:35 localhost kernel: RPC: 491 using AUTH_UNIX cred
ffff81042ed96f00 to wrap rpc data
Jul 2 15:43:35 localhost kernel: RPC: 491 xprt_transmit(140)
Jul 2 15:43:35 localhost kernel: RPC: xs_tcp_send_request(140) = 140
Jul 2 15:43:35 localhost kernel: RPC: 491 xmit complete
Jul 2 15:43:35 localhost kernel: RPC: 491 sleep_on(queue
"xprt_pending" time 4311285843)
Jul 2 15:43:35 localhost kernel: RPC: 491 added to queue
ffff81042bd179d0 "xprt_pending"
Jul 2 15:43:36 localhost kernel: RPC: 491 setting alarm for 60000 ms
Jul 2 15:43:36 localhost kernel: RPC:
wake_up_next(ffff81042bd17980 "xprt_resend")
Jul 2 15:43:36 localhost kernel: RPC:
wake_up_next(ffff81042bd17930 "xprt_sending")
Jul 2 15:43:36 localhost kernel: RPC: 491 sync task going to sleep
Jul 2 15:43:36 localhost kernel: RPC: xs_tcp_data_ready...
Jul 2 15:43:36 localhost kernel: RPC: xs_tcp_data_recv started
Jul 2 15:43:36 localhost kernel: RPC: reading TCP record fragment
of length 228
Jul 2 15:43:36 localhost kernel: RPC: reading XID (4 bytes)
Jul 2 15:43:36 localhost kernel: RPC: reading reply for XID c2586af8
Jul 2 15:43:36 localhost kernel: RPC: XID c2586af8 read 224 bytes
Jul 2 15:43:36 localhost kernel: RPC: xprt = ffff81042bd17800,
tcp_copied = 228, tcp_offset = 228, tcp_reclen = 228
Jul 2 15:43:36 localhost kernel: RPC: 491 xid c2586af8 complete (228
bytes received)
Jul 2 15:43:36 localhost kernel: RPC: 491 __rpc_wake_up_task (now
4311285843)
Jul 2 15:43:36 localhost kernel: RPC: 491 disabling timer
Jul 2 15:43:36 localhost kernel: RPC: 491 removed from queue
ffff81042bd179d0 "xprt_pending"
Jul 2 15:43:36 localhost kernel: RPC: __rpc_wake_up_task done
Jul 2 15:43:36 localhost kernel: RPC: xs_tcp_data_recv done
Jul 2 15:43:36 localhost kernel: RPC: 491 sync task resuming
Jul 2 15:43:36 localhost kernel: RPC: 491 deleting timer
Jul 2 15:43:36 localhost kernel: RPC: 491 call_status (status 228)
Jul 2 15:43:36 localhost kernel: RPC: 491 call_decode (status 228)
Jul 2 15:43:36 localhost kernel: RPC: 491 validating UNIX cred
ffff81042ed96f00
Jul 2 15:43:36 localhost kernel: RPC: 491 using AUTH_UNIX cred
ffff81042ed96f00 to unwrap rpc data
--
Guillaume Rousse
Moyens Informatiques - INRIA Futurs
Tel: 01 69 35 69 62
More information about the NFSv4
mailing list