[pnfs] Kernel crash on access dCache NFS
Tigran Mkrtchyan
tigran.mkrtchyan at desy.de
Mon Jun 4 10:42:12 EDT 2007
and dump from wireshark:
Transmission Control Protocol, Src Port: nfs (2049), Dst Port: 798
(798), Seq: 3300, Ack: 3004, Len: 224
Source port: nfs (2049)
Destination port: 798 (798)
Sequence number: 3300 (relative sequence number)
[Next sequence number: 3524 (relative sequence number)]
Acknowledgement number: 3004 (relative ack number)
Header length: 32 bytes
Flags: 0x18 (PSH, ACK)
0... .... = Congestion Window Reduced (CWR): Not set
.0.. .... = ECN-Echo: Not set
..0. .... = Urgent: Not set
...1 .... = Acknowledgment: Set
.... 1... = Push: Set
.... .0.. = Reset: Not set
.... ..0. = Syn: Not set
.... ...0 = Fin: Not set
Window size: 230
Checksum: 0xc415 [correct]
[Good Checksum: True]
[Bad Checksum: False]
Options: (12 bytes)
NOP
NOP
Timestamps: TSval 7349967, TSecr 4294941661
Remote Procedure Call, Type:Reply XID:0xf0f84339
Fragment header: Last fragment, 220 bytes
1... .... .... .... .... .... .... .... = Last Fragment: Yes
.000 0000 0000 0000 0000 0000 1101 1100 = Fragment Length: 220
XID: 0xf0f84339 (4042801977)
Message Type: Reply (1)
[Program: NFS (100003)]
[Program Version: 4]
[Procedure: COMPOUND (1)]
Reply State: accepted (0)
[This is a reply to a request in frame 43]
[Time from request: 0.063057000 seconds]
Verifier
Flavor: AUTH_NULL (0)
Length: 0
Accept State: RPC executed successfully (0)
Network File System
[Program Version: 4]
[V4 Procedure: COMPOUND (1)]
Status: NFS4_OK (0)
Tag: <EMPTY>
length: 0
contents: <EMPTY>
Operations (count: 3)
Opcode: SEQUENCE (53)
Status: NFS4_OK (0)
sessionid: <DATA>
contents: <DATA>
seqid: 0x00000014
slot ID: 0
slot ID: 0
slot ID: 0
status: 0
Opcode: PUTFH (22)
Status: NFS4_OK (0)
Opcode: LAYOUTGET (50)
Status: NFS4_OK (0)
return on close?: Yes
offset: 0
length: 0
IO mode: IOMODE_RW (2)
layout type: LAYOUT_NFSV4_FILES (1)
stripe type: STRIPE_DENSE (2)
MDS commit?: Yes
stripe unit: 4096
length: 0
device ID: 174
device index: 0
filehandle
length: 46
[hash: 0xe04b8bba]
decode type as: unknown
filehandle:
303A696E6F64653A30303030323436384636414344374641...
0000 00 0c 29 98 68 ef 00 50 56 c0 00 08 08 00 45 00 ..).h..PV.....E.
0010 01 14 1a 97 40 00 40 06 6a aa ac 10 ae 01 ac 10 ....@.@.j.......
0020 ae 80 08 01 03 1e c4 79 f4 d8 ee 11 d4 06 80 18 .......y........
0030 00 e6 c4 15 00 00 01 01 08 0a 00 70 26 cf ff ff ...........p&...
0040 9b dd 80 00 00 dc f0 f8 43 39 00 00 00 01 00 00 ........C9......
0050 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
0060 00 00 00 00 00 00 00 00 00 03 00 00 00 35 00 00 .............5..
0070 00 00 33 23 23 23 23 23 23 23 23 23 23 23 23 23 ..3#############
0080 23 23 00 00 00 14 00 00 00 00 00 00 00 00 00 00 ##..............
0090 00 00 00 00 00 00 00 00 00 16 00 00 00 00 00 00 ................
00a0 00 32 00 00 00 00 00 00 00 01 00 00 00 00 00 00 .2..............
00b0 00 00 00 00 00 00 00 00 00 00 00 00 00 02 00 00 ................
00c0 00 01 00 00 00 5c 00 00 00 02 00 00 00 01 00 00 .....\..........
00d0 00 00 00 00 10 00 00 00 00 00 00 00 00 00 00 00 ................
00e0 00 00 00 00 00 01 00 00 00 ae 00 00 00 00 00 00 ................
00f0 00 2e 30 3a 69 6e 6f 64 65 3a 30 30 30 30 32 34 ..0:inode:000024
0100 36 38 46 36 41 43 44 37 46 41 34 43 34 34 42 46 68F6ACD7FA4C44BF
0110 36 44 36 30 37 42 44 39 43 33 30 35 39 46 3a 30 6D607BD9C3059F:0
0120 00 00 ..
Frame 46 (66 bytes on wire, 66 bytes captured)
Arrival Time: Jun 4, 2007 16:29:26.009605000
[Time delta from previous packet: 0.129990000 seconds]
[Time since reference or first frame: 15.001609000 seconds]
Frame Number: 46
Packet Length: 66 bytes
Capture Length: 66 bytes
[Frame is marked: False]
[Protocols in frame: eth:ip:tcp]
Ethernet II, Src: Vmware_98:68:ef (00:0c:29:98:68:ef), Dst:
Vmware_c0:00:08 (00:50:56:c0:00:08)
Destination: Vmware_c0:00:08 (00:50:56:c0:00:08)
Address: Vmware_c0:00:08 (00:50:56:c0:00:08)
.... ...0 .... .... .... .... = IG bit: Individual address
(unicast)
.... ..0. .... .... .... .... = LG bit: Globally unique address
(factory default)
Source: Vmware_98:68:ef (00:0c:29:98:68:ef)
Address: Vmware_98:68:ef (00:0c:29:98:68:ef)
.... ...0 .... .... .... .... = IG bit: Individual address
(unicast)
.... ..0. .... .... .... .... = LG bit: Globally unique address
(factory default)
Type: IP (0x0800)
Internet Protocol, Src: 172.16.174.128 (172.16.174.128), Dst:
172.16.174.1 (172.16.174.1)
Version: 4
Header length: 20 bytes
Differentiated Services Field: 0x00 (DSCP 0x00: Default; ECN: 0x00)
0000 00.. = Differentiated Services Codepoint: Default (0x00)
.... ..0. = ECN-Capable Transport (ECT): 0
.... ...0 = ECN-CE: 0
Total Length: 52
Identification: 0x5b13 (23315)
Flags: 0x04 (Don't Fragment)
0... = Reserved bit: Not set
.1.. = Don't fragment: Set
..0. = More fragments: Not set
Fragment offset: 0
Time to live: 64
Protocol: TCP (0x06)
Header checksum: 0x2b0e [correct]
[Good: True]
[Bad : False]
Source: 172.16.174.128 (172.16.174.128)
Destination: 172.16.174.1 (172.16.174.1)
Transmission Control Protocol, Src Port: 798 (798), Dst Port: nfs
(2049), Seq: 3004, Ack: 3524, Len: 0
Source port: 798 (798)
Destination port: nfs (2049)
Sequence number: 3004 (relative sequence number)
Acknowledgement number: 3524 (relative ack number)
Header length: 32 bytes
Flags: 0x10 (ACK)
0... .... = Congestion Window Reduced (CWR): Not set
.0.. .... = ECN-Echo: Not set
..0. .... = Urgent: Not set
...1 .... = Acknowledgment: Set
.... 0... = Push: Not set
.... .0.. = Reset: Not set
.... ..0. = Syn: Not set
.... ...0 = Fin: Not set
Window size: 5064
Checksum: 0x63b6 [correct]
[Good Checksum: True]
[Bad Checksum: False]
Options: (12 bytes)
NOP
NOP
Timestamps: TSval 4294941682, TSecr 7349967
[SEQ/ACK analysis]
[This is an ACK to the segment in frame: 45]
[The RTT to ACK the segment was: 0.129990000 seconds]
0000 00 50 56 c0 00 08 00 0c 29 98 68 ef 08 00 45 00 .PV.....).h...E.
0010 00 34 5b 13 40 00 40 06 2b 0e ac 10 ae 80 ac 10 .4[.@.@.+.......
0020 ae 01 03 1e 08 01 ee 11 d4 06 c4 79 f5 b8 80 10 ...........y....
0030 13 c8 63 b6 00 00 01 01 08 0a ff ff 9b f2 00 70 ..c............p
0040 26 cf &.
This looks like a valid reply.
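As a cross-check of the RPC header in the reply (frame 45 above), here is a minimal Python sketch that decodes the record mark and reply header by hand. The bytes are copied from the hex dump at frame offsets 0x42 through 0x5d; the function name and dictionary keys are made up for illustration, and the sketch assumes an accepted reply (reply state 0), as in this capture.

```python
import struct

# First 28 bytes of the TCP payload of the reply, copied from the hex dump
# above (frame offsets 0x42 through 0x5d).
PAYLOAD = bytes.fromhex(
    "800000dc"  # RPC record mark: last-fragment bit + fragment length
    "f0f84339"  # XID
    "00000001"  # message type (1 = reply)
    "00000000"  # reply state (0 = accepted)
    "00000000"  # verifier flavor (0 = AUTH_NULL)
    "00000000"  # verifier length
    "00000000"  # accept state (0 = success)
)


def parse_rpc_reply_header(data):
    """Decode the record mark and accepted-reply header (big-endian XDR).

    Hypothetical helper: it assumes the reply state is 'accepted', so an
    accept state follows the verifier.
    """
    mark, xid, msg_type, reply_stat, verf_flavor, verf_len = struct.unpack_from(
        ">6I", data, 0
    )
    # An XDR opaque body is padded to a multiple of 4 bytes.
    offset = 24 + ((verf_len + 3) & ~3)
    (accept_stat,) = struct.unpack_from(">I", data, offset)
    return {
        "last_fragment": bool(mark & 0x80000000),   # high bit of the mark
        "fragment_length": mark & 0x7FFFFFFF,       # low 31 bits of the mark
        "xid": xid,
        "msg_type": msg_type,
        "reply_stat": reply_stat,
        "verf_flavor": verf_flavor,
        "verf_len": verf_len,
        "accept_stat": accept_stat,
    }


header = parse_rpc_reply_header(PAYLOAD)
print(header)
```

The 220-byte fragment plus the 4-byte record mark adds up to the TCP Len: 224 shown in the dump, which agrees with what wireshark reports.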
Regards,
Tigran.
Tigran Mkrtchyan wrote:
>
>
> Hi *,
>
> For the last few weeks I have been trying to get my code synchronized
> with draft 10 (as all of us are), and I have run into a problem that I
> can't solve for now: the Linux client crashes. It is 100% reproducible.
> The symptoms:
>
>
> After mount, the client is happy and recognizes the server as a pNFS
> server, but any read/write operation ends in a kernel panic. The last
> operation seen by the server is LAYOUTGET. I expect GETDEVICEINFO to be
> the next one (since I return a new device ID every time), but the
> client dies.
>
>
> kernel log attached.
>
> kernel used: git-clone git://linux-nfs.org/linux-pnfs.git linux-git
>
>
> tigran@nairi:~/work/linux-git> git pull
> Already up-to-date.
> tigran@nairi:~/work/linux-git> date
> Mon Jun 4 11:51:22 CEST 2007
>
>
>
> Regards,
> Tigran.
>
>
> ________________________________________________________________________
> Tigran Mkrtchyan DESY, IT,
> tigran.mkrtchyan at desy.de Notkestrasse 85,
> Tel: + 49 40 89983946 Hamburg 22607,
> Fax: + 49 40 89984429 Germany.
>
>
--
________________________________________________________________________
Tigran Mkrtchyan DESY, IT,
tigran.mkrtchyan at desy.de Notkestrasse 85,
Tel: + 49 40 89983946 Hamburg 22607,
Fax: + 49 40 89984429 Germany.