[pnfs] New GPFS for CITI ibmcl cluster and crash info for current image

Marc Eshel eshel at almaden.ibm.com
Wed Jan 23 11:34:25 EST 2008


No, it is not identical but I thought that maybe different routine are 
in-line with different compiler, I was just hoping that it is related some 
how since they both have a bad fp.
Marc.




Benny Halevy <bhalevy at panasas.com> 
01/23/2008 08:11 AM

To
Marc Eshel <eshel at almaden.ibm.com>
cc
"William A. (Andy) Adamson" <andros at citi.umich.edu>, pnfs at linux-nfs.org, 
androsadamson at gmail.com
Subject
Re: [pnfs] New GPFS for CITI ibmcl cluster and crash info for   current 
image






On Jan. 23, 2008, 18:03 +0200, Marc Eshel <eshel at almaden.ibm.com> wrote:
> I will build and send GPFS later today so we can compare results, but 
you 
> will have to use a branch that includes Dean's changes to export ops. It 

> looks like Benny might be hitting the same problem with nfsd4_close.

Hmm, the trace below is not the same as the one I fixed today.
(see 22feaa118f497260b2dec5cdf83009915343a4f9)

Benny

> Marc.
> 
> 
> 
> 
> "William A. (Andy) Adamson" <andros at citi.umich.edu> 
> Sent by: androsadamson at gmail.com
> 01/23/2008 07:53 AM
> 
> To
> "Marc Eshel" <eshel at almaden.ibm.com>
> cc
> pnfs at linux-nfs.org
> Subject
> Re: New GPFS for CITI ibmcl cluster and crash info for current image
> 
> 
> 
> 
> 
> 
> Hi Marc
> 
> I'm running a 2.6.24-rc4 kernel based on a pre-benny ricardo 
> linux-2.6-latest git tree.
> Quite old.
> 
> I built the kernel on ibmcl1:/usr/local/src/kernel/linux-pnfs-2.6-latest
> 
> Which connectathon lock test failed? 
> 
> I want to install the latest 2.6.24-rc8 on the cluster and debug it. The 

> question is, do I need a new GPFS image?
> 
> -->Andy
> 
> On Jan 22, 2008 10:41 PM, Marc Eshel < eshel at almaden.ibm.com> wrote:
> Hi Andy,
> Can you please remind me version of Linux you are running on the server 
> and which branch of the tree are you using. I tried to run just the lock
> test from the connectathon suite and got the following results. I am 
using
> filelayout-draft-13 branch with Linux 2.6.24-rc8-pnfs. I will try to 
debug 
> 
> some more tomorrow.
> 
> Marc.
> 
> 
> Jan 22 15:20:45 bear107 kernel: xxx server proc  4 CLOSE
> Jan 22 15:20:45 bear107 kernel: NFSD: nfsd4_close on file lockfile8090
> Jan 22 15:20:45 bear107 kernel: NFSD: preprocess_seqid_op: seqid=0 
stateid 
> 
> = (47967970/00000020/000000
> 0c/00000000)
> Jan 22 15:20:45 bear107 kernel: NFSD: find_stateid flags 0x45
> Jan 22 15:20:45 bear107 kernel: renewing client (clientid
> 47967970/00000001)
> 000000100108 RIP:
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883ad38a>] 
> :nfsd:free_nfs4_file+0x39/0x85
> Jan 22 15:20:45 bear107 kernel: PGD 22a96a067 PUD 22b50f067 PMD 0
> Jan 22 15:20:45 bear107 kernel: Oops: 0002 [1] SMP
> Jan 22 15:20:45 bear107 kernel: CPU 0
> Jan 22 15:20:45 bear107 kernel: Modules linked in: nfsd auth_rpcgss 
> exportfs mmfs mmfslinux tracedev a
> utofs4 nfs lockd sunrpc dm_mirror dm_mod ehci_hcd ohci_hcd usbcore bnx2
> qla2xxx ext3 jbd mptsas scsi_t
> ransport_sas mptspi scsi_transport_spi mptfc scsi_transport_fc mptscsih 
> mptbase sd_mod
> Jan 22 15:20:45 bear107 kernel: Pid: 10145, comm: nfsd Not tainted
> 2.6.24-rc8-pnfs #1
> Jan 22 15:20:45 bear107 kernel: RIP: 0010:[<ffffffff883ad38a>]
> [<ffffffff883ad38a>] :nfsd:free_nfs4_f 
> ile+0x39/0x85
> Jan 22 15:20:45 bear107 kernel: RSP: 0018:ffff81022370fd80  EFLAGS:
> 00010286
> Jan 22 15:20:45 bear107 kernel: RAX: 0000000000100100 RBX:
> ffff810221575070 RCX: ffff810221575078
> Jan 22 15:20:45 bear107 kernel: RDX: 0000000000200200 RSI: 
> 0000000000000046 RDI: ffffffff883bb0fc
> Jan 22 15:20:45 bear107 kernel: RBP: ffffffff883ad351 R08:
> 0000000000000000 R09: 0000000000000000
> Jan 22 15:20:45 bear107 kernel: R10: 0000000000000000 R11:
> 0000000000000000 R12: ffff81022e51aa80 
> Jan 22 15:20:45 bear107 kernel: R13: 0000000000000000 R14:
> ffff810222a4b000 R15: 0000000000000000
> Jan 22 15:20:45 bear107 kernel: FS:  00002b3f01091b00(0000)
> GS:ffffffff8056f000(0000) knlGS:0000000000
> 000000 
> Jan 22 15:20:45 bear107 kernel: CS:  0010 DS: 0000 ES: 0000 CR0:
> 000000008005003b
> Jan 22 15:20:45 bear107 kernel: CR2: 0000000000100108 CR3:
> 000000022e61f000 CR4: 00000000000006e0
> Jan 22 15:20:45 bear107 kernel: DR0: 0000000000000000 DR1: 
> 0000000000000000 DR2: 0000000000000000
> Jan 22 15:20:45 bear107 kernel: DR3: 0000000000000000 DR6:
> 00000000ffff0ff0 DR7: 0000000000000400
> Jan 22 15:20:45 bear107 kernel: Process nfsd (pid: 10145, threadinfo
> ffff81022370e000, task ffff81022d
> b6c860)
> Jan 22 15:20:45 bear107 kernel: Stack:  ffff810221575070 
ffffffff802c8e32
> ffff810008a45210 ffff81022fc
> ea2e0
> Jan 22 15:20:45 bear107 kernel:  ffff810221575070 ffffffff883ae41a 
> ffff810222dd5dd8 0000000000000004
> Jan 22 15:20:45 bear107 kernel:  ffff81022e51aa80 ffffffff883ae4e9
> ffff81022fcea250 ffffffff883ae37f
> Jan 22 15:20:45 bear107 kernel: Call Trace:
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff802c8e32>] kref_put+0x74/0x82 

> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883ae41a>]
> :nfsd:release_stateid+0x162/0x175
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883ae4e9>]
> :nfsd:release_stateowner+0xbc/0xf1
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883ae37f>] 
> :nfsd:release_stateid+0xc7/0x175
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883b0899>]
> :nfsd:nfsd4_close+0x96/0x124
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff883a5255>]
> :nfsd:nfsd4_proc_compound+0x29d/0x440 
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff88396878>]
> :nfsd:nfsd_dispatch+0xe2/0x1d6
> Jan 22 15:20:45 bear107 kernel:  [<ffffffff8812f70c>]
> :sunrpc:svc_process+0x408/0x6de
> Jan 22 15:20:46 bear107 kernel:  [<ffffffff88396678>] 
> :nfsd:nfsd+0x1a4/0x2c2
> Jan 22 15:20:46 bear107 kernel:  [<ffffffff8020c3f8>] child_rip+0xa/0x12
> Jan 22 15:20:46 bear107 kernel:  [<ffffffff883964d4>] 
:nfsd:nfsd+0x0/0x2c2
> Jan 22 15:20:46 bear107 kernel:  [<ffffffff8020c3ee>] child_rip+0x0/0x12 

> Jan 22 15:20:46 bear107 kernel:
> Jan 22 15:20:46 bear107 kernel:
> Jan 22 15:20:46 bear107 kernel: Code: 48 89 50 08 48 89 02 31 c0 48 c7 
41
> 08 00 02 20 00 48 8b 73
> Jan 22 15:20:46 bear107 kernel: RIP  [<ffffffff883ad38a>] 
> :nfsd:free_nfs4_file+0x39/0x85
> Jan 22 15:20:46 bear107 kernel:  RSP <ffff81022370fd80>
> Jan 22 15:20:46 bear107 kernel: CR2: 0000000000100108
> Jan 22 15:20:46 bear107 kernel: ---[ end trace a9871b2cb4f37ac4 ]--- 
> 
> 
> 
> _______________________________________________
> pNFS mailing list
> pNFS at linux-nfs.org
> http://linux-nfs.org/cgi-bin/mailman/listinfo/pnfs





More information about the pNFS mailing list