NFS client patches for Linux 2.6.26-rc9

The following set of patches fix known issues with the 2.6.26-rc9 NFS client code, and significantly enhance the support for NFSv4.

linux-2.6.26-001-fix_readdir.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Mon, 7 Jul 2008 13:26:10 -0400

NFS: Fix readdir cache invalidation

invalidate_inode_pages2_range() takes page offset arguments, not byte ranges.

Another thought is that individual pages might perhaps get evicted by VM pressure, in which case we might perhaps want to re-read not only the evicted page, but all subsequent pages too (in case the server returns more/less data per page so that the alignment of the next entry changes). We should therefore remove the condition that we only do this on page->index==0.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-002-fix_rpcbind_double_free.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Mon, 7 Jul 2008 12:18:52 -0400

SUNRPC: Fix a double-free in rpcbind

It is wrong to be freeing up the rpcbind arguments if the call to rpcb_call_async() fails, since they should already have been freed up by rpcb_map_release().

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-003-fix_rpcbind_breakage.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 1 Jul 2008 15:20:55 -0400

SUNRPC: Fix an rpcbind breakage for the case of IPv6 lookups

Now that rpcb_next_version has been split into an IPv4 version and an IPv6 version, we Oops when rpcb_call_async attempts to look up the IPv6-specific RPC procedure in rpcb_next_version.

Fix the Oops simply by having rpcb_getport_async pass the correct RPC procedure as an argument.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-004-reduce_nfs4_create_stack_usage.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Fri, 20 Jun 2008 15:35:32 -0400

NFS: Reduce the stack usage in NFSv4 create operations

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-005-reduce_nfs3_create_stack_usage.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Fri, 20 Jun 2008 17:00:23 -0400

NFS: Reduce the stack usage in NFSv3 create operations

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-006-fix_nfs_update_request_memleak.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Thu, 5 Jun 2008 15:17:39 -0400

NFS: Fix a preemption count leak in nfs_update_request

The commit 2785259631697ebb0749a3782cca206e2e542939 (nfs: use GFP_NOFS preloads for radix-tree insertion) appears to have introduced a bug: We only want to call radix_tree_preload() once after creating a request. Calling it every time we loop after we created the request, will cause preemption count leaks.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com> Cc: Nick Piggin <npiggin@suse.de>

linux-2.6.26-007-add_lock_check_bounds.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 20 May 2008 19:34:39 -0400

NFS: Add correct bounds checking to NFSv2 locks

NFSv2 file locking currently fails the Connectathon tests, because the calls to the VFS locking code do not return an EINVAL error if the struct file_lock overflows the 32-bit boundaries.

The problem is due to the fact that we occasionally call helpers from fs/locks.c in order to avoid RPC calls to the server when we know that a local process holds the lock. These helpers are, of course, always 64-bit enabled, so EINVAL is not returned in cases when it would if the call had gone to the NLM code.

For consistency, we therefore add support for a bounds-checking helper.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-008-exit_on_encode_error.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Wed, 14 May 2008 19:48:25 -0700

SUNRPC: Ensure we exit early in case of an encode error

All errors from call_encode(), with exception of EAGAIN are fatal, so we should immediately return instead of proceeding to xprt_transmit().

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-009-enomem_on_encode_is_fatal.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 10 Jun 2008 18:30:11 -0400

SUNRPC: An ENOMEM error from call_encode is always fatal

The special 'ENOMEM' case that was previously flagged as non-fatal is bogus: auth_gss always returns EAGAIN for non-fatal errors, and may in fact return ENOMEM in the special case where xdr_buf_read_netobj runs out of preallocated buffer space (invariably a _fatal_ error, since there is no provision for preallocating larger buffers).

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-010-optimise_append.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 10 Jun 2008 18:31:00 -0400

NFS: Optimise append writes with holes

If a file is being extended, and we're creating a hole, we might as well declare the entire page to be up to date.

This patch significantly improves the write performance for sparse files in the case where lseek(SEEK_END) is used to append several non-contiguous writes at intervals of < PAGE_SIZE.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-011-revert_commit_44dd151d.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Sun, 25 May 2008 12:59:19 -0400

NFS: Revert commit 44dd151d

Revert commit 44dd151d "NFS: Don't mark a written page as uptodate until it is on disk". While it is true that the write may fail, that is always the case. There is no reason why we should treat data on pages that are not already marked as PG_uptodate as being special. The only thing we gain is a noticeable slowdown when re-reading these pages.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-012-use_gfp_nofs_when_alloc_creds.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 10 Jun 2008 18:31:01 -0400

SUNRPC: Use GFP_NOFS when allocating credentials

Since the credentials may be allocated during the call to rpc_new_task(), which again may be called by a memory allocator...

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-013-locks_dont_flush_on_delegation.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 10 Jun 2008 18:31:02 -0400

NFS: do_setlk(): don't flush caches when we have a delegation

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-014-update_help_text_for_config_nfs_fs.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:04 -0400

NFS: Update help text for CONFIG_NFS_FS

Clean up: refresh the help text for Kconfig items related to the NFS client. Remove obsolete URLs, and make the language consistent among the options.

Also move the ROOT_NFS config option next to the options related to the NFS client.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-015-add_a_function_to_display_the_name_of_an_rpc_procedure.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:12 -0400

SUNRPC: Add a function to display the name of an RPC procedure

Improve debugging messages in call_start() and call_verify() by having them show the RPC procedure name instead of the procedure number.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-016-rename_call_functions_that_are_no_longer_fsm_states.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:19 -0400

SUNRPC: Rename "call_" functions that are no longer FSM states

The RPC client uses a finite state machine to move RPC tasks through each step of an RPC request. Each state is contained in a function in net/sunrpc/clnt.c, and named call_foo.

Some of the functions named call_foo have changed over the past few years and are no longer states in the FSM. These include: call_encode, call_header, and call_verify. As a clean up, rename the functions that have changed.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-017-don_t_display_the_rpc_show_tasks_header_if_there_are_no_tasks.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:26 -0400

SUNRPC: Don't display the rpc_show_tasks header if there are no tasks

Clean up: don't display the rpc_show_tasks column header unless there is at least one task to display. As far as I can tell, it is safe to let the list_for_each_entry macro decide that each list is empty.

scripts/checkpatch.pl also wants a KERN_FOO at the start of any newly added printk() calls, so this and subsequent patches will also add KERN_INFO.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-018-refactor_rpc_show_tasks.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:33 -0400

SUNRPC: Refactor rpc_show_tasks

Clean up: move the logic that displays each task to its own function. This removes indentation and makes future changes easier.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-019-display_some_debugging_information_as_text_rather_than_numbers.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 21 May 2008 17:09:41 -0400

SUNRPC: Display some debugging information as text rather than numbers

In rpc_show_tasks(), display the program name, version number, procedure name and tk_action as human-readable variable-length text fields rather than columnar numbers.

Doing the symbol lookup here helps in cases where we have actual debugging output from a kernel log, but don't have access to the kernel image or RPC module that generated the output.

Sample output:

-pid- flgs status -client- --rqstp- -timeout ---ops-- 5608 0001 -11 eeb42690 f6d93710 0 f8fa1764 nfsv3 WRITE a:call_transmit_status q:none 5609 0001 -11 eeb42690 f6d937e0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5610 0001 -11 eeb42690 f6d93230 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5611 0001 -11 eeb42690 f6d93300 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5612 0001 -11 eeb42690 f6d93090 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5613 0001 -11 eeb42690 f6d933d0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5614 0001 -11 eeb42690 f6d93cc0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5615 0001 -11 eeb42690 f6d93a50 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5616 0001 -11 eeb42690 f6d93640 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5617 0001 -11 eeb42690 f6d93b20 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending 5618 0001 -11 eeb42690 f6d93160 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-020-make_nfs_fsync_methods_consistent.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Tue, 27 May 2008 16:29:07 -0400

NFS: Make nfs_fsync methods consistent

Clean up: Report the same debugging info, count function calls the same, and use similar function naming in nfs_fsync_dir() and nfs_fsync().

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-021-make_nfs_llseek_methods_consistent.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:55:34 -0400

NFS: Make nfs_llseek methods consistent

Clean up: Report the same debugging info in nfs_llseek_dir() and nfs_llseek_file().

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-022-make_nfs_open_methods_consistent.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:55:42 -0400

NFS: Make nfs_open methods consistent

Clean up: Report the same debugging info and count function calls the same for files and directories in nfs_opendir() and nfs_file_open().

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-023-add_debugging_facility_for_nfs_aops.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:55:50 -0400

NFS: Add debugging facility for NFS aops

Recent work in fs/nfs/file.c neglected to add appropriate trace debugging for the NFS client's address space operations.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-024-use_nfsdbg_file_for_all_fops.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:55:58 -0400

NFS: Use NFSDBG_FILE for all fops

Clean up: some fops use NFSDBG_FILE, some use NFSDBG_VFS. Let's use NFSDBG_FILE for all fops, and consistently report file names instead of inode numbers.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-025-fix_trace_debugging_nits_in_write_c.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:56:05 -0400

NFS: Fix trace debugging nits in write.c

Clean up: fix a few dprintk messages that still need to show the RPC task ID correctly, and be sure we use the preferred %lld or %llu instead of %Ld or %Lu.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-026-remove_obsolete_messages_during_transport_connect.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 11 Jun 2008 17:56:13 -0400

SUNRPC: Remove obsolete messages during transport connect

Recent changes to the RPC client's transport connect logic make connect status values ECONNREFUSED and ECONNRESET impossible.

Clean up xprt_connect_status() to account for these changes.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-027-nfsroot_remove_cvs_keyword.dif:

From: Adrian Bunk <bunk@kernel.org>

Date: Tue, 20 May 2008 01:08:00 +0300

fs/nfs/nfsroot.c: remove CVS keyword

This patch removes a CVS keyword that wasn't updated for a long time from a comment.

Signed-off-by: Adrian Bunk <bunk@kernel.org>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-028-check_options_when_remounting.dif:

From: Jeff Layton <jlayton@redhat.com>

Date: Tue, 10 Jun 2008 15:38:39 -0400

NFS: implement option checking when remounting NFS filesystems (resend)

When remounting an NFS or NFS4 filesystem, the new NFS options are not respected, yet the remount will still return success. This patch adds a remount_fs sb op for NFS that checks any new nfs mount options against the existing ones and fails the mount if any have changed.

This is only implemented for string-based mount options since doing this with binary options isn't really feasible.

This is essentially the same as the original patch I sent out, but adds a check to see if the addr= option has changed.

Signed-off-by: Jeff Layton <jlayton@redhat.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-029-bring_back_cl_chatty.dif:

From: Olga Kornievskaia <aglo@citi.umich.edu>

Date: Mon, 9 Jun 2008 16:51:31 -0400

rpc: bring back cl_chatty

The cl_chatty flag alows us to control whether a given rpc client leaves

"server X not responding, timed out"

messages in the syslog. Such messages make sense for ordinary nfs clients (where an unresponsive server means applications on the mountpoint are probably hanging), but not for the callback client (which can fail more commonly, with the only result just of disabling some optimizations).

Previously cl_chatty was removed, do to lack of users; reinstate it, and use it for the nfsd's callback client.

Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-030-eliminate_unused_variable_in_auth_gss_upcall_code.dif:

From: \\\"J. Bruce Fields\\\" <bfields@citi.umich.edu>

Date: Mon, 9 Jun 2008 16:51:33 -0400

rpc: eliminate unused variable in auth_gss upcall code

Also, a minor comment grammar fix in the same file.

Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-031-remove_some_unused_macros.dif:

From: \\\"J. Bruce Fields\\\" <bfields@citi.umich.edu>

Date: Mon, 9 Jun 2008 16:51:34 -0400

rpc: remove some unused macros

There used to be a print_hexl() function that used isprint(), now gone. I don't know why NFS_NGROUPS and CA_RUN_AS_MACHINE were here.

I also don't know why another #define that's actually used was marked "unused".

Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-032-minor_cleanup_of_scheduler_callback_code.dif:

From: \\\"J. Bruce Fields\\\" <bfields@citi.umich.edu>

Date: Mon, 9 Jun 2008 16:51:35 -0400

rpc: minor cleanup of scheduler callback code

Try to make the comment here a little more clear and concise.

Also, this macro definition seems unnecessary.

Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-034-set_rq_xtime_on_all_xprts.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Fri, 06 Jun 2008 13:22:25 -0400

SUNRPC: Ensure all transports set rq_xtime consistently

The RPC client uses the rq_xtime field in each RPC request to determine the round-trip time of the request. Currently, the rq_xtime field is initialized by each transport just before it starts enqueing a request to be sent. However, transports do not handle initializing this value consistently; sometimes they don't initialize it at all.

To make the measurement of request round-trip time consistent for all RPC client transport capabilities, pull rq_xtime initialization into the RPC client's generic transport logic. Now all transports will get a standardized RTT measure automatically, from:

xprt_transmit()

to

xprt_complete_rqst()

This makes round-trip time calculation more accurate for the TCP transport. The socket ->sendmsg() method can return "-EAGAIN" if the socket's output buffer is full, so the TCP transport's ->send_request() method may call the ->sendmsg() method repeatedly until it gets all of the request's bytes queued in the socket's buffer.

Currently, the TCP transport sets the rq_xtime field every time through that loop so the final value is the timestamp just before the *last* call to the underlying socket's ->sendmsg() method. After this patch, the rq_xtime field contains a timestamp that reflects the time just before the *first* call to ->sendmsg().

This is consequential under heavy workloads because large requests often take multiple ->sendmsg() calls to get all the bytes of a request queued. The TCP transport causes the request to sleep until the remote end of the socket has received enough bytes to clear space in the socket's local output buffer. This delay can be quite significant.

The method introduced by this patch is a more accurate measure of RTT for stream transports, since the server can cause enough back pressure to delay (ie increase the latency of) requests from the client.

Additionally, this patch corrects the behavior of the RDMA transport, which entirely neglected to initialize the rq_xtime field. RPC performance metrics for RDMA transports now display correct RPC request round trip times.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Acked-by: Tom Talpey <thomas.talpey@netapp.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-035-remove_redundant_file_open_rpc_op.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Wed, 11 Jun 2008 16:32:46 -0400

NFS: Remove the redundant file_open entry from struct nfs_rpc_ops

All instances are set to nfs_open(), so we should just remove the redundant indirection. Ditto for the file_release op

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-036-add_nfs_iostat_h_to_include_linux.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 12 Jun 2008 12:32:25 -0400

NFS: Move fs/nfs/iostat.h to include/linux

The fs/nfs/iostat.h header has definitions that were designed to be exposed to user space. Move these definitions under include/linux so user space can use the definitions in applications that read /proc/self/mountstats.

Also address a handful of coding style issues called out by checkpatch.pl in fs/nfs/iostat.h.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-037-fix_warning_in_nfs4_async_handle_error.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Wed, 11 Jun 2008 16:42:05 -0400

NFS: Fix a warning in nfs4_async_handle_error

We're not modifying the nfs_server when we call nfs_inc_server_stats and friends, so allow the compiler to pass 'const' pointers too.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-038-fix_nfs4_proc_set_acl.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Wed, 11 Jun 2008 17:39:04 -0400

NFS: Ensure we zap only the access and acl caches when setting new acls

...and ensure that we obey the NFS_INO_INVALID_ACL flag when retrieving the acls.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-039-allow_any_value_for_the_retry_option.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 12 Jun 2008 12:37:33 -0400

NFS: Allow any value for the "retry" option

The kernel NFS mount option parser should ignore the retry= mount option since it is meaningful only in user space. Today it expects a number rather than arbitrary text, so it ignores the option if the value is numeric, but chokes if there are other characters in the value.

Change it to allow any text (except ",") as its value.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-040-treat_intr_and_nointr_options_as_deprecated.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 12 Jun 2008 12:37:41 -0400

NFS: Treat "intr" and "nointr" options as deprecated

Clean up: the "intr" and "nointr" mount options were recently retired. Document this in the NFS mount option parser.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-041-missing_newline_in_nfs_mount_debugging_message.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 12 Jun 2008 12:37:49 -0400

NFS: missing newline in NFS mount debugging message

Clean up.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-042-cleanup_nfs_update_request.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Fri, 13 Jun 2008 12:12:32 -0400

NFS: Clean up nfs_update_request()

Simplify the loop in nfs_update_request by moving into a separate function the code that attempts to update an existing cached NFS write.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-043-defer_commit.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Fri, 13 Jun 2008 13:25:22 -0400

NFS: Allow redirtying of a completed unstable write.

Currently, if an unstable write completes, we cannot redirty the page in order to reflect a new change in the page data until after we've sent a COMMIT request.

This patch allows a page rewrite to proceed without the unnecessary COMMIT step, putting it immediately back onto the dirty page list, undoing the VM unstable write accounting, and removing the NFS_PAGE_TAG_COMMIT tag from the NFS radix tree.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-044-fix_nfs4_fs_type_undeclared.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Tue, 17 Jun 2008 16:12:00 -0400

NFS: Fix a dependency on CONFIG_NFS_V4 in nfs_remount

Fix the 'nfs4_fs_type' undeclared error in nfs_remount when compiling sans NFSv4...

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com> Cc: Jeff Layton <jlayton@redhat.com>

linux-2.6.26-045-use_common_device_name_parsing_logic_for_nfsv4_and_nfsv2_v3.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Mon, 23 Jun 2008 12:36:37 -0400

NFS: Use common device name parsing logic for NFSv4 and NFSv2/v3

To support passing a raw IPv6 address as a server hostname, we need to expand the logic that handles splitting the passed-in device name into a server hostname and export path

Start by pulling device name parsing out of the mount option validation functions and into separate helper functions.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-046-support_raw_ipv6_address_hostnames_during_nfs_mount_operation.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Mon, 23 Jun 2008 12:36:45 -0400

NFS: Support raw IPv6 address hostnames during NFS mount operation

Traditionally the mount command has looked for a ":" to separate the server's hostname from the export path in the mounted on device name, like this:

mount server:/export /mounted/on/dir

The server's hostname is "server" and the export path is "/export".

You can also substitute a specific IPv4 network address for the server hostname, like this:

mount 192.168.0.55:/export /mounted/on/dir

Raw IPv6 addresses present a problem, however, because they look something like this:

fe80::200:5aff:fe00:30b

Note the use of colons.

To get around the presence of colons, copy the Solaris convention used for mounting IPv6 servers by address: wrap a raw IPv6 address with square brackets.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-047-add_string_length_argument_to_nfs_parse_server_address.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Mon, 23 Jun 2008 12:36:53 -0400

NFS: Add string length argument to nfs_parse_server_address

To make nfs_parse_server_address() more generally useful, allow it to accept input strings that are not terminated with '\0'.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-048-handle_interface_identifiers_in_incoming_ipv6_addresses.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Mon, 23 Jun 2008 12:37:01 -0400

NFS: handle interface identifiers in incoming IPv6 addresses

Add support in the kernel NFS client's address parser for interface identifiers.

IPv6 link-local addresses require an additional "interface identifier", which is a network device name or an integer that indexes the array of local network interfaces. They are suffixed to the address with a '%'. For example:

fe80::215:c5ff:fe3b:e1b2%2

indicates an interface index of 2. Or

fe80::215:c5ff:fe3b:e1b2%eth0

indicates that requests should be routed through the eth0 device. Without the interface ID, link-local addresses are not usable for NFS.

Both the kernel NFS client mount option parser and the mount.nfs command can take either form. The mount.nfs command always passes the address through getnameinfo(3), which usually re-writes interface indices as device names.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-049-fix_uninitialised_var_in_setclientid_confirm.dif:

From: Benny Halevy <bhalevy@panasas.com>

Date: Tue, 24 Jun 2008 20:25:57 +0300

nfs: initialize timeout variable in nfs4_proc_setclientid_confirm

gcc (4.3.0) rightfully warns about this: /usr0/export/dev/bhalevy/git/linux-pnfs-bh-nfs41/fs/nfs/nfs4proc.c: In function ‘nfs4_proc_setclientid_confirm’: /usr0/export/dev/bhalevy/git/linux-pnfs-bh-nfs41/fs/nfs/nfs4proc.c:2936: warning: ‘timeout’ may be used uninitialized in this function

nfs4_delay that's passed a pointer to 'timeout' is looking at its value and sets it up to some value in the range: NFS4_POLL_RETRY_MIN..NFS4_POLL_RETRY_MAX if (*timeout <= 0) *timeout = NFS4_POLL_RETRY_MIN; if (*timeout > NFS4_POLL_RETRY_MAX) *timeout = NFS4_POLL_RETRY_MAX;

Therefore it will end up set to some sane, though rather indeterministic, value.

Signed-off-by: Benny Halevy <bhalevy@panasas.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-050-remove_bkl_from_nfs_callback_up_and_nfs_callback_down.dif:

From: Jeff Layton <jlayton@redhat.com>

Date: Wed, 11 Jun 2008 10:03:10 -0400

nfs4: remove BKL from nfs_callback_up and nfs_callback_down

The nfs_callback_mutex is sufficient protection.

Signed-off-by: Jeff Layton <jlayton@redhat.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-051-fix_potential_race_with_rapid_nfs_callback_up_down_cycle.dif:

From: Jeff Layton <jlayton@redhat.com>

Date: Wed, 11 Jun 2008 10:03:11 -0400

nfs4: fix potential race with rapid nfs_callback_up/down cycle

If the nfsv4 callback thread is rapidly brought up and down, it's possible that nfs_callback_svc might never get a chance to run. If this happens, the cleanup at thread exit might never occur, throwing the refcounting off and nfs_callback_info in an incorrect state.

Move the clean functions into nfs_callback_down. Also change the nfs_callback_info struct to track the svc_rqst rather than svc_serv since we need to know that to call svc_exit_thread.

Signed-off-by: Jeff Layton <jlayton@redhat.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-052-more_useful_debugging_output_for_rpcb_client.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 25 Jun 2008 17:24:23 -0400

SUNRPC: More useful debugging output for rpcb client

Clean up dprintk's in rpcb client's XDR decoder functions.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-053-document_some_naked_integers_in_rpcbind_client.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 25 Jun 2008 17:24:31 -0400

SUNRPC: Document some naked integers in rpcbind client

Clean up: Replace naked integers that represent rpcbind protocol versions.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-054-use_rpcbind_version_2_getport.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 25 Jun 2008 17:24:39 -0400

SUNRPC: Use rpcbind version 2 GETPORT

Clean up: Change the version 2 procedure name to GETPORT. It's the same procedure number as GETADDR, but version 2 implementations usually refer to it as GETPORT.

This also now matches the procedure name used in the version 2 procedure entry in the rpcb_next_version[] array, making it slightly less confusing.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-055-use_getaddr_for_rpcbind_version_4_queries.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 25 Jun 2008 17:24:47 -0400

SUNRPC: Use GETADDR for rpcbind version 4 queries

Some rpcbind servers that do support rpcbind version 4 do not support the GETVERSADDR procedure. Use GETADDR for querying rpcbind servers via rpcbind version 4 instead of GETVERSADDR.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-056-use_only_rpcbind_v2_for_af_inet_requests.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Wed, 25 Jun 2008 17:24:54 -0400

SUNRPC: Use only rpcbind v2 for AF_INET requests

Some server vendors support the higher versions of rpcbind only for AF_INET6. The kernel doesn't need to use v3 or v4 for AF_INET anyway, so change the kernel's rpcbind client to query AF_INET servers over rpcbind v2 only.

This has a few interesting benefits:

1. If the rpcbind request is going over TCP, and the server doesn't support rpcbind versions 3 or 4, the client reduces by two the number of ephemeral ports left in TIME_WAIT for each rpcbind request. This will help during NFS mount storms.

2. The rpcbind interaction with servers that don't support rpcbind versions 3 or 4 will use less network traffic. Also helpful during mount storms.

3. We can eliminate the kernel build option that controls whether the kernel's rpcbind client uses rpcbind version 3 and 4 for AF_INET servers. Less complicated kernel configuration...

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-057-set_transport_defaults_after_mount_option_parsing_is_finished.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Wed Jul 2 14:43:47 2008 -0400

NFS: set transport defaults after mount option parsing is finished

Move the UDP/TCP default timeo/retrans settings for text mounts to nfs_init_timeout_values(), which was were they were always being initialised (and sanity checked) for binary mounts. Document the default timeout values using appropriate #defines.

Ensure that we initialise and sanity check the transport protocols that may have been specified by the user.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-058-move_the_nfs_set_port_call_out_of_nfs_parse_mount_options.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 26 Jun 2008 17:47:05 -0400

NFS: Move the nfs_set_port() call out of nfs_parse_mount_options()

The remount path does not need to set the port in the server address. Since it's not really a part of option parsing, move the nfs_set_port() call to nfs_parse_mount_options()'s callers.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-059-use_documenting_macro_constants_for_initializing_ac_reg_dir_min_max.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Thu, 26 Jun 2008 17:47:12 -0400

NFS: use documenting macro constants for initializing ac{reg, dir}{min, max}

Clean up.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-060-refactor_logic_for_parsing_nfs_security_flavor_mount_options.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Tue, 24 Jun 2008 16:33:38 -0400

NFS: Refactor logic for parsing NFS security flavor mount options

Clean up: Refactor the NFS mount option parsing function to extract the security flavor parsing logic into a separate function.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-061-set_security_flavor_default_for_nfsv2_3_mounts_like_other_defaults.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Tue, 24 Jun 2008 16:33:46 -0400

NFS: Set security flavor default for NFSv2/3 mounts like other defaults

Set the default security flavor when we set the other mount option default values. After this change, only the legacy user-space mount path needs to set the NFS_MOUNT_SECFLAVOUR flag.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-062-set_security_flavor_default_for_nfsv4_mounts_like_other_defaults.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Tue, 24 Jun 2008 16:33:54 -0400

NFS4: Set security flavor default for NFSv4 mounts like other defaults

Set the default security flavor when we set the other mount option default values for NFSv4. This cleans up the NFSv4 mount option parsing path to look like the NFSv2/v3 one.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-063-add_sloppy_mount_option.dif:

From: Chuck Lever <chuck.lever@oracle.com>

Date: Tue, 24 Jun 2008 19:28:02 -0400

NFS: Allow either strict or sloppy mount option parsing

The kernel's NFS client mount option parser currently doesn't allow unrecognized or incorrect mount options. This prevents misspellings or incorrectly specified mount options from possibly causing silent data corruption.

However, NFS mount options are not standardized, so different operating systems can use differently spelled mount options to support similar features, or can support mount options which no other operating system supports.

"Sloppy" mount option parsing, which allows the parser to ignore any option it doesn't recognize, is needed to support automounters that often use maps that are shared between heterogenous operating systems.

The legacy mount command ignores the validity of the values of mount options entirely, except for the "sec=" and "proto=" options. If an incorrect value is specified, the out-of-range value is passed to the kernel; if a value is specified that contains non-numeric characters, it appears as though the legacy mount command sets that option to zero (probably incorrect behavior in general).

In any case, this sets a precedent which we will partially follow for the kernel mount option parser:

+ if "sloppy" is not set, the parser will be strict about both unrecognized options (same as legacy) and invalid option values (stricter than legacy)

+ if "sloppy" is set, the parser will ignore unrecognized options and invalid option values (same as legacy)

An "invalid" option value in this case means that either the type (integer, short, or string) or sign (for integer values) of the specified value is incorrect.

This patch does two things: it changes the NFS client's mount option parsing loop so that it parses the whole string instead of failing at the first unrecognized option or invalid option value. An unrecognized option or an invalid option value cause the option to be skipped.

Then, the patch adds a "sloppy" mount option that allows the parsing to succeed anyway if there were any problems during parsing. When parsing a set of options is complete, if there are errors and "sloppy" was specified, return success anyway. Otherwise, only return success if there are no errors.

Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

linux-2.6.26-064-optimise_rpcbind.dif:

From: Trond Myklebust <Trond.Myklebust@netapp.com>

Date: Mon, 7 Jul 2008 12:18:53 -0400

SUNRPC: Ensure our task is notified when an rpcbind call is done

If another task is busy in rpcb_getport_async number, it is more efficient to have it wake us up when it has finished instead of arbitrarily sleeping for 5 seconds.

Also ensure that rpcb_wake_rpcbind_waiters() is called regardless of whether or not rpcb_getport_done() gets called.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

[ICO]NameLast modifiedSizeDescription

[PARENTDIR]Parent Directory   -  
[TXT]linux-2.6.26-001-fix..>2008-07-07 17:37 1.3K 
[TXT]linux-2.6.26-002-fix..>2008-07-07 17:37 1.1K 
[TXT]linux-2.6.26-003-fix..>2008-07-07 17:37 3.2K 
[TXT]linux-2.6.26-004-red..>2008-06-26 19:53 6.9K 
[TXT]linux-2.6.26-005-red..>2008-06-26 19:53 10K 
[TXT]linux-2.6.26-006-fix..>2008-06-26 19:53 1.5K 
[TXT]linux-2.6.26-007-add..>2008-06-12 19:37 3.5K 
[TXT]linux-2.6.26-008-exi..>2008-06-12 19:37 1.2K 
[TXT]linux-2.6.26-009-eno..>2008-06-12 19:37 1.0K 
[TXT]linux-2.6.26-010-opt..>2008-06-12 19:37 2.5K 
[TXT]linux-2.6.26-011-rev..>2008-06-12 19:37 2.2K 
[TXT]linux-2.6.26-012-use..>2008-06-12 19:37 5.1K 
[TXT]linux-2.6.26-013-loc..>2008-06-12 19:37 720  
[TXT]linux-2.6.26-014-upd..>2008-07-07 17:37 7.8K 
[TXT]linux-2.6.26-015-add..>2008-06-12 19:37 1.9K 
[TXT]linux-2.6.26-016-ren..>2008-06-12 19:37 4.5K 
[TXT]linux-2.6.26-017-don..>2008-06-12 19:37 1.9K 
[TXT]linux-2.6.26-018-ref..>2008-06-12 19:37 2.2K 
[TXT]linux-2.6.26-019-dis..>2008-06-12 19:37 3.5K 
[TXT]linux-2.6.26-020-mak..>2008-07-07 17:37 2.9K 
[TXT]linux-2.6.26-021-mak..>2008-07-07 17:37 1.8K 
[TXT]linux-2.6.26-022-mak..>2008-07-07 17:37 1.4K 
[TXT]linux-2.6.26-023-add..>2008-06-12 19:37 3.1K 
[TXT]linux-2.6.26-024-use..>2008-07-07 17:37 8.6K 
[TXT]linux-2.6.26-025-fix..>2008-06-12 19:37 3.6K 
[TXT]linux-2.6.26-026-rem..>2008-06-12 19:37 1.4K 
[TXT]linux-2.6.26-027-nfs..>2008-06-12 19:37 738  
[TXT]linux-2.6.26-028-che..>2008-06-26 19:53 4.5K 
[TXT]linux-2.6.26-029-bri..>2008-06-12 19:37 4.1K 
[TXT]linux-2.6.26-030-eli..>2008-06-26 19:53 1.4K 
[TXT]linux-2.6.26-031-rem..>2008-06-26 19:53 1.4K 
[TXT]linux-2.6.26-032-min..>2008-06-26 19:53 1.9K 
[   ]linux-2.6.26-033-fix..>2008-06-26 19:53 6.2K 
[TXT]linux-2.6.26-034-set..>2008-06-12 19:37 3.5K 
[TXT]linux-2.6.26-035-rem..>2008-06-26 19:53 3.1K 
[TXT]linux-2.6.26-036-add..>2008-06-12 19:37 10K 
[TXT]linux-2.6.26-037-fix..>2008-06-26 19:53 2.2K 
[TXT]linux-2.6.26-038-fix..>2008-06-26 19:53 3.5K 
[TXT]linux-2.6.26-039-all..>2008-06-26 19:53 1.3K 
[TXT]linux-2.6.26-040-tre..>2008-06-26 19:53 1.5K 
[TXT]linux-2.6.26-041-mis..>2008-06-26 19:53 742  
[TXT]linux-2.6.26-042-cle..>2008-06-26 19:53 7.8K 
[TXT]linux-2.6.26-043-def..>2008-06-26 19:53 5.8K 
[TXT]linux-2.6.26-044-fix..>2008-06-26 19:53 1.4K 
[TXT]linux-2.6.26-045-use..>2008-06-26 19:53 5.1K 
[TXT]linux-2.6.26-046-sup..>2008-06-26 19:53 3.8K 
[TXT]linux-2.6.26-047-add..>2008-06-26 19:53 4.9K 
[TXT]linux-2.6.26-048-han..>2008-06-26 19:53 3.1K 
[TXT]linux-2.6.26-049-fix..>2008-06-26 19:53 1.4K 
[TXT]linux-2.6.26-050-rem..>2008-06-26 19:53 1.3K 
[TXT]linux-2.6.26-051-fix..>2008-06-26 19:53 3.0K 
[TXT]linux-2.6.26-052-mor..>2008-07-07 17:37 1.1K 
[TXT]linux-2.6.26-053-doc..>2008-07-07 17:37 3.1K 
[TXT]linux-2.6.26-054-use..>2008-07-07 17:37 1.0K 
[TXT]linux-2.6.26-055-use..>2008-07-07 17:37 1.4K 
[TXT]linux-2.6.26-056-use..>2008-07-07 17:37 3.3K 
[TXT]linux-2.6.26-057-set..>2008-07-07 17:37 6.7K 
[TXT]linux-2.6.26-058-mov..>2008-07-07 17:37 1.4K 
[TXT]linux-2.6.26-059-use..>2008-07-07 17:37 3.7K 
[TXT]linux-2.6.26-060-ref..>2008-07-07 17:37 5.1K 
[TXT]linux-2.6.26-061-set..>2008-07-07 17:37 3.7K 
[TXT]linux-2.6.26-062-set..>2008-07-07 17:37 1.9K 
[TXT]linux-2.6.26-063-add..>2008-07-07 17:37 11K 
[TXT]linux-2.6.26-064-opt..>2008-07-07 17:37 4.6K 
[   ]series 2008-07-07 17:37 3.8K 

Apache/2.4.6 (CentOS) OpenSSL/1.0.2k-fips PHP/5.4.16 mod_perl/2.0.11 Perl/v5.16.3 Server at linux-nfs.org Port 80