117 lines
4.1 KiB
ReStructuredText
117 lines
4.1 KiB
ReStructuredText
===========================
|
|
InfiniBand Midlayer Locking
|
|
===========================
|
|
|
|
This guide is an attempt to make explicit the locking assumptions
|
|
made by the InfiniBand midlayer. It describes the requirements on
|
|
both low-level drivers that sit below the midlayer and upper level
|
|
protocols that use the midlayer.
|
|
|
|
Sleeping and interrupt context
|
|
==============================
|
|
|
|
With the following exceptions, a low-level driver implementation of
|
|
all of the methods in struct ib_device may sleep. The exceptions
|
|
are any methods from the list:
|
|
|
|
- create_ah
|
|
- modify_ah
|
|
- query_ah
|
|
- destroy_ah
|
|
- post_send
|
|
- post_recv
|
|
- poll_cq
|
|
- req_notify_cq
|
|
|
|
which may not sleep and must be callable from any context.
|
|
|
|
The corresponding functions exported to upper level protocol
|
|
consumers:
|
|
|
|
- rdma_create_ah
|
|
- rdma_modify_ah
|
|
- rdma_query_ah
|
|
- rdma_destroy_ah
|
|
- ib_post_send
|
|
- ib_post_recv
|
|
- ib_req_notify_cq
|
|
|
|
are therefore safe to call from any context.
|
|
|
|
In addition, the function
|
|
|
|
- ib_dispatch_event
|
|
|
|
used by low-level drivers to dispatch asynchronous events through
|
|
the midlayer is also safe to call from any context.
|
|
|
|
Reentrancy
|
|
----------
|
|
|
|
All of the methods in struct ib_device exported by a low-level
|
|
driver must be fully reentrant. The low-level driver is required to
|
|
perform all synchronization necessary to maintain consistency, even
|
|
if multiple function calls using the same object are run
|
|
simultaneously.
|
|
|
|
The IB midlayer does not perform any serialization of function calls.
|
|
|
|
Because low-level drivers are reentrant, upper level protocol
|
|
consumers are not required to perform any serialization. However,
|
|
some serialization may be required to get sensible results. For
|
|
example, a consumer may safely call ib_poll_cq() on multiple CPUs
|
|
simultaneously. However, the ordering of the work completion
|
|
information between different calls of ib_poll_cq() is not defined.
|
|
|
|
Callbacks
|
|
---------
|
|
|
|
A low-level driver must not perform a callback directly from the
|
|
same callchain as an ib_device method call. For example, it is not
|
|
allowed for a low-level driver to call a consumer's completion event
|
|
handler directly from its post_send method. Instead, the low-level
|
|
driver should defer this callback by, for example, scheduling a
|
|
tasklet to perform the callback.
|
|
|
|
The low-level driver is responsible for ensuring that multiple
|
|
completion event handlers for the same CQ are not called
|
|
simultaneously. The driver must guarantee that only one CQ event
|
|
handler for a given CQ is running at a time. In other words, the
|
|
following situation is not allowed::
|
|
|
|
CPU1 CPU2
|
|
|
|
low-level driver ->
|
|
consumer CQ event callback:
|
|
/* ... */
|
|
ib_req_notify_cq(cq, ...);
|
|
low-level driver ->
|
|
/* ... */ consumer CQ event callback:
|
|
/* ... */
|
|
return from CQ event handler
|
|
|
|
The context in which completion event and asynchronous event
|
|
callbacks run is not defined. Depending on the low-level driver, it
|
|
may be process context, softirq context, or interrupt context.
|
|
Upper level protocol consumers may not sleep in a callback.
|
|
|
|
Hot-plug
|
|
--------
|
|
|
|
A low-level driver announces that a device is ready for use by
|
|
consumers when it calls ib_register_device(), all initialization
|
|
must be complete before this call. The device must remain usable
|
|
until the driver's call to ib_unregister_device() has returned.
|
|
|
|
A low-level driver must call ib_register_device() and
|
|
ib_unregister_device() from process context. It must not hold any
|
|
semaphores that could cause deadlock if a consumer calls back into
|
|
the driver across these calls.
|
|
|
|
An upper level protocol consumer may begin using an IB device as
|
|
soon as the add method of its struct ib_client is called for that
|
|
device. A consumer must finish all cleanup and free all resources
|
|
relating to a device before returning from the remove method.
|
|
|
|
A consumer is permitted to sleep in its add and remove methods.
|