[IB] mthca: first pass at catastrophic error reporting
authorRoland Dreier <rolandd@cisco.com>
Thu, 27 Oct 2005 18:03:38 +0000 (11:03 -0700)
committerRoland Dreier <rolandd@cisco.com>
Thu, 27 Oct 2005 18:03:38 +0000 (11:03 -0700)
commit3d155f8cd0d077938d271225d26ee52f8eb26082
tree28f65ed2dc5e9f12884daf2a97f50367f145d8a0
parent7cc656efb560cda66b5ed48444cad7556ea4fe99
[IB] mthca: first pass at catastrophic error reporting

Add some initial support for detecting and reporting catastrophic
errors reported by Mellanox HCAs.  We start a periodic timer which
polls the catastrophic error reporting buffer in device memory.  If an
error is detected, we dump the contents of the buffer for port-mortem
debugging, and report a fatal asynchronous error to higher levels.

In the future we can try to recover from these errors by resetting the
device, but this will require some work in higher-level code as well.
Let's get this in now, so that we at least get catastrophic errors
reported in logs.

Signed-off-by: Roland Dreier <rolandd@cisco.com>
drivers/infiniband/hw/mthca/Makefile
drivers/infiniband/hw/mthca/mthca_catas.c [new file with mode: 0644]
drivers/infiniband/hw/mthca/mthca_cmd.c
drivers/infiniband/hw/mthca/mthca_dev.h
drivers/infiniband/hw/mthca/mthca_provider.c