habanalabs: increase timeout during reset

[ Upstream commit 7a65ee046b2238e053f6ebb610e1a082cfc49490 ]

When doing training, the DL framework (e.g. TensorFlow) performs hundreds of thousands of memory allocations and mappings. If the driver needs to perform a hard reset during training, it kills the application and unmaps all of those memory allocations.

Unfortunately, because of the large number of mappings, the driver cannot finish unmapping within the current timeout (5 seconds). Therefore, increase the timeout significantly, to 30 seconds, to avoid a situation where the driver resets the device with active mappings, which can sometimes cause a kernel bug.

Note that this does not mean the reset will always take the full 30 seconds, because the reset thread checks once per second whether the unmap operation is done.

Reviewed-by: Omer Shpigelman <oshpigelman@habana.ai>
Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: Sasha Levin <sashal@kernel.org>
author: Oded Gabbay <oded.gabbay@gmail.com> 2020-03-27 16:38:37 +0300
committer: Greg Kroah-Hartman <gregkh@linuxfoundation.org> 2020-06-24 17:50:28 +0200
commit: 5c2207ba2394ee6c2dd7383890818aca89ff4b9b (patch)
tree: 2b05c8b8b14366738c3e2ad0e79e98624cd6e8b8 /drivers/misc
parent: 828b192c57e8f4fee77f7a34bd19c1b58b049dad (diff)
1 file changed, 1 insertion, 1 deletion
diff --git a/drivers/misc/habanalabs/habanalabs.h b/drivers/misc/habanalabs/habanalabs.h
index 75862be53c60..30addffd76f5 100644
--- a/drivers/misc/habanalabs/habanalabs.h
+++ b/drivers/misc/habanalabs/habanalabs.h
@@ -23,7 +23,7 @@
 
 #define HL_MMAP_CB_MASK			(0x8000000000000000ull >> PAGE_SHIFT)
 
-#define HL_PENDING_RESET_PER_SEC	5
+#define HL_PENDING_RESET_PER_SEC	30
 
 #define HL_DEVICE_TIMEOUT_USEC		1000000 /* 1 s */
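
The commit message describes a wait that is bounded by a budget in seconds but polled once per second, so a fast unmap finishes early and only a slow one uses the whole budget. The following is a minimal user-space sketch of that pattern, not the driver's actual reset code. The names `wait_for_unmap`, `unmap_done_fn`, `sleep_1s_fn`, and `fake_unmap_done` are hypothetical and introduced only for illustration; only `HL_PENDING_RESET_PER_SEC` and its value come from the patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Value set by this patch (previously 5). */
#define HL_PENDING_RESET_PER_SEC 30

/* Hypothetical callback: returns true once all mappings are released. */
typedef bool (*unmap_done_fn)(void *ctx);

/* Hypothetical one-second sleep hook; may be NULL to skip sleeping. */
typedef void (*sleep_1s_fn)(void);

/*
 * Poll `done` once per second for at most `budget_sec` seconds.
 * Returns the number of seconds waited on success, or -1 if the
 * budget ran out while mappings were still active.
 */
static int wait_for_unmap(unmap_done_fn done, void *ctx,
                          sleep_1s_fn sleep_1s, int budget_sec)
{
    assert(done != NULL);

    for (int waited = 0; waited < budget_sec; waited++) {
        if (done(ctx))
            return waited;      /* finished early, stop waiting */
        if (sleep_1s)
            sleep_1s();
    }

    /* One last check after the final sleep. */
    return done(ctx) ? budget_sec : -1;
}

/*
 * Illustrative stand-in for the unmap check: `ctx` points to an int
 * holding how many more polls will still report "not done".
 */
static bool fake_unmap_done(void *ctx)
{
    int *remaining = ctx;

    if (*remaining == 0)
        return true;
    (*remaining)--;
    return false;
}
```

With the stand-in above, an unmap that needs 7 seconds times out under a 5-second budget but completes after 7 seconds under `HL_PENDING_RESET_PER_SEC`, without waiting for the remaining 23.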