防止 SQL Server 中的死锁

Question

提问by Jussi Kosunen

I have an application connected to a SQL Server 2014 database that combines several rows into one. There are no other connections to this database while the application is running.

我有一个连接到 SQL Server 2014 数据库的应用程序，该数据库将多行合并为一行。应用程序运行时，没有与此数据库的其他连接。

First, select a chunk of rows within a specific time span. This query uses a non-clustered seek (TIME column) merged with a clustered lookup.

首先，在特定时间跨度内选择一组行。此查询使用与集群查找合并的非集群查找（TIME 列）。

select ...
from FOO
where TIME >= @from and TIME < @to and ...

Then, we process these rows in c# and write changes as a single update and multiple deletes, this happens many times per chunk. These also use non-clustered index seeks.

然后，我们在 c# 中处理这些行并将更改写入单个更新和多个删除，这在每个块中发生多次。这些也使用非聚集索引查找。

begin tran

update FOO set ...
where NON_CLUSTERED_ID = @id

delete FOO where NON_CLUSTERED_ID in (@id1, @id2, @id3, ...)

commit

I am getting deadlocks when running this with multiple parallel chunks. I tried using ROWLOCKfor the updateand deletebut that caused even more deadlocks than before for some reason, even though there are no overlaps between chunks.

使用多个并行块运行此程序时出现死锁。我尝试使用ROWLOCKfor updateanddelete但由于某种原因导致比以前更多的死锁，即使块之间没有重叠。

Then I tried TABLOCKX, HOLDLOCKon the update, but that means I can't perform my selectin parallel so I'm losing the advantages of parallelism.

然后我尝试TABLOCKX, HOLDLOCK了update，但这意味着我无法select并行执行我的操作，因此我失去了并行性的优势。

Any idea how I can avoid deadlocks but still process multiple parallel chunks?

知道如何避免死锁但仍处理多个并行块吗？

Would it be safe to use NOLOCKon my selectin this case, given there is no row overlap between chunks? Then TABLOCKX, HOLDLOCKwould only block the updateand delete, correct?

鉴于块之间没有行重叠，在这种情况下NOLOCK在 my上使用是否安全select？那么TABLOCKX, HOLDLOCK只会阻止updateand delete，对吗？

Or should I just accept that deadlocks will happen and retry the query in my application?

或者我应该接受死锁会发生并在我的应用程序中重试查询？

UPDATE(additional information): All deadlocks so far have happened in the updateand deletephase, none in the select. I'll try to get some deadlock logs up if I can't get this solved today (the correct trace flags weren't enabled before).

更新（附加信息）：到目前为止，所有死锁都发生在update和delete阶段，没有发生在select. 如果我今天不能解决这个问题（之前没有启用正确的跟踪标志），我将尝试获取一些死锁日志。

UPDATE: These are the two arrangements of deadlocks that occur with ROWLOCK, they both refer only to the deletestatement and the non-clustered index it uses. I'm not sure if these are the same as the deadlocks that occur without any table hints as I wasn't able to reproduce any of those.

UPDATE：这是与一起发生的死锁的两种安排ROWLOCK，它们都只涉及delete语句和它使用的非聚集索引。我不确定这些是否与没有任何表提示的死锁相同，因为我无法重现其中任何一个。

Ask if there's anything else needed from the .xdl, I'm a bit weary of attaching the whole thing.

询问 .xdl 是否还需要其他任何内容，我对附加整个内容有点厌倦。

Answer 1

回答by TT.

The general advice regarding deadlocks: make sure you do everything in the same order, i.e. acquire locks in the same order, for different processes.

关于死锁的一般建议：确保以相同的顺序执行所有操作，即为不同的进程以相同的顺序获取锁。

You can find the same advice in this technical article on microsoft.com regarding Minimizing Deadlocks. There's a good reason it is listed first.

您可以在 microsoft.com 上的这篇关于最小化死锁的技术文章中找到相同的建议。它首先被列出是有充分理由的。

Access objects in the same order.
Avoid user interaction in transactions.
Keep transactions short and in one batch.
Use a lower isolation level.
Use a row versioning-based isolation level.
Set READ_COMMITTED_SNAPSHOT database option ON to enable read-committed transactions to use row versioning.
Use snapshot isolation.
Use bound connections.

以相同的顺序访问对象。
避免交易中的用户交互。
保持交易简短和一批。
使用较低的隔离级别。
使用基于行版本控制的隔离级别。
将 READ_COMMITTED_SNAPSHOT 数据库选项设置为 ON 以启用读提交事务以使用行版本控制。
使用快照隔离。
使用绑定连接。

Update after question from Cato:

卡托提问后更新：

How would acquiring locks in the same order apply here? Have you got any advice on how he would change his SQL to do that?

在这里如何以相同的顺序获取锁？你对他如何改变他的 SQL 有什么建议吗？

Deadlocks are always the same, no matter what environment: two processes (say A& B) acquire multiple locks (say X& Y) in a different order so that Ais waiting for Yand Bis waiting for Xwhile Ais holding Xand Bis holding Y.

无论什么环境，死锁总是相同的：两个进程（比如A& B）以不同的顺序获取多个锁（比如X& Y），因此A正在等待Y和B正在等待X而A正在持有X和B正在持有Y。

It applies here because DELETEand UPDATEstatements implicitely acquire locks on the rows or index range or table (depending on what the engine deems appropriate).

它适用于此处，因为DELETEandUPDATE语句隐式地获取行或索引范围或表上的锁（取决于引擎认为合适的内容）。

You should analyze your process and see if there are scenarios where locks could be acquired in a different order. If that doesn't reveal anything, you can analyze deadlocks using the SQL Server Profiler:

您应该分析您的流程，看看是否存在可以以不同顺序获取锁的场景。如果这没有显示任何内容，您可以使用 SQL Server Profiler 分析死锁：

To trace deadlock events, add the Deadlock graph event class to a trace. This event class populates the TextData data column in the trace with XML data about the process and objects that are involved in the deadlock. SQL Server Profiler can extract the XML document to a deadlock XML (.xdl) file which you can view later in SQL Server Management Studio. You can configure SQL Server Profiler to extract Deadlock graph events to a single file that contains all Deadlock graph events, or to separate files.

要跟踪死锁事件，请将死锁图事件类添加到跟踪中。此事件类使用有关死锁中涉及的进程和对象的 XML 数据填充跟踪中的 TextData 数据列。SQL Server Profiler 可以将 XML 文档提取到死锁 XML (.xdl) 文件中，稍后您可以在 SQL Server Management Studio 中查看该文件。您可以将 SQL Server Profiler 配置为将死锁图事件提取到包含所有死锁图事件的单个文件中，或提取到单独的文件中。

Answer 2

回答by Vladimir Baranov

I'd use sp_getapplockin the updating transaction to prevent multiple instances of this code running in parallel. This will not block the selecting statement as table locking hints do.

我会sp_getapplock在更新事务中使用以防止此代码的多个实例并行运行。这不会像表锁定提示那样阻塞选择语句。

You still should program the retrying logic, because it may take a while to acquire the lock, longer than the timeout parameter.

您仍然应该编写重试逻辑，因为获取锁可能需要一段时间，比超时参数要长。

This is how the updating transaction can be wrapped into sp_getapplock.

这就是更新事务如何包装到sp_getapplock.

BEGIN TRANSACTION;
BEGIN TRY

    DECLARE @VarLockResult int;
    EXEC @VarLockResult = sp_getapplock
        @Resource = 'some_unique_name_app_lock',
        @LockMode = 'Exclusive',
        @LockOwner = 'Transaction',
        @LockTimeout = 60000,
        @DbPrincipal = 'public';

    IF @VarLockResult >= 0
    BEGIN
        -- Acquired the lock
        update FOO set ...
        where NON_CLUSTERED_ID = @id

        delete FOO where NON_CLUSTERED_ID in (@id1, @id2, @id3, ...)

    END ELSE BEGIN
        -- return some error code, so that the caller could retry
    END;

    COMMIT TRANSACTION;
END TRY
BEGIN CATCH
    ROLLBACK TRANSACTION;
    -- handle the error
END CATCH;

The selecting statement doesn't need any changes.

选择语句不需要任何更改。

I would recommend against NOLOCK, even though you say that IDs in chunks do not overlap. With this hint the SELECT query can skip some pages that are being changed, it can read some pages twice. It is unlikely that such behavior can be tolerated.

我建议不要使用NOLOCK，即使您说块中的 ID 不重叠。有了这个提示，SELECT 查询可以跳过一些正在更改的页面，它可以读取一些页面两次。这种行为不太可能被容忍。

防止 SQL Server 中的死锁

提问by Jussi Kosunen

回答by TT.

回答by Vladimir Baranov

相关推荐

最近更新

标签

防止 SQL Server 中的死锁

提问by Jussi Kosunen

回答by TT.

回答by Vladimir Baranov

相关推荐

SQL CASE [Column] WHEN IN ('case1', 'case2') THEN 'oops' END？

SQL 我们可以在没有 else 语句的情况下编写 case 语句吗

SQL 如何提高数据库的性能？

SQL 如何 INNER JOIN 4 个表？

相关推荐

最近更新

标签