SQL 从一列而不是另一列中选择所有值的有效方法

Question

提问by Flash

I need to return all values from colAthat are not in colBfrom mytable. I am using:

我需要返回colA不在colBfrom 中的所有值mytable。我在用：

SELECT DISTINCT(colA) FROM mytable WHERE colA NOT IN (SELECT colB FROM mytable)

It is working however the query is taking an excessively long time to complete.

它正在工作，但是查询需要很长时间才能完成。

Is there a more efficient way to do this?

有没有更有效的方法来做到这一点？

Answer 1

回答by Erwin Brandstetter

In standard SQL there are no parenthesesin DISTINCT colA. DISTINCTis not a function.

在标准 SQL中，DISTINCT colA. DISTINCT不是函数。

SELECT DISTINCT colA
FROM   mytable
WHERE  colA NOT IN (SELECT DISTINCT colB FROM mytable);

Added DISTINCTto the sub-select as well. If you have many duplicates it could speed up the query.

也添加DISTINCT到子选择中。如果您有很多重复项，它可以加快查询速度。

A CTE might be faster, depending on your DBMS. I additionally demonstrate LEFT JOINas alternative to exclude the values in valB, and an alternative way to get distinct values with GROUP BY:

CTE 可能更快，具体取决于您的 DBMS。我还演示了LEFT JOIN作为排除中值valB的替代方法，以及使用获得不同值的替代方法GROUP BY：

WITH x AS (SELECT colB FROM mytable GROUP BY colB)
SELECT m.colA
FROM   mytable m
LEFT   JOIN x ON x.colB = m.colA
WHERE  x.colB IS NULL
GROUP  BY m.colA;

Or, simplified further, and with a plain subquery (probably fastest):

或者，进一步简化，并使用简单的子查询（可能最快）：

SELECT DISTINCT m.colA
FROM   mytable m
LEFT   JOIN mytable x ON x.colB = m.colA
WHERE  x.colB IS NULL;

There are basically 4 techniquesto exclude rows with keys present in another (or the same) table:

有基本上4种技术来排除与存在于另一键（或相同）的表中的行：

Select rows which are not present in other table

选择其他表中不存在的行

The deciding factor for speed will be indexes. You need to have indexes on colAand colBfor this query to be fast.

速度的决定因素将是索引。您需要有索引，colA并且colB此查询要快速。

Answer 2

回答by Eric

You can use exists:

您可以使用exists：

select distinct
    colA
from
    mytable m1
where
    not exists (select 1 from mytable m2 where m2.colB = m1.colA)

existsdoes a semi-join to quickly match the values. not incompletes the entire result set and then does an oron it. existsis typically faster for values in tables.

exists执行半连接以快速匹配值。not in完成整个结果集，然后or对其进行处理。exists对于表中的值通常更快。

Answer 3

回答by Paul

You can use the EXCEPToperator which effectively diffs two SELECTqueries. EXCEPT DISTINCTwill return only unique values. Oracle's MINUSoperator is equivalent to EXCEPT DISTINCT.

您可以使用EXCEPT有效区分两个SELECT查询的运算符。 EXCEPT DISTINCT将只返回唯一值。Oracle 的MINUS运算符相当于EXCEPT DISTINCT.

SQL 从一列而不是另一列中选择所有值的有效方法

提问by Flash

回答by Erwin Brandstetter

回答by Eric

回答by Paul

相关推荐

最近更新

标签

SQL 从一列而不是另一列中选择所有值的有效方法

提问by Flash

回答by Erwin Brandstetter

回答by Eric

回答by Paul

相关推荐

SQL SQL中的表扫描和索引扫描

SQL 内连接两个以上的表

如何找出哪个用户执行了 SQL 语句？

SQL GETUTCDATE 函数

相关推荐

最近更新

标签