如何在MySQL表中找到重複的值

2019-10-16 22:56:46

在本教學中,您將學習如何在MySQL中找到一個或多個列的重複值。

在開始之前

由於原因很多,資料庫中的重複事件發生很多。查詢重複值是使用資料庫時必須處理的重要任務之一。

對於演示,我們將建立一個名為contacts表,其中包含四個列:idfirst_namelast_nameemail

USE testdb;

CREATE TABLE contacts (
    id INT PRIMARY KEY AUTO_INCREMENT,
    first_name VARCHAR(50) NOT NULL,
    last_name VARCHAR(50) NOT NULL,
    email VARCHAR(255) NOT NULL
);

以下語句將行插入到contacts表中:

INSERT INTO contacts (first_name,last_name,email) 
VALUES ('Carine ','Schmitt','[email protected]'),
       ('Jean','King','[email protected]'),
       ('Peter','Ferguson','[email protected]'),
       ('Janine ','Labrune','[email protected]'),
       ('Jonas ','Bergulfsen','[email protected]'),
       ('Janine ','Labrune','[email protected]'),
       ('Susan','Nelson','[email protected]'),
       ('Zbyszek ','Piestrzeniewicz','[email protected]'),
       ('Roland','Keitel','[email protected]'),
       ('Julie','Murphy','[email protected]'),
       ('Kwai','Lee','[email protected]'),
       ('Jean','King','[email protected]'),
       ('Susan','Nelson','[email protected]'),
       ('Roland','Keitel','[email protected]');

然後,查詢表中的資料如下 -

SELECT 
    *
FROM
    contacts;

執行上面查詢,得到以下結果 -

+----+------------+-----------------+--------------------------------+
| id | first_name | last_name       | email                          |
+----+------------+-----------------+--------------------------------+
|  1 | Carine     | Schmitt         | [email protected]          |
|  2 | Jean       | King            | [email protected]               |
|  3 | Peter      | Ferguson        | [email protected]      |
|  4 | Janine     | Labrune         | [email protected]         |
|  5 | Jonas      | Bergulfsen      | [email protected]       |
|  6 | Janine     | Labrune         | [email protected]         |
|  7 | Susan      | Nelson          | [email protected]            |
|  8 | Zbyszek    | Piestrzeniewicz | [email protected] |
|  9 | Roland     | Keitel          | [email protected]        |
| 10 | Julie      | Murphy          | [email protected]         |
| 11 | Kwai       | Lee             | [email protected]            |
| 12 | Jean       | King            | [email protected]               |
| 13 | Susan      | Nelson          | [email protected]           |
| 14 | Roland     | Keitel          | [email protected]        |
+----+------------+-----------------+--------------------------------+
14 rows in set

contacts表中,有一些行在first_namelast_nameemail列中具有重複的值,下面來看看如何查詢它們。

在一列中找到重複的值

在基於一列的表中找到重複值,則使用以下語句:

SELECT 
    col, 
    COUNT(col)
FROM
    table_name
GROUP BY col
HAVING COUNT(col) > 1;

如果表中出現多個值,則該值將被視為重複。在這個語句中,使用COUNT函式的GROUP BY子句來計算指定列(col)的值。HAVING子句中的條件僅包含值count大於1的行,這些行是重複的行。

可以使用此查詢在contacts表中查詢具有重複email的所有行,如下所示:

SELECT 
    email, 
    COUNT(email)
FROM
    contacts
GROUP BY email
HAVING COUNT(email) > 1;

以下顯示查詢的輸出:

+-------------------------+--------------+
| email                   | COUNT(email) |
+-------------------------+--------------+
| [email protected]  |            2 |
| [email protected] |            2 |
+-------------------------+--------------+
2 rows in set

如上查詢結果中可以看到,有一些行具有相同的電子郵件。

在多個列中查詢重複值

有時,希望基於多個列而不是一個查詢重複。在這種情況下,您可以使用以下查詢:

SELECT 
    col1, COUNT(col1),
    col2, COUNT(col2),
    ...

FROM
    table_name
GROUP BY 
    col1, 
    col2, ...
HAVING 
       (COUNT(col1) > 1) AND 
       (COUNT(col2) > 1) AND 
       ...

只有當列的組合重複時,行才被認為是重複的,所以在HAVING子句中使用了AND運算子。

例如,要使用first_namelast_nameemail列中的重複值在contacts表中查詢行,請使用以下查詢:

SELECT 
    first_name, COUNT(first_name),
    last_name,  COUNT(last_name),
    email,      COUNT(email)
FROM
    contacts
GROUP BY 
    first_name , 
    last_name , 
    email
HAVING  COUNT(first_name) > 1
    AND COUNT(last_name) > 1
    AND COUNT(email) > 1;

執行上面查詢後,得到以下輸出:

+------------+-------------------+-----------+------------------+-------------------------+--------------+
| first_name | COUNT(first_name) | last_name | COUNT(last_name) | email                   | COUNT(email) |
+------------+-------------------+-----------+------------------+-------------------------+--------------+
| Janine     |                 2 | Labrune   |                2 | [email protected]  |            2 |
| Roland     |                 2 | Keitel    |                2 | [email protected] |            2 |
+------------+-------------------+-----------+------------------+-------------------------+--------------+
2 rows in set

在本教學中,您已經學會了如何根據MySQL中一個或多個列的值來找到重複的行。