I have a dataset whereby duplicates can be found. I find the duplicates via matching matchID and the player_name column. However, after finding this information, how do I delete this group of duplicates from the dataset?
SELECT
*
FROM
project.worldcup_players;
SELECT
MatchID, Player_Name, COUNT(*) AS NumDuplicates
FROM
project.worldcup_players
GROUP BY
MatchID, Player_Name
HAVING
NumDuplicates > 1;