How do I delete duplicates from a DataFrame? I have used drop_duplicates() but it still keeps 1 copy of the row. I want to delete all traces of the duplicate.
df:
Name Age Sex
0 James 24 Male
1 Alice 28 Female
2 Phil 40 Male
3 James 24 Male
code snip:
data = {"Name": ["James", "Alice", "Phil", "James"],
"Age": [24, 28, 40, 24],
"Sex": ["Male", "Female", "Male", "Male"]}
df = pd.DataFrame(data)
df output desired:
Name Age Sex
1 Alice 28 Female
2 Phil 40 Male