How To Remove Entire Column If A Particular Row Has Duplicate Values In A Dataframe In Python

December 12, 2023 Post a Comment

I have a dataframe like this, df, Name City 0 sri chennai 1 pedhci pune 2 bahra pune there is a duplicate in City column. I tried: df['City'

Solution 1:

You can use:

df2 = df.drop_duplicates(subset='City')

if you want to store the result in a new dataframe, or:

df.drop_duplicates(subset='City',inplace=True)

if you want to update df.

This produces:

>>>df
      City    Name
0  chennai     sri
1     pune  pedhci
2     pune   bahra
>>>df.drop_duplicates(subset='City')
      City    Name
0  chennai     sri
1     pune  pedhci

This will thus only take duplicates for City into account, duplicates in Name are ignored.

Python Guru

How To Remove Entire Column If A Particular Row Has Duplicate Values In A Dataframe In Python

Solution 1:

Post a Comment for "How To Remove Entire Column If A Particular Row Has Duplicate Values In A Dataframe In Python"