print(len(df))
264764
df = df.merge(df2[['id','Re']], on=['id'], suffixes=('', '_fwd1'), how='left')
print(len(df))
2714684

Why does this merge produce more rows than df originally had? Even with how='inner', which should keep only the intersection of keys, I get more observations than I started with.
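A likely cause, assuming `id` is not unique in df2 (its contents aren't shown here): merge emits one output row per matching pair of keys, so a row in df whose `id` appears several times in df2 is repeated once per match. That would fit the roughly tenfold growth above. A minimal sketch with made-up toy frames (`left`, `right`, and their values are hypothetical stand-ins for df and df2):

```python
import pandas as pd

# Hypothetical toy data: id 1 appears twice in the right-hand frame.
left = pd.DataFrame({"id": [1, 2, 3], "x": [10, 20, 30]})
right = pd.DataFrame({"id": [1, 1, 2], "Re": [0.1, 0.2, 0.3]})

# A left merge keeps every left row, but id 1 matches two right rows,
# so it shows up twice in the result.
merged = left.merge(right[["id", "Re"]], on="id", how="left")
print(len(left), len(merged))  # 3 4

# Diagnose: count right-hand rows whose key was already seen.
dup_keys = right["id"].duplicated().sum()
print(dup_keys)  # 1

# One possible fix (an assumption about what is wanted): keep a single
# row per id before merging. validate= raises MergeError if the
# right-hand keys are still not unique.
right_unique = right.drop_duplicates(subset="id", keep="first")
merged_unique = left.merge(
    right_unique, on="id", how="left", validate="many_to_one"
)
print(len(merged_unique))  # 3
```

If duplicates exist on both sides, the counts multiply (m rows in df times n rows in df2 for the same `id`), which applies to how='inner' as well. Whether dropping duplicates or aggregating df2 first is the right fix depends on what the repeated `id` rows mean.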

martineau
ben shapiro

0 Answers