I have two pandas DataFrames like this:

df1:

 text_df1  label      filename
 'text_1'  'label_1'  'name_1'
 'text_2'  'label_2'  'name_1'
 'text_3'  'label_3'  'name_2'
 'text_4'  'label_4'  'name_2'

df2:

text_df2      filename
 'text_5'     'name_1'
 'text_6'     'name_2'

Expected output:

df3:

 text_df1   text_df2     filename  label
 'text_1'   'text_5'     'name_1'  'label_1'
 'text_2'   'text_5'     'name_1'  'label_2'
 'text_3'   'text_6'     'name_2'  'label_3'
 'text_4'   'text_6'     'name_2'  'label_4'

In df3 I would like to repeat the strings of text_df2 every time the filename from df2 matches the filename in a row of df1, and also carry over the label from df1. For example, text_5 is repeated twice in df3 because its filename appears twice in the filename column of df1.

I don't know how to do this in pandas.
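
For reference, the operation described above looks like a left join of df1 with df2 on filename. Below is a minimal sketch, assuming the two frames are built exactly as in the samples and that each filename appears only once in df2 (the `validate` check and the final column reordering are assumptions added for illustration, not requirements stated above):

```python
import pandas as pd

# Sample frames rebuilt from the tables in the post
df1 = pd.DataFrame({
    "text_df1": ["text_1", "text_2", "text_3", "text_4"],
    "label": ["label_1", "label_2", "label_3", "label_4"],
    "filename": ["name_1", "name_1", "name_2", "name_2"],
})
df2 = pd.DataFrame({
    "text_df2": ["text_5", "text_6"],
    "filename": ["name_1", "name_2"],
})

# Left join keeps every row of df1 and copies the matching text_df2 onto it.
# validate="many_to_one" raises if a filename is duplicated in df2
# (assumption: df2 has one row per filename).
df3 = df1.merge(df2, on="filename", how="left", validate="many_to_one")

# Reorder columns to match the expected output
df3 = df3[["text_df1", "text_df2", "filename", "label"]]

print(df3)
```

With `how="left"`, a filename in df1 that has no match in df2 would get NaN in text_df2; `how="inner"` would drop such rows instead.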

jos97