
I'd be grateful if someone could help me out.

My goal: given a DataFrame:

import pandas as pd

df = pd.DataFrame({"ID": [1, 2, 3, 4],
                   "age": [46, 48, 55, 55],
                   "gender": ['female', 'female', 'male', 'male'],
                   "overweight": ['y', 'n', 'y', 'y']},
                  index=[0, 1, 2, 3])

I have already collected the unique values of each column, "age" = [46, 48, 55], "gender" = ['female', 'male'] and "overweight" = ['y', 'n'], in a dictionary; let's call it "dict_unique_values" for now. What I want to achieve is to count how often every combination of age, gender and overweight appears in the DataFrame. The output should be an array containing the frequencies of all combinations. Here is an example for three combinations:

  1. values are: age = 46, gender = female, overweight = y --> this combination appears only once
  2. values are: age = 48, gender = female, overweight = n --> this combination appears only once
  3. values are: age = 55, gender = male, overweight = y --> this combination appears twice

So the output for these three example combinations would be: [1, 1, 2]
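To make the expected output concrete, here is a minimal sketch of what I mean, restricted to three combinations that are written out by hand. The name `example_combis` is made up for this illustration, and the approach (one boolean mask per combination) is only meant to show the desired result, not a proper solution:

```python
import pandas as pd

df = pd.DataFrame({"ID": [1, 2, 3, 4],
                   "age": [46, 48, 55, 55],
                   "gender": ["female", "female", "male", "male"],
                   "overweight": ["y", "n", "y", "y"]})

# Hypothetical, hand-written list of three example combinations
# (the row with age 46 is female in df).
example_combis = [
    {"age": 46, "gender": "female", "overweight": "y"},
    {"age": 48, "gender": "female", "overweight": "n"},
    {"age": 55, "gender": "male", "overweight": "y"},
]

frequencies = []
for combi in example_combis:
    # Start with every row selected, then narrow down column by column.
    mask = pd.Series(True, index=df.index)
    for column, value in combi.items():
        mask &= df[column] == value
    # The number of rows still selected is the frequency of this combination.
    frequencies.append(int(mask.sum()))

print(frequencies)  # [1, 1, 2]
```

This only covers combinations I list manually; it does not build the full set of combinations from the unique values.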

The problem: I have to do this for an unknown number of columns, each with its own unique values (so the number of possible combinations is the product of the number of unique values per column), and I have no idea how to do that :D

But maybe you do? :)
