We have a use case where we are using F1 score to optimize threshold selection for a binary classifier. In this use case however, FP's present a higher risk impact than FN. We'd like to consider a way to modify the conventional F1 score formula in such a way that its calculation weights FP as a higher impact than FN.
Does anyone have any suggestions on where or how in F1-score calculation we might weight the FPs at a say 20% higher than FNs?
Happy to take feedback on any of the above statements.