I am tasked with a project that aims to predict the probability of a product being returned before the product is even ordered.
I have an excel containing a bunch of orders. In order to make predictions, it is important to predict each item in the context of the order basket, e.g. if a person orders 5 pairs of shoes, he will most likely be returning 4 of the 5 pairs.
I can link purchases of different products via their order ID. Question is, how am I supposed to preprocess these IDs to preserve their relationship. Do I just encode them like a categorical attribute?