I am looking for a model that determines the likelihood of a customer making a purchase after visiting my website multiple times.
What I anticipate is there's two types of visitors... (1) those that are just checking prices, and (2) those that visit a few times before making a purchase.
All the customers, if they don't make a purchase, are effectively censored because I have no idea if they'll come back or not.
Let's say I have the following longitudinal sample data. The "time" is the time from the first appearance to time of the current record. Purchase is 0 if a purchase was not made and 1 if a purchase was made.
UserID Time Purchase
1 0 0
1 1 0
1 3 1
2 0 0
2 1 0
2 2 0
2 3 0
2 4 0
2 5 0
2 6 0
2 7 0
2 8 0
2 9 0
2 10 0
3 0 0
3 2 1
4 0 0
4 4 0
4 6 0
5 0 0
5 1 0
5 2 0
5 3 1
I can see a 60% customer purchase rate. But can I do better?
I can look at a chart of purchase likelihood as the customer "age"...
But on a customer by customer level, I think I can do even better...
For example, customer 4 looks to be sort of poking around... there's a lot of time between revisits... so how do I incorporate the timing factor here?
And suppose I had some covariates, like gender? How would I be able to account for all of these?
