I'm trying to model the distribution of my cumulated reputation on one Stack Exchange site over time (that is, each data point is the sum of my reputation whenever that reputation changes, mostly through up- or downvotes).
Here is an example of what the data looks like (the date is a timestamp in Unix time, that is, seconds since 1970-01-01 00:00:00 UTC):
reputation_history_type reputation_change creation_date cumulative_reputation
post_upvoted 10 1689366017 10
post_upvoted 10 1689376446 20
post_upvoted 10 1689504809 30
post_upvoted 10 1690366268 40
post_upvoted 10 1690472012 50
I'm interested in the relation between creation_date and cumulative reputation. The other columns are just given here to help you better understand the type of data.
I have been criticised for using a generalized linear model, because the variables aren't independent.
What would be a fitting model for such data? Or how do I find an appropriate one?
I'm using R.