I am analyzing data on polar bears and trying to figure out if different variables influence their movement. My data has a mix of categorical (e.g. bear ID number) and numerical variables (e.g. bear age) For my analysis, I was thinking of doing a model in a format like this: Movement = x1*(year) + x2*(length of ice season) + x3*(age of bear) + bear’s individual ID + etc. I am stuck between two options:
1. Doing a GLMM (Generalized Linear Mixed Model). Since I’m pretty sure my independent variables don’t all have a linear relationship to my dependent variable, I was thinking of doing a quick visual analysis of my variables and tweaking them accordingly: for example, if it looks like age of the bear has more of an exponential relationship with my movement variable, then I would write it in the model as x3*log(age of bear).
2. Doing a GAM (Generalized Additive Model). I’m not too familiar with this type of model, but I have heard that it’s usually the way to go if you believe the relationship between your variables isn’t necessarily linear.
In both cases, I am planning on including the bear ID as a random effect.
Which test would you recommend? Are there pros and cons to each? As an aside, my data also has relatively small sample sizes (30 to 45 bears).