I am using the famous diamonds dataset and I'm using linear regression to fit the following model: $\ln(\text{price}) = \beta_0 + \beta_1 \cdot \ln{\text{x}} + \beta_2 \cdot \text{clarity}$
where carat is a numerical value and clarity is a categorical one (so, one coefficient per dummy).
The model gets a ~ 96% $R^2$ coeficient and the fit seems quite good, but the residuals are not normal. My question is, in light of the following information, is this a problem? How should I treat this problem?
The plots:
y_test vs y_pred:
Residual vs y_test:
Residual vs predictor (x):
Residual histogram
QQ plot:
Besides that, residual stats:
mean 0.000654 std 0.196600 skew -0.264115 kurtosis 1.586213
Should I be worried? Should the non-normality of my residuals be addressed?




