Let us assume your target variable is conditionally normally distributed on the log scale (equivalent to residuals being normal). Then your point predictions on the log scale are the expectation and the median (which coincide for the normal distribution), again on the log scale.
Exponentiating the log-expectation and the residuals does not result in a normal distribution on the original scale, but in a lognormal distribution. For this distribution, the expectation and the median are no longer the same, the median is lower than the expectation. The exponentiation turns your point predictions on the log scale into median predictions on the original scale. They are no longer expectation predictions here. (If you want an expectation forecast, see here under "bias adjustments" for $\lambda=0$.)
The final piece of the puzzle is that the MAE is minimized precisely by the median, not the expectation. This is why your exponentiated predictions (which aim at the conditional median on the original scale) perform better under MAE than modeling directly on the original scale, because (presumably and almost certainly) your model on the original scale targets the expectation and not the median.
Comparing the two predictions does not make sense, because they aim at minimizing two different loss functions: modeling on the original scale (presumably and almost certainly) aims at an unbiased expectation prediction, which minimizes the MSE, while modeling on the log scale and exponentiating aims at a median forecast, which minimizes the MAE. I have written a little paper on this (Kolassa, 2020, IJF).
If you want an expectation forecast, you should do a bias-adjusted back-transformation (see the link above) and assess both models using the MSE. If you want a median forecast, you can go the route you took, and/or model on the original scale using the MAE as a loss function, and in any case evaluate the predictions using the MAE. Per this argument, expanded in the paper, it does not make sense to evaluate a forecast using multiple different error metrics.
There are illustrations of how optimal forecasts differ by error metric for different conditional distributions here and here.