I have a log-likelihood function with four parameters, out of which the variability of the likelihood with two parameters is extremely smooth. It suggests that the derivative is small for large ranges of these two parameters. Although I know there is a maximum, I have never achieved it through any optimization (local or global) I tried. If I use my own gradient functions, it takes a lot of time to converge, and when it converges, it converges to a wrong value. Are there some techniques by which I can handle such smooth log-likelihoods?
I am writing the likelihood with just these two parameters in the following equation.
$$ \log(p(Z|\Theta)) = -\sum_{i = 1}^N \left[ \log(\pi (F(f_i, \Theta))) + \frac{\sum_{l=1}^L Z_l(f_i)}{F(f_i, \Theta)} \right], $$
where $Z$ are the measurements and $F$ is a model of the expectation of the measurements that is a function of the parameters.
$$F(f_i, \Theta) = M\left[1+ \sum_{q = 1}^{N-1}\left(1 - \frac{q}{N}\right)\left[\exp(-j2\pi f_iq)G(q, \Theta) + \exp(j2\pi f_iq)G(-q, \Theta)\right]\right]$$
The function $G(q, \Theta)$ is difficult.
$G(q, \eta, \Lambda) = \exp(jC_1q) \int_{0}^{\infty} x^\eta \exp(-\Lambda x) \exp(-jC_2\exp(-c_3x)) dx, $ where $\Theta = [\eta, \Lambda]$. I also have used approximations of this integral.
EDIT
Based on the comment of @Cryo, I wanted to make it clear that, for now, I do not have a solid idea about the "acceptable" ranges of these parameters. So, the solutions I usually get are almost equally likely (because these solutions are on the plateau of the cost function). When I try reconstructing with $F(\hat{\Theta})$, I usually see they are a really good fit with the measurements $Z$.
Then, my correct question would be about how to make the optimization faster. If I consider all four parameters in my model, the gradient descent quasi-Newton-type algorithms take a lot of time to converge to a solution due to this plateau with these two parameters. This is frustrating because I want to make some Monte Carlo simulations on parameter sweeps. One such Monte Carlo run takes weeks to compute.
EDIT
I tried plotting the log-likelihood with the two problematic parameters by fixing the other two. The true values are $\eta = 2$, and $\Lambda = 3.6$. I can see there is clearly a maximum, but I have put the color scheme in a way that it is easier to see. You can see the other picture with a broader colorbar and notice how smooth the cost is.

