In most books, the result that the MLE is asymptotically normal is given, and that is used as the definition of asymptotically normal, with no mention of what the actual definition is for a general estimator.
Let $\hat{\theta}_n$ be an estimator for $\theta$. Consider the following definitions.
Defn 1 (Wikipedia): $\hat{\theta}_n$ is asymptotically normal if there exists a sequence of constants $a_n$, $b_n$ such that $\frac{\hat{\theta}_n-a_n}{b_n}\stackrel{d}{\to} N(\mu,\sigma^2)$.
Defn 2: $\hat{\theta}_n$ is asymptotically normal if $\frac{\hat{\theta}_n-\theta}{{se}(\hat{\theta}_n)} \stackrel{d}{\to} N(0,1)$.
Defn 3: $\hat{\theta}_n$ is asymptotically normal if $\frac{\hat{\theta}_n-\theta}{{se}(\hat{\theta}_n)} \stackrel{d}{\to} N(\mu,\sigma^2)$.
Defn 4 (StackExchange/MLE version): $\hat{\theta}_n$ is asymptotically normal if $\sqrt{n}(\hat{\theta}_n -\theta) \stackrel{d}{\to} N(0,\sigma^2)$.
In Defn 1, we can assume without loss of generality that $\mu=0$ and $\sigma^2=1$, but we can't do that for Defn 2 and Defn 3. One of the main questions I have is can an estimator be asymptotically normal where the limiting distribution is normal with a nonzero mean. Does such an estimator exist?
Suppose $\theta$ is a mean and we estimate it using the bad estimator $\hat{\theta}_n = \bar{X}+1$, the sample mean plus 1. Then this is asymptotically normal according to Defn 1 ($a_n = \theta+1$), but not asymptotically normal by Defn 2 ($\frac{\hat{\theta}_n-\theta}{{se}(\hat{\theta}_n)} = \frac{\bar{X}-\theta}{{se}(\hat{\theta}_n)} + \frac{a}{{se}(\hat{\theta}_n)}$, where the first term converges to $N(0,1)$ and the second term diverges), and even Defn 3.
Defn 4 seems too limited and too closely related to MLE. It doesn't appear to account for cases where the rate of convergence is not $\sqrt{n}$.
It seems Defn 1 is the most expansive definition here. It raises the question that if Defn 1 holds with $\mu=0$ and $\sigma^2=1$, how is it related to Defn 2, specifically is it necessary that $\frac{b_n}{se(\hat{\theta}_n)}\stackrel{p}{\to}1$ and $E(\hat{\theta}_n)-a_n\stackrel{p}{\to}0$?
So what is the correct definition? Can the mean of the limiting distribution be nonzero. Is there a reference?