Questions tagged [identifiability]

A model is identifiable if a single set of parameters can be found that will yield the best fit.

A model is identifiable if a single set of parameters can be found that will yield the best fit.

For example, consider the classic ANOVA model:
$$ y_{ij} = \mu + \alpha_j + \varepsilon_i $$ where $y_{ij}$ represents observed scores decomposed into a population mean, $\mu$, a mean shift, $\alpha_j$, associated with condition $j$, and each unit's individual divergence from their condition's mean, $\varepsilon_i$. When there are $J$ conditions, there are $J+1$ parameters to fit in this model. Without additional constraints, this model is unidentifiable; for instance if three conditions had means $3$, $4$, and $5$, they could be fit equally well with:
\begin{array}[llll] \mu \mu = 1 \qquad &\alpha_1 = \; \; \ 2 \qquad &\alpha_2 = \; \; \ 3 \qquad &\alpha_3 = \; \; \ 4 \\ \mu = 4 \qquad &\alpha_1 = -1 \qquad &\alpha_2 = \; \; \ 0 \qquad &\alpha_3 = \; \; \ 1 \\ \mu = 9 \qquad &\alpha_1 = -6 \qquad &\alpha_2 = -5 \qquad &\alpha_3 = -4 \\ ... \end{array} (Therefore, in practice the ANOVA is given additional constraints such as $\frac{1}{N}\sum_j n_j\alpha_j = 0$.)

Although issues with identifiability are trivial in the above example, they can be more subtle in other contexts. Identifiability concerns can arise in fitting structural equations models (), for example.

261 questions
50
votes
2 answers

What is model identifiability?

I know that with a model that is not identifiable the data can be said to be generated by multiple different assignments to the model parameters. I know that sometimes it's possible to constrain parameters so that all are identifiable, as in the…
Jack Tanner
  • 4,842
4
votes
2 answers

Understanding basic identifiability

I'm having trouble understanding identifiability. Specifically, I'm not sure, in the following example, why $P\left(C\right)$ cannot be identified. Here's the example: You have 2 unfair coins with the following…
3
votes
2 answers

How do we Deal with Identifiability Problems in Statistics?

In statistics, are there any common strategies to deal with non-identifiable models? For example, I have heard that mixture models (i.e. based on weighted sums of normal probability distributions) can be quite useful at modelling complex datasets…
stats_noob
  • 1
  • 3
  • 32
  • 105
3
votes
2 answers

What is a "weakly identified" parameterization?

I understand that a parameterization is identified if it's true that $$ \theta_1 \neq \theta_2 \Rightarrow p(y|\theta_1) \neq p(y|\theta_2) $$ Intuitively, it means that two different parameter values must result in two different probability of the…
Heisenberg
  • 4,590
  • 4
  • 30
  • 62
1
vote
0 answers

How do we identify the parameters in this factor model?

Consider the following model: $$ y_{it}=\nu_{it}+\epsilon_{it}$$ $$\nu_{it}=\rho \nu_{it-1}+\zeta_{it}$$ Where $y_{it}$ is the income for $i$ at time $t$. $\epsilon_{it}$ is the idiosyncratic income shock. $\nu_{it}$ denotes the permanent component…
Ludwig Gershwin
  • 303
  • 1
  • 4
1
vote
0 answers

How to verify identification of a model?

I have the system of simultaneous equations: $$ \begin{cases} y_1 = b_{12}y_2 + b_{13}y_3 + a_{11}x_{12} + a_{13}x_3 \\ y_2 = b_{21}y_1 + a_{21}x_1 + a_{22}x_{2} \\ y_3 = b_{32}y_2 + a_{31}x_1 + a_{32}x_2 + a_{33}x_3 \end{cases} $$. For the second…
1
vote
0 answers

Can I combine equations to produce overidentifying restrictions?

Say theory tells me that $$ y = f(x_1,x_2|\theta_f) $$ where $\theta_f$ is a set of parameters Similarly, theory tells me that $$ y = g(x_1,x_3|\theta_g) $$ where $\theta_g$ is another set of parameters. There might be common parameters between the…
luchonacho
  • 2,742
  • 4
  • 26
  • 47
1
vote
1 answer

Stochastic volatility model - why not identified?

For the following model $y_t = \beta e^{h_t/2} \epsilon_t$ $h_{t+1} = \mu + \phi(h_t - \mu) + \sigma_n \eta_t$ this Kim et al (1998) paper writes that For identifiability reasons either $\beta$ must be set to one or $\mu$ to zero Why is that? (to…
luffe
  • 265
1
vote
1 answer

Is it possible to automatically de-identify/confidentialise aggregate data?

Suppose I have a data set containing every tax return lodged with a particular tax authority. Suppose I would like to allow people to study trends and patterns but not see any individual's tax return information. I can produce a lot of aggregate…