Questions tagged [identifiability]

A model is identifiable if a single set of parameters can be found that will yield the best fit.

For example, consider the classic ANOVA model:
$$ y_{ij} = \mu + \alpha_j + \varepsilon_i $$ where $y_{ij}$ represents observed scores decomposed into a population mean, $\mu$, a mean shift, $\alpha_j$, associated with condition $j$, and each unit's individual divergence from their condition's mean, $\varepsilon_i$. When there are $J$ conditions, there are $J+1$ parameters to fit in this model. Without additional constraints, this model is unidentifiable; for instance if three conditions had means $3$, $4$, and $5$, they could be fit equally well with:
\begin{array}[llll] \mu \mu = 1 \qquad &\alpha_1 = \; \; \ 2 \qquad &\alpha_2 = \; \; \ 3 \qquad &\alpha_3 = \; \; \ 4 \\ \mu = 4 \qquad &\alpha_1 = -1 \qquad &\alpha_2 = \; \; \ 0 \qquad &\alpha_3 = \; \; \ 1 \\ \mu = 9 \qquad &\alpha_1 = -6 \qquad &\alpha_2 = -5 \qquad &\alpha_3 = -4 \\ ... \end{array} (Therefore, in practice the ANOVA is given additional constraints such as $\frac{1}{N}\sum_j n_j\alpha_j = 0$.)

Although issues with identifiability are trivial in the above example, they can be more subtle in other contexts. Identifiability concerns can arise in fitting structural equations models (sem), for example.

261 questions

votes

2 answers

What is model identifiability?

I know that with a model that is not identifiable the data can be said to be generated by multiple different assignments to the model parameters. I know that sometimes it's possible to constrain parameters so that all are identifiable, as in the…

identifiability

asked Jan 05 '12 at 02:59

Jack Tanner

4,842

votes

2 answers

Understanding basic identifiability

I'm having trouble understanding identifiability. Specifically, I'm not sure, in the following example, why $P\left(C\right)$ cannot be identified. Here's the example: You have 2 unfair coins with the following…

identifiability

asked Jul 12 '12 at 21:26

Phillip Cloud

votes

2 answers

How do we Deal with Identifiability Problems in Statistics?

In statistics, are there any common strategies to deal with non-identifiable models? For example, I have heard that mixture models (i.e. based on weighted sums of normal probability distributions) can be quite useful at modelling complex datasets…

identifiability

asked Jul 01 '23 at 04:35

stats_noob

votes

2 answers

What is a "weakly identified" parameterization?

I understand that a parameterization is identified if it's true that $$ \theta_1 \neq \theta_2 \Rightarrow p(y|\theta_1) \neq p(y|\theta_2) $$ Intuitively, it means that two different parameter values must result in two different probability of the…

identifiability

asked Jan 19 '18 at 21:22

Heisenberg

4,590
4
30
62

vote

0 answers

How do we identify the parameters in this factor model?

Consider the following model: $$ y_{it}=\nu_{it}+\epsilon_{it}$$ $$\nu_{it}=\rho \nu_{it-1}+\zeta_{it}$$ Where $y_{it}$ is the income for $i$ at time $t$. $\epsilon_{it}$ is the idiosyncratic income shock. $\nu_{it}$ denotes the permanent component…

identifiability

asked Sep 21 '22 at 02:44

Ludwig Gershwin

vote

0 answers

How to verify identification of a model?

I have the system of simultaneous equations: $$ \begin{cases} y_1 = b_{12}y_2 + b_{13}y_3 + a_{11}x_{12} + a_{13}x_3 \\ y_2 = b_{21}y_1 + a_{21}x_1 + a_{22}x_{2} \\ y_3 = b_{32}y_2 + a_{31}x_1 + a_{32}x_2 + a_{33}x_3 \end{cases} $$. For the second…

identifiability

asked May 13 '20 at 14:23

VanyaTihonov

vote

0 answers

Can I combine equations to produce overidentifying restrictions?

Say theory tells me that $$ y = f(x_1,x_2|\theta_f) $$ where $\theta_f$ is a set of parameters Similarly, theory tells me that $$ y = g(x_1,x_3|\theta_g) $$ where $\theta_g$ is another set of parameters. There might be common parameters between the…

identifiability

asked Jan 16 '20 at 12:51

luchonacho

2,742
4
26
47

vote

1 answer

Stochastic volatility model - why not identified?

For the following model $y_t = \beta e^{h_t/2} \epsilon_t$ $h_{t+1} = \mu + \phi(h_t - \mu) + \sigma_n \eta_t$ this Kim et al (1998) paper writes that For identifiability reasons either $\beta$ must be set to one or $\mu$ to zero Why is that? (to…

identifiability

asked Feb 13 '17 at 22:11

luffe

vote

1 answer

Is it possible to automatically de-identify/confidentialise aggregate data?

Suppose I have a data set containing every tax return lodged with a particular tax authority. Suppose I would like to allow people to study trends and patterns but not see any individual's tax return information. I can produce a lot of aggregate…

identifiability

asked Jun 05 '16 at 14:06

Patrick Conheady