Questions tagged [robust]

Robustness in general refers to a statistic's insensitivity to deviations from its underlying assumptions (Huber and Ronchetti, 2009).

Robust statistics are insensitive to deviations from their underlying assumptions and outliers. Such methods are useful it is not possible to detect and remove outliers or to appropriately test the assumptions required by a given statistic. A robust statistic is meant to achieve three goals:

  1. efficiency - it should have an optimal or nearly optimal efficiency as the assumed model
  2. stability - small deviations from the assumptions should have only a small influence on performance
  3. breakdown - larger deviations from the assumptions should not lead to a complete failure

Examples of robust statistics are median regression as estimation technique, or Huber-White standard errors for statistical inference. Note that "robust" is not equivalent to "better". Robustness is always based on compromise as it sacrifices efficiency to ensure against larger deviations from the assumptions from the model (Anscombe, 1960).

For further reading see

  • Huber, P.J. and Ronchetti, E.M. (2009) "Robust Statistics", 2nd Edition, Wiley Series in Probability and Statistics, John Wiley & Sons, Inc., New Jersey
  • Anscombe, F.J. (1960) "Rejection of Outliers", Technometrics, Vol. 2, pp. 123-147
581 questions
9
votes
2 answers

What is the difference between the MCD and the MVE estimators?

As far as I understand, the Minimum Covariance Determinant (MCD) estimator looks for the subset of h data points whose covariance matrix has the smallest determinant. the Minimum Volume Ellipsoid (MVE) searches for the ellipsoid with the smallest…
user7064
  • 2,213
6
votes
2 answers

Robust version of Hotelling $T^2$ test

I am looking for a robust version of Hotelling's $T^2$ test for the mean of a vector. As data, I have a $m\ \times\ n$ matrix, $X$, each row an i.i.d. sample of an $n$-dimensional RV, $x$. The null hypothesis I wish to test is $E[x] = \mu$, where…
shabbychef
  • 14,814
6
votes
3 answers

Robust regression or ANOVA for non-normal dependent variable

I have a data set with a dependent variable on a scale from 0 to 100 (n=198). The problem is that many subjects (25) scored exactly 100 but below 100 every score is achieved only once. This distorts the histogram as you can see on the following…
DBR
0
votes
1 answer

Multiple comparison tests after using Robust method (lmrob)

I have some violation of assumptions (normality and equality of variances) is my analysis and I decided to a use robust technique (lmrob function in R). I have a continuous response, one categorical predictor and one covariate. I was wondering which…
Farid
  • 83