Most Popular
1500 questions
52
votes
3 answers
Bootstrap vs. permutation hypothesis testing
There are several popular resampling techniques that are often used in practice, such as the bootstrap, permutation tests, the jackknife, etc. Numerous articles and books discuss these techniques, for example Philip I Good (2010) Permutation,…
Tu.2
- 2,957
52
votes
3 answers
How are we defining 'reproducible research'?
This has come up in a few questions now, and I've been wondering about something. Has the field as a whole moved toward "reproducibility" focusing on the availability of the original data, and the code in question?
I was always taught that the core…
Fomite
- 23,134
52
votes
1 answer
How to interpret error measures?
I am running a classifier in Weka on a certain dataset, and I've noticed that when I try to predict a nominal value, the output specifically shows the correctly and incorrectly predicted values. However, now I'm running it for a numerical…
FloIancu
- 623
51
votes
3 answers
Confidence interval around binomial estimate of 0 or 1
What is the best technique for calculating a confidence interval for a binomial experiment if your estimate is $p=0$ (or similarly $p=1$) and the sample size is relatively small, for example $n=25$?
Kasper
- 3,399
51
votes
7 answers
Why is "statistically significant" not enough?
I have completed my data analysis and obtained "statistically significant" results that are consistent with my hypothesis. However, a statistics student told me this is a premature conclusion. Why? Is there anything else that needs to be included in my…
Jim Von
- 621
51
votes
2 answers
Multiple regression or partial correlation coefficient? And relations between the two
I don't even know if this question makes sense, but what is the difference between multiple regression and partial correlation (apart from the obvious differences between correlation and regression, which is not what I am aiming at)?
I want to…
user34927
51
votes
3 answers
Is it possible to interpret the bootstrap from a Bayesian perspective?
Ok, this is a question that keeps me up at night.
Can the bootstrap procedure be interpreted as approximating some Bayesian procedure (except for the Bayesian bootstrap)?
I really like the Bayesian "interpretation" of statistics which I find nicely…
Rasmus Bååth
- 6,840
51
votes
5 answers
What is residual standard error?
When running a multiple regression model in R, one of the outputs is a residual standard error of 0.0589 on 95,161 degrees of freedom. I know that the 95,161 degrees of freedom is given by the difference between the number of observations in my…
ustroetz
- 791
51
votes
2 answers
Comparing two models using anova() function in R
From the documentation for anova():
When given a sequence of objects, ‘anova’ tests the models against one another in the order specified...
What does it mean to test the models against one another? And why does the order matter?
Here is an…
qed
- 2,808
51
votes
9 answers
Does anyone know any good open source software for visualizing data from a database?
Recently I came across Tableau and tried to visualize data from a database and a CSV file. The user interface lets the user visualize time and spatial data and create plots in an instant. Such a tool is really useful as it makes it possible to observe the…
niko
- 1,353
51
votes
4 answers
What are the factors that cause the posterior distributions to be intractable?
In Bayesian statistics, it is often mentioned that the posterior distribution is intractable and thus approximate inference must be applied. What are the factors that cause this intractability?
Nick
- 3,537
51
votes
1 answer
What is the difference between a loss function and an error function?
Is the term "loss" synonymous with "error"? Is there a difference in definition?
Also, what is the origin of the term "loss"?
NB: The error function mentioned here is not to be confused with normal error.
Dan Kowalczyk
- 620
51
votes
5 answers
How do I test a nonlinear association?
For plot 1, I can test the association between x and y by doing a simple correlation.
For plot 2, where the relationship is nonlinear yet there is a clear relation between x and y, how can I test the association and label its nature?
user1447630
- 1,059
51
votes
3 answers
Dice-coefficient loss function vs cross-entropy
When training a pixel segmentation neural network, such as a fully convolutional network, how do you decide between the cross-entropy loss function and the Dice-coefficient loss function?
I realize this is a short question, but not quite…
Christian
- 1,872
51
votes
14 answers
Why is median age a better statistic than mean age?
If you look at Wolfram Alpha
Or this Wikipedia page List of countries by median age
Clearly median seems to be the statistic of choice when it comes to ages.
I am not able to explain to myself why the arithmetic mean would be a worse statistic.…
Lazer
- 613