Highest Voted Questions - Statistical Analysis Stack Exchange

52

votes

3 answers

Bootstrap vs. permutation hypothesis testing

There are several popular resampling techniques, which are often used in practice, such as bootstrapping, permutation test, jackknife, etc. There are numerous articles & books discuss these techniques, for example Philip I Good (2010) Permutation,…

asked Dec 25 '11 at 01:03

Tu.2

2,957

52

votes

3 answers

How are we defining 'reproducible research'?

This has come up in a few questions now, and I've been wondering about something. Has the field as a whole moved toward "reproducibility" focusing on the availability of the original data, and the code in question? I was always taught that the core…

asked Aug 31 '11 at 03:39

Fomite

23,134

52

votes

1 answer

How to interpret error measures?

I am running the classify in Weka for a certain dataset and I've noticed that if I'm trying to predict a nominal value the output specifically shows the correctly and incorrectly predicted values. However, now I'm running it for a numerical…

asked Jan 05 '15 at 13:54

FloIancu

623

51

votes

3 answers

Confidence interval around binomial estimate of 0 or 1

What is the best technique to calculate a confidence interval of a binomial experiment, if your estimate is that $p=0$ (or similarly $p=1$) and sample size is relatively small, for example $n=25$?

asked Jan 19 '14 at 08:38

Kasper

3,399

51

votes

7 answers

Why is "statistically significant" not enough?

I have completed my data analysis and got "statistically significant results" which is consistent with my hypothesis. However, a student in statistics told me this is a premature conclusion. Why? Is there anything else needed to be included in my…

asked Dec 11 '13 at 04:43

Jim Von

621

51

votes

2 answers

Multiple regression or partial correlation coefficient? And relations between the two

I don't even know if this question makes sense, but what is the difference between multiple regression and partial correlation (apart from the obvious differences between correlation and regression, which is not what I am aiming at)? I want to…

asked Nov 17 '13 at 18:49

user34927

51

votes

3 answers

Is it possible to interpret the bootstrap from a Bayesian perspective?

Ok, this is a question that keeps me up at night. Can the bootstrap procedure be interpreted as approximating some Bayesian procedure (except for the Bayesian bootstrap)? I really like the Bayesian "interpretation" of statistics which I find nicely…

asked Oct 03 '13 at 09:21

Rasmus Bååth

6,840

51

votes

5 answers

What is residual standard error?

When running a multiple regression model in R, one of the outputs is a residual standard error of 0.0589 on 95,161 degrees of freedom. I know that the 95,161 degrees of freedom is given by the difference between the number of observations in my…

asked Apr 30 '13 at 20:54

ustroetz

791
1
8
14

51

votes

2 answers

Comparing two models using anova() function in R

From the documentation for anova(): When given a sequence of objects, ‘anova’ tests the models against one another in the order specified... What does it mean to test the models against one another? And why does the order matter? Here is an…

asked Mar 26 '13 at 10:01

qed

2,808

51

votes

9 answers

Does anyone know any good open source software for visualizing data from database?

Recently I came across Tableau and tried to visualize the data from database and csv file. The user iterface enables the user to visualize time and spatial data and create plots in an instant. Such tool is really useful as it enables to observe the…

asked Nov 22 '12 at 16:28

niko

1,353

51

votes

4 answers

What are the factors that cause the posterior distributions to be intractable?

In Bayesian statistics, it is often mentioned that the posterior distribution is intractable and thus approximate inference must be applied. What are the factors that cause this intractability?

asked Nov 11 '10 at 00:33

Nick

3,537

51

votes

1 answer

What is the difference between a loss function and an error function?

Is the term "loss" synonymous with "error"? Is there a difference in definition? Also, what is the origin of the term "loss"? NB: The error function mentioned here is not to be confused with normal error.

loss-functions

asked Jul 26 '18 at 00:00

Dan Kowalczyk

620

51

votes

5 answers

How do I test a nonlinear association?

For plot 1, I can test the association between x and y by doing a simple correlation. For plot 2, where the relationship is nonlinear yet there is a clear relation between x and y, how can I test the association and label its nature?

asked Sep 07 '12 at 23:11

user1447630

1,059

51

votes

3 answers

Dice-coefficient loss function vs cross-entropy

When training a pixel segmentation neural network, such as a fully convolutional network, how do you make the decision to use the cross-entropy loss function versus Dice-coefficient loss function? I realize this is a short question, but not quite…

asked Jan 04 '18 at 03:12

Christian

1,872
4
21
28

51

votes

14 answers

Why is median age a better statistic than mean age?

If you look at Wolfram Alpha Or this Wikipedia page List of countries by median age Clearly median seems to be the statistic of choice when it comes to ages. I am not able to explain to myself why arithmetic mean would be a worse statistic.…

asked Sep 10 '10 at 20:26

Lazer

613

Most Popular