Questions tagged [boxplot]

A graphical display to summarize the distribution of a sample. It displays five numbers plus (possibly) some outliers - those five points being the median, hinges (approximate quartiles), and the largest and smallest value not counting any points marked as outliers.

A boxplot or box-and-whisker plot is a graphical display that summarizes a sample distribution. Mostly used when comparing many samples, it displays five numbers plus (possibly) some outliers. The five numbers are the median (drawn as a central line) and hinges - approximate quartiles - for which a box is drawn. Then lines are drawn out to the largest and smallest values not marked as outliers. There are many variants of the basic boxplot.
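As a rough illustration of the five numbers described above, here is a minimal Python sketch. It rests on assumptions not stated in the tag description: quartiles are computed with the standard library's "inclusive" interpolation rather than Tukey's hinges, and outliers are flagged with the common 1.5 × IQR fence rule. The function name `boxplot_stats` and the sample data are hypothetical, and plotting software may use different conventions.

```python
from statistics import quantiles

def boxplot_stats(sample, whisker=1.5):
    """Return the five boxplot numbers plus any flagged outliers.

    Assumptions: 'inclusive' quartiles (not Tukey hinges) and the
    conventional whisker * IQR fence rule for marking outliers.
    """
    x = sorted(float(v) for v in sample)
    q1, median, q3 = quantiles(x, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - whisker * iqr
    upper_fence = q3 + whisker * iqr
    # Whiskers end at the most extreme observations inside the fences.
    inside = [v for v in x if lower_fence <= v <= upper_fence]
    outliers = [v for v in x if v < lower_fence or v > upper_fence]
    return {
        "whisker_low": min(inside),
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_high": max(inside),
        "outliers": outliers,
    }

stats = boxplot_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 30])
print(stats)
```

With this sample the value 30 lies beyond the upper fence, so the upper whisker stops at 9 and 30 is reported separately as an outlier. The fences are only cut-offs: the whiskers are drawn to actual data values, not to the fences themselves.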

262 questions
7
votes
2 answers

Why does Tableau's Box/Whisker plot show outliers automatically and how can I get rid of it?

I have a data set shown as box-whisker graphs after disaggregating. See below. I am wondering why Tableau (the product I am using) automatically plots a whole bunch of values outside the box-whisker. I thought the whiskers of the box are minimums…
Gil
  • 173
  • 1
  • 1
  • 4
4
votes
1 answer

Box Plot Upper Fence is greater than Max

Hi, is it normal for the upper fence to be greater than the max? If not, what might have gone wrong? I am using the Empirical Rule and calculating mean + 3 * IQR. Thanks
Nathan
  • 43
2
votes
1 answer

Reproduce boxplot with only data points at given quantiles

It is trivial to create a boxplot in R with a full dataset. However, with limited access to the whole dataset, I just have 5 data points at min, 25%, 50%, 75%, and max. So is there any easy way to reproduce the boxplot with only these 5 values?
divy
  • 21
2
votes
2 answers

Using boxplots of previous day’s percentage change in S&P index to predict today's return based on yesterday's movement

In the book "An Introduction to Statistical Learning with Applications in Python" (Trevor Hastie et al., Springer), there's the following paragraph: The left-hand panel of Figure 1.2 displays two boxplots of the previous day’s percentage changes in…
2
votes
1 answer

Help needed with my box plot

I decided to give box and whiskers plots a try with my data. However, the produced box plots look a bit abnormal. I have not been able to find such cases online. For example, my "T" value has no box at all, while the median of "R" value is actually…
2
votes
1 answer

Boxplot interpretation

I'm taking an introductory online statistics class. Unfortunately not everything is clear to me from this class. One of the chapters in the book contains an example of a Box-and-Whisker interpretation, which I can't understand. Here is the…
flashburn
  • 311
1
vote
1 answer

Boxplot with a symmetric box, but with whiskers of different lengths

From Figure 2 of Ferreira et al. (2016) "Graphical representation of chemical periodicity of main elements through boxplot", we can see the taxonomy of some common cases of symmetrical and asymmetrical distributions, with their corresponding…
Ommo
  • 270
1
vote
1 answer

determine the value of the mean using whisker plot

How can I determine the mean using a box plot? Also, can we know which series has the highest mean value?
Hasan
  • 31
1
vote
2 answers

Why do box-and-whisker plots use interquartile range, when it deliberately ignores everything below the 25% percentile and above the 75% percentile?

Why ought IQRs, and anything that relies on them like boxplots, be used when they ignore data? I replicated Wikipedia's graphs.
user157457
1
vote
1 answer

Simplify the data in a boxplot

I'm a web developer and currently working on visualizing the execution time of processing a request. Thus, I can create a list with all execution times (e.g., t = [0.12, 0.23, 0.16, 0.17, ...]). The problem is that the size of the list can grow…
1
vote
0 answers

ANOVA ASSUMPTIONS, LOW or NONE VARIANCE IN PART OF DATA

I was told there is a problem here because there is no variance in the control and ulex treatments: either this is the nature of the data, or the assumptions for applying ANOVA to these data were not met. The data was log transformed…
1
vote
1 answer

boxplot : computed values combined with raw data

Given computed values for a box plot (min, lq, median, uq, max) and the size of the original raw data, is it possible to merge that with raw data? Given: box1 = [min:1,lq:2,median:5,uq:8,max:10] size1 = 50 #size of dataset that gave me the box1 calculated…
1
vote
0 answers

Time period length for comparing two Boxplots?

I want to see the variability in a dataset or time series in two different periods by means of a box-and-whisker plot. My time series covers the period 1901-2010. I want to see the variability of the post-1970 period w.r.t. the pre-1970 period. Is this the…
dsbisht
  • 171
0
votes
1 answer

How do I determine if differences between medians are statistically significant when notches are very close (see "C" and "D")?

Regarding "C" and "D": Is there a way other than visually inspecting the notches from the boxplots? I know the range of values for "D" is lower than "C" and the median value appears to be lower for "D". I also think that box "D" has less…
0
votes
1 answer

arrows on whiskers of box plots

I plotted box plots for a dataset of >20 million rows. The whiskers came out with little arrows pointing towards the boxes: What do they mean?
gil
  • 1