A histogram is a graphical representation of the frequency distribution of a continuous variable. The variable's range is divided into bins, and a bar is drawn for each bin with height proportional to the number of observations that fall in that bin.
Questions tagged [histogram]
561 questions
3
votes
2 answers
What to do when IQR returns 0 in Freedman-Diaconis' rule?
I want to discretize a continuous pandas column. I'm using the Freedman-Diaconis rule to compute the optimal number of bins, which is then passed as input to KBinsDiscretizer. The Freedman-Diaconis rule states that,
$$ \text{bin width},…
Robur_131
- 133
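One way to handle the zero-IQR case raised in this question is to fall back to a different bin-count rule whenever the Freedman-Diaconis width collapses to 0. The sketch below is only an illustration under assumptions that are not in the question: the function name `fd_bin_count` is hypothetical, the fallback is Sturges' rule (a swapped-in technique, not part of Freedman-Diaconis), and quartiles are taken with the "inclusive" method of Python's `statistics.quantiles`.

```python
import math
import statistics


def fd_bin_count(values):
    """Bin count from the Freedman-Diaconis rule, with a fallback when IQR == 0.

    Hypothetical helper: the Sturges fallback is an assumption of this sketch.
    """
    data = sorted(values)
    n = len(data)
    if n < 2:
        return 1
    data_range = data[-1] - data[0]
    if data_range == 0:
        return 1  # all values identical: a single bin is all we can do

    q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr > 0:
        # Freedman-Diaconis: bin width = 2 * IQR * n^(-1/3)
        width = 2.0 * iqr * n ** (-1.0 / 3.0)
        return max(1, math.ceil(data_range / width))

    # IQR is 0 (e.g. more than half of the values are tied):
    # fall back to Sturges' rule, ceil(log2(n)) + 1.
    return math.ceil(math.log2(n)) + 1


# Heavily tied data: 90 zeros and the integers 1..10, so IQR == 0.
tied = [0] * 90 + list(range(1, 11))
n_bins = fd_bin_count(tied)
```

The resulting integer could then be passed as `n_bins` to `KBinsDiscretizer`. Other fallbacks (for example Scott's rule, which uses the standard deviation instead of the IQR) would slot into the same place.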
3
votes
2 answers
Making two histograms to compare homerun stats
When making histograms to compare two things, say the homerun distances of two baseball players, do you have to use the same numerical scale (range) on the x-axis for each, even if their homerun distances do not start at the same distance? And should…
bob
- 31
2
votes
0 answers
Derivation/Explanation of the Freedman-Diaconis Rule
Can anyone provide a good derivation of the FD rule? Or explain why it is a good way to define the bin widths of a histogram. Are there any other similar rules and how do they compare?
user27119
- 308
2
votes
1 answer
Classifying histograms in N-dimensional space - is my approach correct?
I have a problem which consists of classifying N-dimensional histograms. The salient points are as follows:
For ALL dimensions:
Each histogram has the same number of bins (say 500, for argument's sake)
Each bin has the same range
A high level view…
1
vote
0 answers
Freedman-Diaconis Rule
The Freedman-Diaconis Rule says that the optimal bin size of a histogram is $$ \text{Bin Size} = 2 \cdot \text{IQR}(x) n^{-1/3}$$ where $x$ is the data and $n$ is the number of observations in the data.
Using this rule, can we infer the shape of the…
proton
- 661
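As a quick worked instance of the formula in this question, take purely hypothetical values $\text{IQR}(x) = 12$ and $n = 1000$ (these numbers are not from the question). Since $1000^{-1/3} = 1/10$:

$$ \text{Bin Size} = 2 \cdot 12 \cdot 1000^{-1/3} = \frac{24}{10} = 2.4 $$

The width shrinks only as $n^{-1/3}$, so multiplying the sample size by 8 halves the bin size for the same IQR.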
1
vote
0 answers
Wikipedia's Article on the Histogram
Wikipedia's nice article on the histogram contains the following sentence:
"Using wider bins where the density of the underlying data points is low reduces noise due to sampling randomness; using narrower bins where the density is high (so the…
compbiostats
- 1,557
1
vote
2 answers
How to approximate histogram(f(x)) from histogram(x)?
I have a histogram of a variable x, and I would like to get the histogram of f(x). Let's just say the transformation function is elementary and smooth. Is there a good method (maybe unbiased?) to transform the histogram(x) into histogram(f(x)),…
Azmisov
- 292
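For the special case where f is strictly monotonic, the transformation asked about here needs no approximation: every value in the bin between two edges maps into the interval between the images of those edges, so the counts carry over unchanged and only the edges move. The sketch below illustrates that case only. The name `transform_histogram` is hypothetical, and non-monotonic f is outside its scope.

```python
def transform_histogram(edges, counts, f):
    """Histogram of f(x) from a histogram of x, assuming f is strictly monotonic.

    Hypothetical helper: monotonicity is only checked at the bin edges.
    """
    if len(edges) != len(counts) + 1:
        raise ValueError("need exactly one more edge than counts")

    new_edges = [f(e) for e in edges]
    new_counts = list(counts)

    increasing = all(a < b for a, b in zip(new_edges, new_edges[1:]))
    decreasing = all(a > b for a, b in zip(new_edges, new_edges[1:]))
    if not (increasing or decreasing):
        raise ValueError("f is not strictly monotonic on the bin edges")

    if decreasing:
        # A decreasing f reverses the order of the bins.
        new_edges.reverse()
        new_counts.reverse()
    return new_edges, new_counts


# Example: squaring non-negative data.
edges_y, counts_y = transform_histogram([0, 1, 2, 3], [5, 3, 2], lambda x: x * x)
# edges_y -> [0, 1, 4, 9], counts_y -> [5, 3, 2]
```

Note that the transformed bins generally have unequal widths, so bar heights should be drawn as densities (count divided by bin width) rather than raw counts. Rebinning onto equal-width bins would additionally require an assumption about how values are spread within each original bin.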
1
vote
1 answer
y-axis ticks of distribution plot
What do the ticks on the y-axis mean?
I created a distribution plot of the titanic['Age'] data from the Kaggle Titanic dataset.
How can I learn more about the distribution of the age column from the graph? Can anyone explain what the ticks at…
sai_636
- 133
1
vote
2 answers
Is this histogram normally distributed?
Is this histogram normally distributed? I can't tell since there are peaks that are outside the curve.
user116018
- 11
1
vote
2 answers
Why do we use the histogram?
As somebody who never took a statistics course (but had to teach a few classes on it), I wonder why the histogram is introduced in a statistics course. Usually when something is introduced in a "watered-down" way, it is important in later more…
Nicolas Bourbaki
- 2,859
1
vote
1 answer
Evaluate overlap of two histograms which are normalized to area 1
I have two histograms that are normalized to area 1. They have different bin widths and shapes. The only thing they have in common is their area, which is 1. Can someone explain to me how I can evaluate the area overlap of these…
tester2k
- 85
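One possible reading of "area overlap" for two area-normalized histograms is the area under the pointwise minimum of the two step functions. Because the bin widths differ, a simple approach is to cut the x-axis at the union of both sets of bin edges, where both densities are constant on every piece. The sketch below assumes each histogram is given as bin edges plus per-bin density heights; the function name `histogram_overlap` is hypothetical.

```python
def histogram_overlap(edges_a, dens_a, edges_b, dens_b):
    """Area under min(a, b) for two piecewise-constant densities.

    Hypothetical helper: each histogram is (bin edges, per-bin density),
    with len(edges) == len(dens) + 1.
    """

    def density_at(edges, dens, x):
        # Density of the bin containing x, or 0 outside the histogram.
        for left, right, d in zip(edges, edges[1:], dens):
            if left <= x < right:
                return d
        return 0.0

    # On each interval between consecutive cut points both densities are
    # constant, so evaluating at the midpoint is exact.
    cuts = sorted(set(edges_a) | set(edges_b))
    overlap = 0.0
    for left, right in zip(cuts, cuts[1:]):
        mid = 0.5 * (left + right)
        height = min(density_at(edges_a, dens_a, mid),
                     density_at(edges_b, dens_b, mid))
        overlap += height * (right - left)
    return overlap


# Two uniform densities on [0, 2] and [1, 3]: they share the interval [1, 2].
shared = histogram_overlap([0, 2], [0.5], [1, 3], [0.5])  # -> 0.5
```

With both inputs normalized to area 1, the result lies between 0 (disjoint) and 1 (identical).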
1
vote
1 answer
(Frequency Histogram) - Bins of equal width
How do I create 5 bins of equal width to plot a frequency histogram of the following sequence of numbers?
0.15 0.54 0.23 0.65 0.36 0.15 0.87 0.65 0.90 0.64 0.74 0.98 0.96 0.74 0.82 0.91 0.19
This is just an example: I have more numbers. I am also…
Lord Dariu
- 19
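A minimal sketch of equal-width binning for the sequence in this question, using only the Python standard library. It assumes the bins should span the observed minimum to the observed maximum (the question does not say whether the range should instead be fixed, e.g. 0 to 1), and the function name `equal_width_histogram` is hypothetical.

```python
def equal_width_histogram(values, n_bins):
    """Split [min, max] into n_bins equal-width bins and count values per bin."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("all values are identical; bin width would be zero")
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]

    counts = [0] * n_bins
    for v in values:
        # The maximum value belongs to the last bin, not a new one.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    return edges, counts


data = [0.15, 0.54, 0.23, 0.65, 0.36, 0.15, 0.87, 0.65, 0.90,
        0.64, 0.74, 0.98, 0.96, 0.74, 0.82, 0.91, 0.19]
edges, counts = equal_width_histogram(data, 5)
# Bin width is (0.98 - 0.15) / 5 = 0.166
# counts -> [4, 1, 2, 4, 6]
```

The `counts` list can be plotted directly as bar heights, with `edges` giving the bar boundaries.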
1
vote
0 answers
Methods for 3D histograms comparison
When I wish to compare 2D image histograms I can use methods like Chi-Square and Intersection, but what are my options if I wish to compare 3D histograms (e.g., based on R, G, B values)?
Thanks.
wrek
- 111
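Bin-wise measures such as the Intersection and Chi-Square mentioned in this question do not depend on how the bins are arranged, so one option is to apply them to 3D histograms after flattening. The sketch below assumes both histograms use the same bin grid and are stored as nested lists. The helper names are hypothetical, and the Chi-Square shown is one common symmetric variant; normalization conventions differ between libraries.

```python
def flatten(hist):
    """Flatten an arbitrarily nested list of bin counts into one flat list."""
    if isinstance(hist, (int, float)):
        return [float(hist)]
    flat = []
    for sub in hist:
        flat.extend(flatten(sub))
    return flat


def intersection(h1, h2):
    """Histogram intersection: sum of bin-wise minima (larger = more similar)."""
    return sum(min(x, y) for x, y in zip(flatten(h1), flatten(h2)))


def chi_square(h1, h2):
    """Symmetric chi-square distance (smaller = more similar); empty bins skipped."""
    return sum((x - y) ** 2 / (x + y)
               for x, y in zip(flatten(h1), flatten(h2)) if x + y > 0)


# Two tiny 2x2x2 histograms (hypothetical counts over R, G, B bins).
h_a = [[[1, 0], [0, 1]], [[0, 0], [2, 0]]]
h_b = [[[0, 1], [0, 1]], [[0, 0], [2, 0]]]
score = intersection(h_a, h_b)   # -> 3.0
dist = chi_square(h_a, h_b)      # -> 2.0
```

Because these measures compare only identically indexed bins, they ignore how close neighbouring colour bins are to each other; cross-bin distances would need a different approach.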
0
votes
2 answers
Histogram - what constitutes grouped data?
One of the questions on my course asked us to identify what type of data a histogram is used for. Two of the options were continuous or grouped data. The correct answer was continuous as this is the form the original data is in. Everybody else got…
Geoff
- 31
0
votes
1 answer
histogram starting point
I would like to ask something about histograms. I have a dataset containing only positive values. How can I get a histogram in SPSS 25 whose first bin begins at a negative number? What does it mean?
Johanna
- 3