I am looking at waiting times between two events from multiple patients, so I'm looking at a gamma distribution. Turns out, the model is plotting out an exponential distribution, which if I was to look at something like the arrival times of ambulances at A&E this would make sense. However, I'm now unsure whether I am modelling my data correctly, as I am only looking at a single time-to-event per person, each person being mutually exclusive from the next, and exponential distributions are meant to be memoryless. Surely, any data is memoryless if there is only one event per independent and mutually exclusive case?
The scenario: I would like to plot the probability distribution of the time from a headache diagnosis (H) made in a Doctors (GP in the UK) to a headache diagnosis made at a special referral clinic (R). This is on a per patient basis e.g.,
P1:H------>R
P1:H-------------------->R
P1:H----------->R
P1:H-->R
Is it correct to try modelling this kind of data using a gamma distribution and then would it be logical to end up with an exponential distribution?
EDIT
A histogram distribution of what the data looks like:

The gamma distribution - Black - idealised gamma distribution. Red - modelling distribution:

You ask if the Gamma is appropriate. What is the CV for the data you're modeling? Have you visually looked at the histogram? Perhaps if you posted some more details we could make better modeling suggestions.
– SecretAgentMan Sep 04 '18 at 20:20(2) Your histogram has ~13 bins. What was the sample size of your data?
(3) Visually, your data looks like it might be modeled by a hyperexponential distribution fairly well depending on the application.
– SecretAgentMan Sep 05 '18 at 15:17