5

First, the expected survival time when time is time to event is treated as a continuous r.v. is given as the following:

$$\int_0^{\infty} S(t)\, dt$$

The formula for the mean residual survival time $\text{mrl}(u) = E(T -u | T>u)$ is given by

$$\frac{1}{S(u)}\int_u^{\infty} S(t)\, dt.$$

Can anyone explain the logic of dividing by $S(u)$?

For discrete time, this step does not appear needed (cf related question). Why?

Thanks!

ADD after Henry's explanation:

Lets say I have discrete time (12 months with the following hazards and Survival).

month<-seq(0,12)
h_x<-c(0,0.115706673,0.110186514,0.115769107,0.108296623,0.08908868,0.082548228,0.060146699,0.048112058,0.042197452,0.036919831,0.024691358,0.012787724)
S_x<-c(1,0.8843,0.7869,0.6958,0.6204,0.5651,0.5185,0.4873,0.4639,0.4443,0.4279,0.4173,0.4120)

plot(month,S_x,type="b")

1) Is the mean (truncated) lifetime sum(S_x[2:12]) #6.3117?

2) Lets say that we are know that a person survives up to 5th month (start of 5) i.e. they get through 4 complete months and we now want to estimated their mean (truncated) residual life. So, $\gamma$ =7 Is this computed as sum(S_x[5:12])/S_x[5] #6.358317 since 6.358 < $\gamma$+1 ?

B_Miner
  • 8,630
  • 1
    I'm a little unsure of the notation, but I think $\prod_{k=t}^{j}\left(1-h(k|\mathbf{x})\right)$ corresponds to $S(t)/S(u)\ dt$. Notice how the former is automatically normalized to $1$ for $j\lt t$ and, similarly, that $S(t)/S(u)\ dt = dt$ at $u=t$. Therefore the answer to your question must be lurking in the definition of $h$. Could you provide that definition so that this pair of questions can stand on their own? – whuber Oct 18 '11 at 21:42
  • Not sure I follow @whuber. Can you explain more? – B_Miner Oct 18 '11 at 23:03

1 Answers1

4

$S(t)$ is the probability of surviving to time $t$ or beyond.

So the probability of surviving to time $t$ or beyond given that you have survived to time $u$ (with $u \le t$) is $\frac{S(t)}{S(u)}$, and so this is also the probability of a residual survival time of $t-u$ or beyond after time $u$.

This is implicit in your earlier discrete question, which could have looked like $$\frac{\prod_{k=0}^{j}\left(1-h(k|\mathbf{x})\right)}{\prod_{k=0}^{t-1}\left(1-h(k|\mathbf{x})\right)}$$ but this is more easily written as ${\prod_{k=t}^{j}\left(1-h(k|\mathbf{x})\right)}.$

Henry
  • 39,459
  • Henry: I posted an addition to my question to help clarify. Many thanks! – B_Miner Oct 19 '11 at 01:10
  • 1
    Your extended question is slightly spoilt by R starting counting at 1 not 0. I would given sum(S_x[2:13]) (6.724) and sum(S_x[6:13])/S_x[5] (6.022). – Henry Oct 19 '11 at 01:24
  • should the mrl be sum(S_x[6:13])/S_x[6] as well? – B_Miner Oct 19 '11 at 01:33
  • I would say not, for same reason the first answer is sum(S_x[2:13])/S_x[1] not sum(S_x[2:13])/S_x[2] nor sum(S_x[1:13])/S_x[1] – Henry Oct 19 '11 at 01:37
  • 2
    @B_Miner: To expand briefly on Henry's nice answer, the formulas you provided in the question are completely general in the sense that they hold for all distributions of nonnegative random variables. – cardinal Oct 19 '11 at 09:34
  • Should one be able to look at the MRL in the case of continuous r.v. T in terms of simple conditional probability? For example P(T>x+t | T>t) = P(T>x+t $\cap$ T>t) / P(T>t). What happens to the second part of the numerator to get P(T>x+t) / P(T>t)? – B_Miner Oct 19 '11 at 12:54
  • 1
    @B_Miner: As events $[T>x+t] ∩ [T>t] = [T>x+t]$ because $[T>x+t] \subset [T>t]$ – Henry Oct 19 '11 at 15:18
  • @Henry: Hope to revisit this as I think that the extended question above 1) Is the mean (truncated) lifetime sum(S_x[2:12]) #6.3117? was misunderstood. I was re-reading this and wonder if my use of the word truncated confused. I was asking how to compute the mean lifetime (which is the mean of the truncated distribution since the survivor function does not go to zero). Should this not be a summation of the entire survival curve: sum(S_x[1:13])/S_x[1] where the denominator is typically ignored since it is always equal to 1. – B_Miner Dec 10 '11 at 03:48
  • This is really a matter of interpreting the data in the question. Suppose you had started with h_x <- c(0,1); S_x <- c(1,0). Then the probability of surviving to the start of the first month is $0$. So I would say the expected lifetime was zero, i.e. sum(S_x[2:2])/S_x[1]. – Henry Dec 10 '11 at 12:21
  • I was reading this: www.data-miners.com/resources/Will%20Survival.pdf page 3. and this support.sas.com/resources/papers/proceedings10/252-2010.pdf page 2. In both cases (first is discrete time, second is continuous) it seems that the calculation for the mean lifetime (truncated/restricted or not) started at summing or integrating at t=0, which should include 1, the survival at t=0. So, I am confused if I am predicting mean life (not residual) in the first case I gave as 7.7 or 6.7. – B_Miner Dec 10 '11 at 13:45
  • Henry - I will post this as a new question. – B_Miner Dec 10 '11 at 14:36
  • Henry, I am revisiting my inability to completely grasp discrete time survival analysis. I posted a fresh question about how to calculate S(t) here: http://stats.stackexchange.com/questions/60036/discrete-time-survival-analysis-correct-way-to-write-survival-function. Is it true given your answer "S(t) is the probability of surviving to time t or beyond" that you regard S(t) as P(T>=t)? Note, the equal sign. – B_Miner May 26 '13 at 14:28