I am estimating the association between an exposure ("time in therapeutic range", possible values from 0% to 100%)) and time to a binary outcome ("chronic kidney disease"). I would like to assess potential non-linear relationships between the exposure and outcome.
The exposure is not normally distributed with many participants having a value of 0% or 100%. Is it still valid to model non-linear associations for this type of variable distribution using restricted cubic splines? I feel like the 0% and 100% participants are exerting outsized influence on the results. Additionally, the 0% and 100% categories don't provide the most clinically relevant information because it is obvious that 0% is bad and 100% is good.
Here are the splines with and without the 0%/100% patients. 4 knots minimized the AIC if 0%/100% participants were included. The relationship between the exposure (time in therapeutic range) and outcome was not non-linear after excluding the 0%/100% participants.
Your help is much appreciated.

