The other answers already provide a good enough explanation, but I wanted to provide a worked example to show the differences in case somebody discovers this question down the road. I have fit a model using the biopsy data in the MASS package in R. The only adaptation to this data that I made was converting the character values from the outcome variable to a binary 0/1 value.
#### Load Libraries ####
library(mgcv)
library(dplyr)
Save Data From MASS Package
biopsy <- MASS::biopsy %>%
as_tibble() %>%
mutate(class = ifelse(class=="benign",0,1))
biopsy
I then fit the model with 6 predictors using cubic regression splines and REML fitting with a logistic regression model.
#### Fit Model ####
fit <- gam(
class
~ s(V1, bs = "cr")
+ s(V2, bs = "cr")
+ s(V3, bs = "cr")
+ s(V4, bs = "cr")
+ s(V5, bs = "cr")
+ s(V6, bs = "cr"),
data = biopsy,
method = "REML",
family = binomial
)
Finally, I plot the model with predicted probability. The first part of the plot function uses the model fit, the second part transforms the plotted values into predicted probability, the third part puts it all on one page, and the last argument includes standard error around the mean.
#### Plot Model by Predicted Probability ####
plot(fit,
trans=plogis,
pages=1,
seWithMean = T)
Your plot should look like this:

Here you can see the values are now bounded between 0 and 1, with each decimal value directly interpretable as a percentage in the "class" outcome. As an example, the plot here shows that a zero value on V1 (tumor clump thickness) has a predicted probability of malign tumor growth of around 20%, whereas chances of having a malign tumor increase to near 100% at the maximum value of V1.
However, these are the probabilities when centered to an average value of .5. Including the intercept allows one to look at the probabilities when all other predictors are set to their average value. This can be achieved with the following code:
#### Include Intercept ####
plot(fit,
trans=plogis,
pages=1,
seWithMean = T,
shift = coef(fit)[1])
Giving you this plot:

Contrarily, simply removing the trans=plogis argument plots by log odds, which is likely what you saw when you plotted the model.
#### Removing Trans=Plogis Argument ####
plot(fit,
pages=1,
seWithMean = T)
Which looks like this:

Hopefully the difference between both plotting methods is more clear now.