Apologies if the question is too trivial but what exactly sets these two apart?
Let's say that I have a set of data for a hundred points (the independent variable may not be uniformly spaced) as:
{{1, 7}, {2, 8},...,{100, 5}}
Now, I can apply any of the extrapolation techniques (Newton's, Lagrange's or even Curve Fitting for that matter) and get a y = f(x). Now if I put in any x, in or out from my original data set, I can get the corresponding y. This way I predicted a y value which wasn't originally in my data set.
How is Prediction different from this?