My dataset consists of 1 million observations.
I am running a linear regression of Y on X (mean-centered and scaled variable), trying to demonstrate that there is a curvilinear relationship between the two variables of the form lm(y ~ x + I(x^2). This results in a significant coefficient on the squared term. As a sanity check, I started adding polynomials of other degrees I(x^3), I(x^4), and so on, to see if these weren't significant. To my chagrin, polynomials all the way up to I(x^30) were significant.
I assume this is partially because my dataset is so large, and thus detecting statistical significance is relatively easier due to high precision estimates. So my question is:
Why is this happening in my dataset? Is there a stricter way to test significance in big datasets? What is the correct way for detecting a curvilinear relationship in big datasets between y and x?