I have an amalgamation of independently collected datasets covering a large area of ocean where surveys to count birds have occurred sporadically over a long time. Some surveys were from boat, some were from air, and often there is some spatial overlap between these efforts.
I want to link the bird counts to environmental covariates, and use that relationship to make predictions about abundance in areas that have not been sampled based on their environmental conditions. To control for variability in counts due to non-environmental data, I also wanted to include additional information such as vessel type (plane vs boat) but, given how the environmental data have been collected--there is a grid for the entire study area, each grid cell has one row of environmental data--I am not sure that this is a statistically sound thing to do. Environmental variables are all static (e.g., distance to shore, depth to seafloor, etc) so they are not expected to change through time. I have also summarized the total count and effort for each surveyed grid cell such that it can be linked to the gridded environmental data. When I think about splitting these rows based on grid cell further by vessel type, I can't help but wonder if this is a form of pseudoreplication because the environmental covariates for a given grid cell would be identical.
I am familiar with examples of pseudoreplication, like "treating multiple leaves from the same plant as replicates; treating multiple plants from the same pot or flat as replicates" but I am not sure if what I have here would also be considered pseudoreplication -- would this be equivalent to treating multiple counts from the same location as replicates...?
Here is a hypothetical dataset to better illustrate my question:
- Database that excludes vessel type as a covariate:
| Grid ID | Count | Distance to shore (m) | Distance to seafloor (m) |
|---|---|---|---|
| 1 | 25 | 98.7 | 204.6 |
| 2 | 2 | 1049.2 | 1027.8 |
- Database that includes vessel type as a covariate:
| Grid ID | Count | Vessel | Distance to shore (m) | Distance to seafloor (m) |
|---|---|---|---|---|
| 1 | 20 | Boat | 98.7 | 204.6 |
| 1 | 5 | Plane | 98.7 | 204.6 |
| 2 | 1 | Boat | 1049.2 | 1027.8 |
| 2 | 1 | Plane | 1049.2 | 1027.8 |
Any advice would be helpful!
Vesselas a predictor variable. – EdM May 19 '22 at 18:38