Comment: As @whuber has commented, this is not a situation
in which hypothesis testing is likely to be useful.
Consider the fictitious data sampled in R (from right-skewed gamma populations) and summarized
below:
set.seed(401)
x1 = rgamma(57000, 4, 1/400)
summary(x1); sd(x1)
Min. 1st Qu. Median Mean 3rd Qu. Max.
85.09 1016.90 1466.74 1598.92 2045.88 9062.14
[1] 799.5738
x2 = rgamma(57000, 4, 1/399)
summary(x2); sd(x2)
Min. 1st Qu. Median Mean 3rd Qu. Max.
53.02 1014.25 1463.89 1592.21 2032.31 7999.17
[1] 794.2221
For these particular data, the second sample has slightly smaller
max, min, quartiles, median, mean and SD. Without knowing the context of your experiment, it is not possible to say
whether these differences are of practical importance.
Also, your real data might
show other differences.
Boxplots show some of the slight differences noted above and happen to show a slightly different pattern of high outliers
between the two samples.
boxplot(x1, x2, horizontal=T, col="skyblue2")

The Wilcoxon rank sum test is equivalent to the Mann-Whitney U test, but uses a different test statistic.
As for your data, the test does not show a statistically significant
difference at the 5% level.
wilcox.test(x1, x2)
Wilcoxon rank sum test
with continuity correction
data: x1 and x2
W = 1.631e+09, p-value = 0.2448
alternative hypothesis:
true location shift is not equal to 0
With such large sample sizes, it makes better sense to
look at the data descriptions than at the results of the
test. The test looks only for a difference in location
and descriptive statistics may suggest other differences--
some of which might be of practical importance.