Math behind ensemble learning

Asked Dec 03 '22 at 09:21

Active Dec 03 '22 at 09:33

Viewed 45 times

I'm struggling to find some clear math behind ensemble learning.

I can simulate it very easily, eg:

import numpy as np, scipy.stats
r = np.random.random(1000)
d = np.array([0]*1000)
cors = []
for i in range(100):
  v = np.random.random(1000)
  c = scipy.stats.pearsonr(v,r).statistic
  cors.append(c)
  d = d + v * c # <- ensembling
print(np.max(cors))
print(scipy.stats.pearsonr(d,r).statistic)
0.07027996608008028
0.2646662315626593

It seems like a very simple concept, and yet I can't find any clear mathematical description as to why it works.

edited Dec 03 '22 at 09:33

asked Dec 03 '22 at 09:21

Blaze

1

There’s an extended discussion in Elements of Statistical Learning. – Sycorax Dec 03 '22 at 16:59
Wow, great book. Thanks – Blaze Dec 03 '22 at 17:01

Math behind ensemble learning

0 Answers0