I'm refreshing my (basic) knowledge about maximum likelihood and stumbled over
which summarizes the concept quite well, I'd say. I only wonder when $ argmax L(\theta) $ is maximized, how it is really done. Looking at the example above, wouldn't I need something that measures and compares the residuals like in least squares? What exactly is $ \theta $ (representing) ?
