I read the proof of the law of large numbers, which states that the sample mean converges in probability to the population mean; it is proven using Chebyshev's inequality (Here).
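For reference, this is the version of the argument I have in mind. It is a sketch that assumes i.i.d. observations $X_1, \dots, X_n$ with mean $\mu$ and finite variance $\sigma^2$; the symbols are my own notation, not necessarily that of the linked proof.

$$
\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i, \qquad
\mathbb{E}[\bar{X}_n] = \mu, \qquad
\operatorname{Var}(\bar{X}_n) = \frac{\sigma^2}{n}
$$

$$
P\left(\left|\bar{X}_n - \mu\right| \ge \varepsilon\right)
\le \frac{\operatorname{Var}(\bar{X}_n)}{\varepsilon^2}
= \frac{\sigma^2}{n\,\varepsilon^2}
\xrightarrow[n \to \infty]{} 0
\quad \text{for every } \varepsilon > 0 .
$$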
I am curious whether there is a similar result for estimating joint and conditional probability distributions. I found a couple of articles, Here1 and Here2. My question is: is there a way to estimate these probabilities for discrete variables without having to go through the data, count the occurrences, and divide by the total number of rows?
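To be concrete about the counting approach I would like to avoid, here is a minimal sketch in Python. The function names and the toy data are hypothetical, made up only for illustration; it assumes the data is a list of (x, y) pairs of discrete values.

```python
from collections import Counter


def empirical_joint(pairs):
    """Estimate P(X=x, Y=y) as count(x, y) / n (relative frequency)."""
    n = len(pairs)
    counts = Counter(pairs)
    return {xy: c / n for xy, c in counts.items()}


def empirical_conditional(pairs):
    """Estimate P(Y=y | X=x) as count(x, y) / count(x)."""
    joint_counts = Counter(pairs)
    x_counts = Counter(x for x, _ in pairs)
    return {(x, y): c / x_counts[x] for (x, y), c in joint_counts.items()}


# Hypothetical toy data: each row is one observation (x, y).
data = [("a", 0), ("a", 1), ("a", 1), ("b", 0)]

joint = empirical_joint(data)       # keys are (x, y), values estimate P(X=x, Y=y)
cond = empirical_conditional(data)  # keys are (x, y), values estimate P(Y=y | X=x)
```

Each joint estimate is the sample mean of an indicator variable for the event (X=x, Y=y), which seems to be why the law of large numbers would apply to it in the same way as to an ordinary sample mean.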