I've been working with an industry-related dataset where I need to analyze the correlation between a specific output (y) and several inputs (x1, x2, x3, etc.). During my search, I came across various correlation analysis methods such as Pearson, Kendall, Spearman, and Mutual Information. However, I'm facing a challenge: different methods yield completely different correlation rankings. In some cases, the variations are quite significant. For instance, one input might be the most correlated with y according to the Mutual Information method, but it ranks as one of the least correlated in other methods.
This inconsistency poses a dilemma for my analysis, which is crucial for an industry-related dataset. They need a clear understanding of how inputs correlate with y. I'm considering combining results from all methods but I'm worried that some might not be well-suited to my dataset, leading to inaccurate conclusions.
Could you advise on the most suitable correlation method for a general analysis? Can Mutual Information be considered the best method as it quantifies the "amount of information" obtained about one random variable by observing the other random variable? Are there any methods I should avoid because they are designed just for specific situations?
Also, I would appreciate any tips on how to effectively conclude and integrate the findings from different methods.