Given these values, is it possible to generate random values that conform to this distribution (using Python, but preferably without the SciPy package)?
| Statistic | Value |
|---|---|
| Mean | 1.518 |
| Std Dev | 24.827 |
| Skew | 140.770 |
| Kurtosis | 25342.612 |
| Mode | 1 |
| Min | 1 |
| Max | 5735 |
| Count | 182557 |
Purpose: I'm trying to generate dummy / fake / synthetic data based on production data.
I've tried finding several solutions, but most of them seem to use SciPy.
Reason for avoiding SciPy: I don't want to include ~200 MB of dependency just to use a single function.
EDIT
Old title: How to generate random values based on mean, standard deviation, skew and kurtosis in Python without SciPy?
I gave up avoiding installation of yet another package. I've included SciPy in our dependencies, but I'm still stuck with finding a good solution for this.
The solutions I've tried either generate too many zeros or all ones. It's confusing to me because I don't have a statistical background, and I know that it's against SE's rules to simply ask for working code as solution, I'm asking for some tips or nudges in the right direction.
I've tried a few functions (skewnorm, pearson3, etc.) without understanding what they're doing and merely checking if the output looks as I expect. Of course, I couldn't find a satisfactory solution.
README/documentation files without impeding our software's functionality. Finally, do you build from source or use a precompiled wheel, the latter is often bigger. – usεr11852 Jul 29 '23 at 22:21numpyfunctions. – usεr11852 Aug 02 '23 at 22:19