Suppose, I have Dataframe with median home prices and number of homes sold in a market. I'd like to calculate a confidence metric around median home price based on # of home sales / transaction.
import pandas as pd
Create a list of data
data = [
["Dallas", 1000, 250000],
["Austin", 2000, 300000],
["Texas", 3000, 350000],
]
Create a pandas DataFrame
df = pd.DataFrame(data, columns=["market", "number_of_homes_sold", "median_home_price"])
Print the DataFrame
print(df)
- What are some appropriate confidence scores / metrics to calculate based on
nor the sample size? - Would I calculate the confidence interval? If so, how?
I have the raw transactions-level data.