I am currently collecting data for a project at University. My aim is essentially to investigate whether actively managed equity-based mutual funds outperform passive mutual funds. I am using the CRSP Survivor-Bias-Free US Mutual Fund Database as the source for my data. However, I am having trouble with sorting the data. I have no experience with MatLab, R or similar coding languages. Therefore, my only solution seems to be sorting the data using the "Conditional statement" feature in wrds and using Excel.
Until now, I have managed to discard all funds that are not equity-based or mixed equity and fixed income. Now I have to exclude all funds that have a front-or rear-load as I only want data on no-load funds. I have tried sorting out load-funds by creating the following conditional statement in my wrds query: "WHERE front_load = 0 AND dollar_amt = -99". This should only include funds with a maximum load of 0, because a missing (-99) "dollar_amt" means that the given front-load is the maximum load for the fund.
The problem I come across is that when comparing the resulting dataset with a dataset that shows all funds and their loads, I notice that some of the funds in my no-load dataset show up with loads in the other dataset. This is because no-load dataset does not take into account that a fund might have been a no-load fund in a given time period, but later has become a fund with load. Therefore, I somehow have to exclude funds that have ever had a fund load.
I am hoping that someone is familiar with the wrds database and the options regarding conditional statements. If not, I understand that it is hard to provide me with a solution.
Thank you.
The database I use: https://wrds-www.wharton.upenn.edu/pages/get-data/center-research-security-prices-crsp/quarterly-update/mutual-funds/