I have a quite large file (excel 120mb/csv 90mb) with data for different companies each company has a company code. I need to develop a tool that creates indexes for one company (defined by the user). So I am aware of methods to filter a dataframe (.query()) but cannot find a way to filter the data during the import process so that I will only load the required company data. The data looks like this:
EID CompCode V2 Age
1 1234 1 19
2 1234 3 34
1 2222 4 44
where CompCode is the code of the company, EID is the employee id (consecutive count), V2 is a question where 1-5 are the answers.
I can use excel or csv.
Anyone who can help me out here?