I'm working on a project that has a large data set - too large for Excel to handle. I'd love to know if you could help.
I have a file that lists hundreds of thousands of zip codes, one per row, along with 67 personas. Each persona has its own column, and the value in that column indicates how many households in that zip code roll into that persona.
I'm trying to run some numbers to determine:
1. How many zip codes are in each persona (with the ability to drill into which zip codes those are).
2. How many households roll into each zip code (with the ability to drill into which zip codes those are).
3. How many households roll into each persona (with the ability to drill into which zip codes those are).
The 67 persona columns have "PZM" in their column headers. Each zip code row has a numerical value in the columns for the personas that are present in that zip code. That value is the household volume: it can be any number greater than 0, depending on how many households fall into that persona.
At the end of the day, I’d like some sort of view of the data that makes it clear what it is showing. Looking at thousands of rows of data is obviously not efficient. I would also need something that can repeat this process multiple times by reading different sets of columns (sometimes PZM, other times CXN, etc.).
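To make the ask more concrete, here is a rough Python sketch of the kind of thing I'm picturing. It is not a finished solution, and it rests on assumptions I have not confirmed against the real file: that the data can be exported as a CSV, that the zip code column is headed `zip`, and that the persona columns can be picked out by a header prefix such as `PZM` or `CXN`. The function name `summarize` and the sample headers (`PZM01`, `CXN01`, and so on) are made up for illustration.

```python
import csv
import io
from collections import defaultdict


def summarize(csv_file, prefix, zip_column="zip"):
    """Build the three summaries for every column whose header starts with `prefix`.

    `zip_column` and the prefix convention are assumptions about the file layout.
    """
    reader = csv.DictReader(csv_file)
    persona_columns = [name for name in reader.fieldnames if name.startswith(prefix)]

    zips_by_persona = defaultdict(list)         # persona -> zip codes with a value > 0
    households_by_zip = {}                      # zip code -> households summed across personas
    households_by_persona = defaultdict(float)  # persona -> households summed across zip codes

    for row in reader:
        zip_code = row[zip_column]  # kept as text, so leading zeros survive
        zip_total = 0.0
        for persona in persona_columns:
            value = float(row[persona] or 0)  # treat blank cells as 0
            if value > 0:
                zips_by_persona[persona].append(zip_code)
                households_by_persona[persona] += value
                zip_total += value
        households_by_zip[zip_code] = zip_total

    return zips_by_persona, households_by_zip, households_by_persona


# Tiny made-up sample standing in for the real file.
sample = io.StringIO(
    "zip,PZM01,PZM02,CXN01\n"
    "10001,10,0,5\n"
    "10002,0,7,0\n"
    "10003,3,4,\n"
)
zips_by_persona, households_by_zip, households_by_persona = summarize(sample, "PZM")

# Question 1: zip code count per persona; the lists themselves are the drill-down.
zip_counts = {persona: len(zips) for persona, zips in zips_by_persona.items()}
print(zip_counts)
print(households_by_zip)
print(dict(households_by_persona))
```

For the real file, the idea would be to pass an opened file instead of the sample (for example `open(path, newline="")`) and to call the same function again with `"CXN"` or another prefix. The file is read one row at a time, so Excel's row limit is not a factor.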
I'm also working on this on my own, so please let me know if you can assist before you start working on it!