How to organize large data set for statistical analysis?

Brian

Board Regular
Joined
Apr 24, 2002
Messages
113
How should I organize a large data set for statistical analysis? Any suggestions? Examples? Sample control chart xls files?

Data:
- Each part undergoes 3 inspections:
visual (6 data points per part)
test1 (40 data points per part)
test2 (40 data points per part) {ex measure paint thickness in 40 locations on each part)
- one test lot of 25 parts is inspected per day.
- Therefore 25 x(6 + 40 + 40) = 2150 data points/day

Analysis:
- Individual part: plot graphs of tests 1 &2 to show variation over the surface.
- Daily Lot of 25: plot mean and std dev for the 25 parts,
- Weekly: plot control chart and other summaries. Some chart plot avg per part, others avg per lot of 25 parts.
- Monthly: plot charts with each week's test data condensed to a single point on a chart.

Current (inherited) system:
- daily data file contains all data, calculations and charts for the lot of 25.
- macro copies the summary data that was calculated in the daily file (mean, std dev, etc.) to the weekly file
- weekly file: plots summary data (contains many weeks of daily summary data)
- monthly file: manually paste one week of data onto a sheet that calculates the weeks averages, then copy that result to a new table.

My concerns:
- since the daily file pushes the results of its calculations up to the weekly file, If I fins a mistake in the daly file calc, then I need to edit 6 months or more of daily files to correct it. (already found one mistake in the monthly calc, which would be ugly to recalc with the current manual copy/paste intensive method)
- Since the weekly file does not contain the original data, (1) to drill down into the data, I need to find and open the daily files, and (2)to add any new calcs, I need to add equations to every individual file and push the data back up again.

Question:
- How should the data be organized? I am leaning towards:
1) keeping only raw data in the daily file (labelled with the date), then have a "daily analysis file" where you select the desired date, and it either uses a macro to copy the data into the file, or use indirect statements to link to the file.
2) For the weekly file, store all data, then us a pivot table to perform the calcs (but then the file will get huge) or, use a macro to import a week of data at a time, perform the stat calcs, save them and then erase the original data, or lots of indirect equations to link files (but then long recalc time)

Any suggestions? Examples? Sample control chart xls files?

Thanks,

Brian
 

Excel Facts

Excel Can Read to You
Customize Quick Access Toolbar. From All Commands, add Speak Cells or Speak Cells on Enter to QAT. Select cells. Press Speak Cells.
Without really looking at your app, I can only guess as what would be best. I think, however, I would consider making a separate table for each test. I assume each part inspected has some unique ID. So your fields might be, Part ID, Date, Data Point1, Date Point2, etc. You can then use MS Query to link your tables and retrieve data by date. See Tushar's website for the tutorial RDBMS in Excel

http://tushar-mehta.com/index.html?http://tushar-mehta.com/excel/
 
Upvote 0

Forum statistics

Threads
1,214,584
Messages
6,120,385
Members
448,956
Latest member
JPav

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top