Basically, what I am creating is a access data cleaning template which will automatically sort out information into a single standardized form for all of the excel files that I drop into the cleaning template.
I will have 200 different excel files with similar column headings, but not exactly the same (For example, "Disease Type" column in one excel sheet would correspond to "Disease" column in another excel sheet.
Some sheets will have more than usual number of columns in which case I will have to remove unnecessary column headings.
How do you go about it? I am thinking about developing macro that compares two data sheet (one standardized, one raw sheet) and take only the column headings I need and save that information on a different sheet. I would appreciate any help on this project.
Here is similar question by someone else from this forum
I am working with 2 separate files. Let's refer to them as (File 1) and (Template 1).
After opening (File 1), I need to compare the number of columns with headings to the number of columns with headings in
(Template 1). If the number of columns in (File 1) exceeds the number in (Template 1) than I must now examine each of the column headings in (File 1) to determine which of these does not exist in (Template 1). I will delete the entire column from (File 1) after determining which it is.
Also, the data has to be in a separate file (shouldn't be consolidated) since they are from many different companies. We would like to standardize and remove the unnecessary columns first and work on sorting the data within the column into a few different categories that we need to analyze the data.
One suggestion has been made which was to manually standardize the column headings for all 200 files since trying to program all of the variable possibilities does not seem practicable. Once it's done, I can use code to "Find" those headings I need and copy the columns to a new file. , then your first object should be to standardize the column headings for data which you want to capture. The most likely method to do that would be manually, since trying to program all of the variable possibilities does not seem practicable. Once you get the headers standardized, you could then use code to 'Find' those headers and copy the columns to a new excel file.
Any other ideas? Final work will be ultimately moved to SQL database.
Thanks
I will have 200 different excel files with similar column headings, but not exactly the same (For example, "Disease Type" column in one excel sheet would correspond to "Disease" column in another excel sheet.
Some sheets will have more than usual number of columns in which case I will have to remove unnecessary column headings.
How do you go about it? I am thinking about developing macro that compares two data sheet (one standardized, one raw sheet) and take only the column headings I need and save that information on a different sheet. I would appreciate any help on this project.
Here is similar question by someone else from this forum
I am working with 2 separate files. Let's refer to them as (File 1) and (Template 1).
After opening (File 1), I need to compare the number of columns with headings to the number of columns with headings in
(Template 1). If the number of columns in (File 1) exceeds the number in (Template 1) than I must now examine each of the column headings in (File 1) to determine which of these does not exist in (Template 1). I will delete the entire column from (File 1) after determining which it is.
Also, the data has to be in a separate file (shouldn't be consolidated) since they are from many different companies. We would like to standardize and remove the unnecessary columns first and work on sorting the data within the column into a few different categories that we need to analyze the data.
One suggestion has been made which was to manually standardize the column headings for all 200 files since trying to program all of the variable possibilities does not seem practicable. Once it's done, I can use code to "Find" those headings I need and copy the columns to a new file. , then your first object should be to standardize the column headings for data which you want to capture. The most likely method to do that would be manually, since trying to program all of the variable possibilities does not seem practicable. Once you get the headers standardized, you could then use code to 'Find' those headers and copy the columns to a new excel file.
Any other ideas? Final work will be ultimately moved to SQL database.
Thanks