Help Parsing Data to Columns

redhots4

Board Regular
Joined
Aug 30, 2004
Messages
136
Office Version
  1. 365
Platform
  1. MacOS
I need some guru help for a real challenge I'm facing. I have access to data from a website with 4500-5000 records. Each record has 11-14 rows of data containing demographic details about the people records. I want to manipulate this data into a normalized data file with columns for each data point. (Name, address, email, company name, etc). Not all records have the same number of data points, so creating lookups based on a consistent master field isn't possible. For example, not all records use "nickname", although I want to see the nickname when it's available. Sometimes the Company Name is missing, so in field where I expect to see Company, I see the address value.

I've copied two records into the sample file and shown my desired output. I appreciate any help I can get!

Sunshine Nurse Email Contact Listv2.xlsx
ABCDEFGHIJKLM
1Raw DataFull NameNicknameCust #EmailCompanyAddressCityStateZipOfficecellFax
2Chris SmithChris SmithChris51094email2@aol.com50 Edgewater DrCoral GablesFL33133305-555-7274305-555-7284305-555-0134
3Chris Smith
4"Chris"
5Cust #51094
6Eligible
750 Edgewater Dr
8Coral Gables, FL 33133-6988
9Office: 305-555-7274
10Cell: 305-555-7284
11Fax: 305-555-0134
12email2@aol.com
13Randy JonesRandy Jones91186rjones@ABCmanu.comABC Manufacturing Inc.500 S Jones Ave Ste 100Fort LauderdaleFL33301954-555-6032954-555-6032
14Randy Jones
15Cust #91186
16Eligible
17ABC Manufacturing Inc.
18500 S Jones Ave Ste 100
19Fort Lauderdale, FL 33301-4437
20Office: 954-555-6032
21Cell: 954-555-6032
22rjones@ABCmanu.com
Sheet2
 

Excel Facts

Back into an answer in Excel
Use Data, What-If Analysis, Goal Seek to find the correct input cell value to reach a desired result
Unless you can
a) include the field name in the data you're scraping (so it would be the first column), or
b) include all field regardless of whether or not they contain data (and always in the same order)
I'd say the input is too volatile for any non-manual solution. Some of your values contain keywords that could help (e.g. Cell) but not all of them do.

Probably the hardest part after that would be breaking up the addresses as you've shown as I expect they would not always have the same delimiters (cityCommaSpaceStateSpaceZip)
 
Upvote 0
Thanks for replying. I've already spent a lot of time writing formulas to split the data into columns as shown on the sheet here. What I can't figure out is how to get all of that data on one row with the Full Name in the left-hand column as the lookup. (This table contains sensitive data so I've mocked this up to show what it looks like. Therefore you're not seeing the actual formulas used to populate the various columns)

Sunshine Nurse Email Contact Listv2.xlsx
ABCDEFGHIJKLMNO
1Raw DataFull NameLast NameFirst NameNicknameCust #EmailCompanyAddressCityStateZipOfficecellFax
2Chris Smith
3Chris SmithChris SmithSmithChris
4"Chris"Chris
5Cust #5109451094
6Eligible
750 Edgewater Dr50 Edgewater Dr
8Coral Gables, FL 33133-6988Coral GablesFL33133
9Office: 305-555-7274305-555-7274
10Cell: 305-555-7284305-555-7284
11Fax: 305-555-0134305-555-0134
12email2@aol.comemail2@aol.com
13Randy Jones
14Randy JonesRandy JonesJoneRandy
15Cust #9118691186
16Eligible
17ABC Manufacturing Inc.ABC Manufacturing Inc.
18500 S Jones Ave Ste 100500 S Jones Ave Ste 100
19Fort Lauderdale, FL 33301-4437Fort LauderdaleFL33301
20Office: 954-555-6032954-555-6032
21Cell: 954-555-6032954-555-6032
22rjones@ABCmanu.comrjones@ABCmanu.com
Sheet2
 
Upvote 0
I'm not much of a formula guy; was thinking of identifying the components and putting them in the correct column by using code. Main problem is, I see no reliable way to determine which row belongs where. Any cell with "Cust" in it seems obvious; others not so much.
 
Upvote 0

Forum statistics

Threads
1,215,092
Messages
6,123,063
Members
449,090
Latest member
fragment

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top