To prepare my data for uploading into my system I have to scrub it and do the following....
I have a table of 10,000 plus records. Each record has a name field and a business indicator field which is empty. Based on the name field, I have to determine if that record is a business record or a personal record. If I determine it is a business, I put a "B" in the business indicator field. To do that I have to look for key words, letters and or phrases in the name field. For example if the name field has INC, LLC, LTD, ASSN, Group, etc etc in it I can label it as a business record. My problem is I have 10,000 - 20,00 records and about 200 key words/letters/phrases to search for. If I filter each record for each key word it would take forever to identify the business records (which is the objective).
Is there a smarter, better and faster way to identify which records are business records?
Sample Table 1
Acct Date Amt Business Indicator Name Address City State
123456 03/10/14 500.25 John Deere Inc John's street Philadelphia PA
125698 02/12/14 235.69 John Doe 1 ABC Drive Trenton NJ
sample list of key words (THERE ARE ABOUT 200 OF THEM)
ABC CITY
ADP COMM
AUTO CTR
BANK CTY
BUR DCP
BUIL INC
Looking at my table I would put a "B" in the business indicator column because of the INC in the name field. How could I determine this for 10,000 - 20,000 records, capturing 200 key phrases?
thank you
I have a table of 10,000 plus records. Each record has a name field and a business indicator field which is empty. Based on the name field, I have to determine if that record is a business record or a personal record. If I determine it is a business, I put a "B" in the business indicator field. To do that I have to look for key words, letters and or phrases in the name field. For example if the name field has INC, LLC, LTD, ASSN, Group, etc etc in it I can label it as a business record. My problem is I have 10,000 - 20,00 records and about 200 key words/letters/phrases to search for. If I filter each record for each key word it would take forever to identify the business records (which is the objective).
Is there a smarter, better and faster way to identify which records are business records?
Sample Table 1
Acct Date Amt Business Indicator Name Address City State
123456 03/10/14 500.25 John Deere Inc John's street Philadelphia PA
125698 02/12/14 235.69 John Doe 1 ABC Drive Trenton NJ
sample list of key words (THERE ARE ABOUT 200 OF THEM)
ABC CITY
ADP COMM
AUTO CTR
BANK CTY
BUR DCP
BUIL INC
Looking at my table I would put a "B" in the business indicator column because of the INC in the name field. How could I determine this for 10,000 - 20,000 records, capturing 200 key phrases?
thank you