ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
2
The questions below are designed to introduce you to the type of work and thinking that is required of ETL developers. You will find from these sample questions that this that the work requires paying attention and having a good attention to detail. It's also a great introduction to common data structures and formats that are used in many other jobs and contexts. Hopefully you find these exercises fun and rewarding!
3
4
5
Question #1
From Data Set 1 to Data Set 2 (below) notice how the Patient_ID has changed to the MRN (Medical Record Number).
6
What do you think has been added to the Patient_ID to form a MRN?
7
8
9
10
Data Set 1
11
Patient_IDPatient_LNPatient_MNPatient_FNDOBSSNStreet_AddressCityState
Zip_Code
PhonePhone Type
12
40923JohnsLynnJanice1/25/1967182786543782 Hancock StreetClintonNJ18809
908-553-9012
Cell
13
34058SmithNicholasWarren7/31/197620987543682-10 Queens Blvd., Apt 2GNew YorkNY11120
917-785-8766
Home
14
78643AlexanderShaunPeter9/30/198010670542356 Humboldt St., Apt. 1CBrooklynNY11211
212-396-6700
Work
15
98723ParsonsMichelleJuanita2/25/199215090876572 Orion CourtHempsteadNY11550
212-598-9866
Work
16
10298ButtsRachelDionne1/27/195911520974289 JFK Bldv., Apt 34CJersey CityNJ07399
908-553-9123
Cell
17
67402KellerNicoleSusan3/18/19644301983609 Jones Ave.WycoffNJ07481
908-781-7651
Cell
18
87205FoxThomasJohn8/3/2005309674365777 Orient Way, Unit 7DRutherfordNJ07070
201-735-7946
Home
19
12944JimenezLionelAllejandro10/10/2003192786734321 East 125th St., #8DNew YorkNY10020
646-921-0090
Work
20
55309O'KelleyPaulMark7/11/1997981036455
89 Grand Parkway South., Apt. 67
Staten IslandNY10301
646-324-6548
Cell
21
21307TooleMyrnaJanette4/19/1942784012376239 Millers CircleJamaicaNY11405
929-407-4002
Cell
22
23
Data Set 2
24
MRNLast_Name
Middle_Initial
First_NameDOBSSNStreet_Address_1
Street_Address_2
CityStateZip+4Home PhoneWork PhoneCell Phone
25
J-40923JOHNSLJANICE25-Jan-67182-78-6543782 Hancock StreetClintonNJ18809-00009085539012
26
S-34058SMITHNWARREN31-Jul-76209-87-543682-10 Queens BoulevardApt. 2GNew YorkNY11120-00009177858766
27
A-78643ALEXANDERSPETER30-Sep-80106-70-542356 Humboldt StreetApt. 1CBrooklynNY11211-00002123966700
28
P-98723PARSONSMJUANITA25-Feb-92150-90-876572 Orion CourtHempsteadNY11550-00002125989866
29
B-10298BUTTSRDIONNE27-Jan-59115-20-974289 JFK BoulevardApt. 34CJersey CityNJ07399-00009085539123
30
K-67402KELLERNSUSAN18-Mar-64430-19-83609 Jones AvenueWycoffNJ
07481-0000
9087817651
31
F-87205FOXTJOHN03-Aug-05309-67-4365777 Orient WayUnit 7DRutherfordNJ
07070-0000
2017357946
32
J-12944JIMINEZL
ALLEJANDRO
10-Oct-03192-78-6734321 East 125th Street#8DNew YorkNY10020-00006469210090
33
O-55309OKELLEYPMARK11-Jul-97981-03-645589 Grand Parkway SouthApt. 67Staten IslandNY10301-00009294074002
34
T-21307TOOLEMJANETTE19-Apr-42784-01-2376239 Millers CircleJamaicaNY11405-0000
35
36
Question #2.
Why do you think the format has been changed in the target system (represented by Data Set 2)?
37
A.
Having more characters allows for a greater number of unique values.
38
B.
Using an alpha-numeric format allows for more combinations for Medical Record numbers.
39
C.
The target system used an easy convention so it's easy to derive the new Medical Record number.
40
D.
All of the above.
41
42
Question #3
From Data Set 1 to Data Set 2 (below) notice how the patient's name has been changed.
43
Based on the sample for patient Janice Lynn Johns in Data Set 2, update the name for Warren Nicolas Smith
44
in the space highlighted in yellow.
45
46
Data Set 1
47
Patient_IDPatient_LNPatient_MNPatient_FN
48
40923JohnsLynnJanice
49
34058SmithNicholasWarren
50
51
Data Set 2
52
MRNLast_Name
Middle_Initial
First_Name
53
J-40923JOHNSLJANICE
54
34058
55
56
Question #4
Now try it for Peter Shaun Alexander.
57
58
Data Set 1
59
Patient_IDPatient_LNPatient_MNPatient_FN
60
40923JohnsLynnJanice
61
34058SmithNicholasWarren
62
78643AlexanderShaunPeter
63
64
Data Set 2
65
MRNLast_Name
Middle_Initial
First_Name
66
J-40923JOHNSLJANICE
67
S-34058SMITHNWARREN
68
78643
69
70
Question #5.
Now try it for the next patient record.
71
72
Data Set 1
73
MRNLast_Name
Middle_Initial
First_Name
74
J-40923JOHNSLJANICE
75
S-34058SMITHNWARREN
76
A-78643ALEXANDERSPETER
77
P-98723PARSONSMJUANITA
78
B-10298BUTTSRDIONNE
79
K-67402KELLERNSUSAN
80
F-87205FOXTJOHN
81
82
Data Set 2
83
MRNLast_Name
Middle_Initial
First_Name
84
J-40923JOHNSLJANICE
85
S-34058SMITHNWARREN
86
K-67402
87
88
89
Question #6
From Data Set 1 to Data Set 2 (below) notice how the DOB has been changed.
90
Based on the sample for patient Juanita Michelle Parsons, update the DOB Dionne Rachel Butts
91
in the space highlighted in yellow.
92
93
Data Set 1
94
Patient_IDPatient_LNPatient_MNPatient_FNDOB
95
98723ParsonsMichelleJuanita2/25/1992
96
10298ButtsRachelDionne1/27/1959
97
98
Data Set 2
99
MRNLast_Name
Middle_Initial
First_NameDOB
100
P-98723PARSONSMJUANITA25-Feb-92