An example of my data is shown below. There are 30 diagnosis columns (Diagnosis 1, Diagnosis 2, and so on), and each column can hold any of several disease conditions. I would like to create a binary variable for every disease condition. This is a large dataset, so I would like to know an efficient way to do this. Thank you, Al Bothwell
Diagnosis 1 | Diagnosis 2
Hypertension | Delusional Disorder
Chronic Airway Obst | Chronic Airway Obst
Delusional Disorder | Hypertension
Osteoporosis Nos | Osteoporosis Nos
Leukocytosis Nos | Atrial Fibrillation
Atrial Fibrillation | Joint Pain-Shlder
Joint Pain-Shlder | Leukocytosis Nos
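
To make the request concrete, here is a minimal sketch in Python of one way the binary variables could be built. It is not tied to any particular package, and several things in it are assumptions rather than facts from the post: the function name `add_condition_flags` is hypothetical, the sample values are paired two per row (one reading of the layout), each flag is named after its condition, and a flag is set to 1 when the condition appears in any diagnosis column of that row.

```python
def add_condition_flags(rows, diagnosis_columns):
    """Return (conditions, flagged_rows) with one 0/1 flag per distinct condition.

    Hypothetical helper: `rows` is a list of dicts, one dict per record.
    """
    # Pass 1: collect every distinct non-empty condition across the diagnosis columns.
    conditions = sorted({
        row[col].strip()
        for row in rows
        for col in diagnosis_columns
        if row.get(col) and row[col].strip()
    })

    # Pass 2: a flag is 1 if the condition appears in any diagnosis column of the row.
    flagged_rows = []
    for row in rows:
        present = {row[col].strip() for col in diagnosis_columns if row.get(col)}
        flags = {cond: int(cond in present) for cond in conditions}
        flagged_rows.append({**row, **flags})
    return conditions, flagged_rows


# Sample values from the post, assumed to be laid out two per row.
diagnosis_columns = ["Diagnosis 1", "Diagnosis 2"]
rows = [
    {"Diagnosis 1": "Hypertension", "Diagnosis 2": "Delusional Disorder"},
    {"Diagnosis 1": "Chronic Airway Obst", "Diagnosis 2": "Chronic Airway Obst"},
    {"Diagnosis 1": "Delusional Disorder", "Diagnosis 2": "Hypertension"},
    {"Diagnosis 1": "Osteoporosis Nos", "Diagnosis 2": "Osteoporosis Nos"},
    {"Diagnosis 1": "Leukocytosis Nos", "Diagnosis 2": "Atrial Fibrillation"},
    {"Diagnosis 1": "Atrial Fibrillation", "Diagnosis 2": "Joint Pain-Shlder"},
    {"Diagnosis 1": "Joint Pain-Shlder", "Diagnosis 2": "Leukocytosis Nos"},
]

conditions, flagged = add_condition_flags(rows, diagnosis_columns)
```

For the full data, the column list could be generated instead of typed, for example `[f"Diagnosis {i}" for i in range(1, 31)]`, assuming the columns really are named that way. The sketch makes two passes over the data, so the work grows with the number of records times the number of diagnosis columns; whether that is efficient enough depends on the size of the dataset and the tool actually in use.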