Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Colander: program to de-identify dates

By setting all occurrences of a specific event  (first hospitalization start for example), to a common date and time.

Eg. if you have a file with two patients, patients A and B which looks like the following

Patient

Date

Event

A

2011/1011/12

In

B

2011/1011/11

In

A

2011/14/12

Out

B

2011/10/12

Out

Patient

...

...

Patient

Date

Event

A

2000/01/01

In

B

2000/01/01

In

A

2000/04/01

Out

B

2000/01/02

Out

Author:

endrebak

Config file format (input_file_name|output_file_name (!= input file name)|columns containing dates|column containing pid)

New Format? #input_file_name|(!=)output_file_name|all columns containing dates|one column containing the unique ID

CSV/sykehus_avd.csv|CSV/sykehus_avd_anon.csv|6,5,7|-13

...

Files:

Colanderapp.py

config.txt

Notes:
not tested on real data