File Sizes & Distribution

Data will be sent to investigators on thumb drives.

The data files will be encrypted using SecureDoc and password-protected. The data files will also be compressed using the GZIP compression utility. Programs will be made available to decrypt and unzip the files onto the user's PC in the directory that the user specifies. The PC must be equipped with Windows NT, Windows 95 or later. GUNZIP is necessary to unzip the files if using a UNIX or Linux machine.

The following table provides some examples from recent productions. The files are compressed before they are written to the thumb drives, however, the file sizes provided are for the files in their non-compressed format. The table shows the estimated size of files in gigabytes for one major cancer site (breast) and for four major cancer sites combined (breast, colorectal, lung and prostate). Also shown are estimates for the non-SEER cohort database which includes patients not in SEER file, answered "NO" to any cancer question and resided in SEER area at time of survey.

File 1 Major Cancer Site
Size (GB)
4 Major Cancer Sites
Size (GB)
5% Non-SEER Database
Size (GB)
PEDSF 0.24 0.71 --
SUMDENOM -- -- 1.04
CAHPS Survey 0.09 0.47 2.19
MEDPAR 02-15 0.15 0.49 2.05
HHA 02-15 0.63 1.68 5.69
Hospice 02-15 0.46 1.37 6.95
NCH
NCH02 0.52 1.60 7.80
NCH03 0.64 1.92 6.95
NCH04 0.71 2.14 7.80
NCH05 0.89 2.59 9.79
NCH06 0.93 2.64 9.94
NCH07 0.99 2.82 11.08
NCH08 0.99 2.85 11.35
NCH09 1.03 2.92 11.69
NCH10 1.04 2.92 11.81
NCH11 1.02 2.82 11.72
NCH12 1.03 2.80 12.05
NCH13 1.01 2.71 12.18
NCH14 0.95 2.47 12.40
NCH15 0.90 2.28 12.37
Outpatient
Outpatient02 0.40 1.15 3.96
Outpatient03 0.49 1.38 4.88
Outpatient04 0.55 1.55 5.57
Outpatient05 0.67 1.87 7.00
Outpatient06 0.68 1.86 6.97
Outpatient07 0.74 2.02 8.10
Outpatient08 0.76 2.10 8.68
Outpatient09 0.80 2.18 9.02
Outpatient10 0.82 2.24 9.32
Outpatient11 0.86 2.29 9.70
Outpatient12 0.85 2.29 9.85
Outpatient13 0.85 2.25 9.85
Outpatient14 0.78 2.04 10.18
Outpatient15 0.74 1.89 10.39
DME
DME 03-15 0.57 1.77 8.12
Part
Part D07 0.23 0.62 --
Part D08 0.24 0.66 --
Part D09 0.25 0.67 --
Part D10 0.25 0.66 --
Part D11 0.24 0.64 --
Part D12 0.24 0.63 --
Part D13 0.25 0.63 --
Part D14 0.24 0.58 --
Part D15 0.22 0.53 --
PTD denom07-15 0.03 0.10 --
Total 26.96 74.79 288.41
Last Updated: 03 Oct, 2019