Difference between revisions of "HTAC Database - Data Download Guide"

From Pheno Wiki
Jump to: navigation, search
Line 5: Line 5:
 
* For Step 4, you have 3 option:
 
* For Step 4, you have 3 option:
 
:* '''"Master List (N = 1254)":''' This is most likely the option that all users will chose. This includes all subjects with Status = 2 (Complete). <br/>
 
:* '''"Master List (N = 1254)":''' This is most likely the option that all users will chose. This includes all subjects with Status = 2 (Complete). <br/>
:* '''"Population Stratified Set":''' This includes all subjects with Status = 2 (Complete), plus 62 additional subjects with Status = 0 and Genetic Recovery Case = 1. This larger dataset will be used for primary genetic analyses only. We have included the additional Genetic Recovery Cases in an attempt to increase our total sample size as much as possible, but they don't necessarily meet inclusion criteria for the Master List. <br/>
+
:* '''"Population Stratified Set":''' This includes all subjects with Status = 2 (Complete), plus 62 additional subjects with Status = 0 and Genetic Recovery Case = 1. This larger dataset (N = 1316) will be used for primary genetic analyses only. We have included the additional Genetic Recovery Cases in an attempt to increase our total sample size as much as possible, but they don't necessarily meet inclusion criteria for the Master List. <br/>
 
:* '''"Inactive/Active/Complete (N = 1839)":''' This includes all subjects recorded in the study. This should only be downloaded for QC purposes. This dataset should not be downloaded and used for analyses. <br/>
 
:* '''"Inactive/Active/Complete (N = 1839)":''' This includes all subjects recorded in the study. This should only be downloaded for QC purposes. This dataset should not be downloaded and used for analyses. <br/>
 +
[[File:Example.jpg]]

Revision as of 16:57, 6 March 2013

This guide assumes that you are familiar with the CNP dataset, names of the data subsets (e.g., LA5C), and have access to the HTAC Database.

In the HTAC Customized Data Export section, you can request data organized by Subject Type (Step 3) or Subject Status (Step 4).

  • For Step 3, you can chose to download only a certain set of patients, for example; if you want to download the entire dataset, select "ALL SUBJECTS".
  • For Step 4, you have 3 option:
  • "Master List (N = 1254)": This is most likely the option that all users will chose. This includes all subjects with Status = 2 (Complete).
  • "Population Stratified Set": This includes all subjects with Status = 2 (Complete), plus 62 additional subjects with Status = 0 and Genetic Recovery Case = 1. This larger dataset (N = 1316) will be used for primary genetic analyses only. We have included the additional Genetic Recovery Cases in an attempt to increase our total sample size as much as possible, but they don't necessarily meet inclusion criteria for the Master List.
  • "Inactive/Active/Complete (N = 1839)": This includes all subjects recorded in the study. This should only be downloaded for QC purposes. This dataset should not be downloaded and used for analyses.

Example.jpg