Upload & QC
The data check module verifies that the uploaded data meet the input requirements. It consists of the following steps:
- Data conversion
All uploaded data are converted to PLINK binary format (.bed/.bim/.fam).
- Calculate the number of loci and samples
The number of loci and the number of samples in the uploaded data are calculated, and simple summary statistics are provided.
- Basic data quality control
This step will:
a. Extract biallelic SNPs;
b. Check chromosome codes (1-22, X [23], Y [24], XY [25], MT);
c. Convert variant positions to the GRCh37 reference genome build;
d. Remove duplicate sites;
e. Update rsIDs.
Users can view the processed files in the data summary.
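
The counting step and QC steps a, b and d above can be illustrated with a minimal Python sketch that works on the text lines of PLINK .bim (variant) and .fam (sample) files. It is not the module's actual implementation: the function names, the accepted chromosome code set, and the definition of a duplicate site (same chromosome code and base-pair position, first occurrence kept) are all assumptions made for illustration. Steps c and e depend on external reference data and are not shown.

```python
from collections import namedtuple

# One row of a PLINK .bim file: chromosome, variant ID, genetic distance,
# base-pair position, allele 1, allele 2.
Variant = namedtuple("Variant", "chrom vid cm pos a1 a2")

# Chromosome codes accepted in this sketch (assumption): 1-22, the numeric
# codes 23-25, and the labels X, Y, XY and MT.
VALID_CHROMS = {str(i) for i in range(1, 26)} | {"X", "Y", "XY", "MT"}
BASES = {"A", "C", "G", "T"}


def parse_bim(lines):
    """Parse whitespace-delimited .bim lines into Variant records."""
    variants = []
    for line in lines:
        fields = line.split()
        if len(fields) != 6:
            continue  # skip blank or malformed lines
        chrom, vid, cm, pos, a1, a2 = fields
        variants.append(Variant(chrom, vid, float(cm), int(pos), a1, a2))
    return variants


def count_samples(fam_lines):
    """Each non-empty .fam line describes one sample."""
    return sum(1 for line in fam_lines if line.strip())


def is_biallelic_snp(v):
    """True when both alleles are single, distinct A/C/G/T bases."""
    return v.a1 in BASES and v.a2 in BASES and v.a1 != v.a2


def basic_qc(variants):
    """Keep biallelic SNPs with valid chromosome codes, one per site."""
    seen = set()
    kept = []
    for v in variants:
        if not is_biallelic_snp(v) or v.chrom not in VALID_CHROMS:
            continue
        site = (v.chrom, v.pos)  # hypothetical duplicate key
        if site in seen:
            continue  # a later copy of an already kept site
        seen.add(site)
        kept.append(v)
    return kept


if __name__ == "__main__":
    # Made-up example records, not real data.
    bim = [
        "1 rs1 0 1000 A G",
        "1 rs2 0 2000 AT A",      # indel, dropped in step a
        "1 rs1_dup 0 1000 A G",   # same site as rs1, dropped in step d
        "X rs3 0 500 C T",
        "27 rs4 0 700 C T",       # chromosome code outside the set, step b
    ]
    fam = ["F1 S1 0 0 1 -9", "F2 S2 0 0 2 -9"]

    variants = parse_bim(bim)
    print("loci before QC:", len(variants))
    print("samples:", count_samples(fam))
    print("loci after QC:", [v.vid for v in basic_qc(variants)])
```

Note that this sketch treats the codes X and 23 as different chromosomes when looking for duplicates; a real pipeline would normalize the codes first.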