Loading required namespace: GenomicFiles
Using local VCF.
File already tabix-indexed.
Finding empty VCF columns based on first 10,000 rows.
1 sample detected: EBI-a-GCST001790
Constructing ScanVcfParam object.
VCF contains: 5,057,528 variant(s) x 1 sample(s)
Reading VCF file: multi-threaded (4 threads)
Renaming ID as SNP.
VCF file has -log10 P-values; these will be converted to unadjusted p-values in the 'P' column.
No INFO (SI) column detected.
Standardising column headers.
First line of summary statistics file: 
SNP	chr	BP	end	REF	ALT	FILTER	ReverseComplementedAlleles	ES	LP	SE	P	
Summary statistics report:
   - 2,522,276 rows
   - 2,522,271 unique variants
   - 284 genome-wide significant variants (P<5e-8)
   - 22 chromosomes
Checking for multi-GWAS.
Checking for multiple RSIDs on one row.
Inferring genome build.
Loading SNPlocs data.
Loading reference genome data.
Preprocessing RSIDs.
Validating RSIDs of 10,000 SNPs using BSgenome::snpsById...
Loading required package: BiocGenerics

Attaching package: ‘BiocGenerics’

The following objects are masked from ‘package:stats’:

    IQR, mad, sd, var, xtabs

The following objects are masked from ‘package:base’:

    anyDuplicated, aperm, append, as.data.frame, basename, cbind,
    colnames, dirname, do.call, duplicated, eval, evalq, Filter, Find,
    get, grep, grepl, intersect, is.unsorted, lapply, Map, mapply,
    match, mget, order, paste, pmax, pmax.int, pmin, pmin.int,
    Position, rank, rbind, Reduce, rownames, sapply, setdiff, sort,
    table, tapply, union, unique, unsplit, which.max, which.min

Loading required package: S4Vectors
Loading required package: stats4

Attaching package: ‘S4Vectors’

The following objects are masked from ‘package:base’:

    expand.grid, I, unname

BSgenome::snpsById done in 33 seconds.
Loading SNPlocs data.
Loading reference genome data.
Preprocessing RSIDs.
Validating RSIDs of 10,000 SNPs using BSgenome::snpsById...
BSgenome::snpsById done in 42 seconds.
Inferred genome build: GRCH37
Checking SNP RSIDs.
8 SNP IDs are not correctly formatted. These will be corrected from the reference genome.
Loading SNPlocs data.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/snp_not_found_from_chr_bp.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
Checking for merged allele column.
Checking A1 is uppercase
Checking A2 is uppercase
Checking for incorrect base-pair positions
Ensuring all SNPs are on the reference genome.
Loading SNPlocs data.
Loading reference genome data.
Preprocessing RSIDs.
Validating RSIDs of 2,522,263 SNPs using BSgenome::snpsById...
BSgenome::snpsById done in 30 seconds.
9,572 SNPs are not on the reference genome. These will be corrected from the reference genome.
Loading SNPlocs data.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/snp_not_found_from_chr_bp_2.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
Loading SNPlocs data.
Loading reference genome data.
Preprocessing RSIDs.
Validating RSIDs of 2,512,882 SNPs using BSgenome::snpsById...
BSgenome::snpsById done in 29 seconds.
Checking for correct direction of A1 (reference) and A2 (alternative allele).
There are 44 SNPs where neither A1 nor A2 match the reference genome. These will be removed.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/alleles_dont_match_ref_gen.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
There are 22 SNPs where A1 doesn't match the reference genome.
These will be flipped with their effect columns.
Reordering so first three column headers are SNP, CHR and BP in this order.
Reordering so the fourth and fifth columns are A1 and A2.
Checking for missing data.
Checking for duplicate columns.
Checking for duplicate SNPs from SNP ID.
57 RSIDs are duplicated in the sumstats file. These duplicates will be removed
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/dup_snp_id.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
Checking for SNPs with duplicated base-pair positions.
27 base-pair positions are duplicated in the sumstats file. These duplicates will be removed.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/dup_base_pair_position.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
INFO column not available. Skipping INFO score filtering step.
Filtering SNPs, ensuring SE>0.
Ensuring all SNPs have N<5 std dev above mean.
Checking for bi-allelic SNPs.
65,972 SNPs are non-biallelic. These will be removed.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/logs/snp_bi_allelic.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
Computing Z-score from P using formula: `sign(BETA)*sqrt(stats::qchisq(P,1,lower=FALSE)`
Warning: When method is an integer, must be >0.
Sorting coordinates with 'data.table'.
Sorting coordinates with 'data.table'.
Writing in tabular format ==> /rds/general/project/neurogenomics-lab/ephemeral/MAGMA_Files_Public/data/GWAS_munged/ebi-a-GCST001790/ebi-a-GCST001790.tsv
Writing uncompressed instead of gzipped to enable tabix indexing.
Converting full summary stats file to tabix format for fast querying...
Reading header.
Ensuring file is bgzipped.
Tabix-indexing file.
Removing temporary .tsv file.
Summary statistics report:
   - 2,446,839 rows (97% of original 2,522,276 rows)
   - 2,446,839 unique variants
   - 268 genome-wide significant variants (P<5e-8)
   - 22 chromosomes
Done munging in 7.884 minutes.
Successfully finished preparing sumstats file, preview:
Reading header.
          SNP CHR     BP A1 A2    END FILTER REVERSECOMPLEMENTEDALLELES   BETA
1: rs12565286   1 721290  G  C 721290   PASS                      FALSE  0.054
2: rs11804171   1 723819  T  A 723819   PASS                      FALSE  0.079
3:  rs2977670   1 723891  G  C 723891   PASS                      FALSE -0.082
4: rs12138618   1 750235  G  A 750235   PASS                      FALSE -0.054
5:  rs3094315   1 752566  G  A 752566   PASS                      FALSE  0.029
         LP    SE         P          Z
1: 0.190284 0.120 0.6452322  0.4603958
2: 0.325876 0.110 0.4721978  0.7189076
3: 0.342172 0.110 0.4548079 -0.7474236
4: 0.123118 0.170 0.7531509 -0.3144874
5: 0.233021 0.053 0.5847618  0.5464425