Unlock Biobank Collaboration Without Sharing Data

Generate Naive Summary Statistics (NSS) once, enable unlimited GWAS model exploration. Privacy-preserving meta-analysis for the biobank era.

280K+
Individuals Validated
10.5M
SNPs Analyzed
99.9%
Accuracy vs PLINK

Traditional Meta-Analysis is Broken

Current GWAMA workflows require repeated data requests, limiting model exploration and burdening biobanks with computational overhead.

Traditional GWAMA

  • Request summary stats for each model change
  • Biobanks bear computational burden
  • Only explore 1 of (k+1)2k possible models
  • Weeks of coordination overhead
Traditional GWAMA workflow

nssgen + dWAMA

  • Generate NSS once, explore all models
  • Computation shifts to researchers locally
  • Access all (k+1)2k covariate combinations
  • Minutes instead of weeks
nssgen dWAMA workflow

Built for Modern Genomics

Everything you need to accelerate collaborative genetic research while maintaining the highest privacy standards.

Privacy by Design

Individual-level genetic data never leaves the biobank. NSS contains only variance-covariance matrices—mathematically impossible to reconstruct individual genotypes.

Model Flexibility

Explore all (k+1)2^k possible GWAS models locally. Toggle covariates on/off, swap phenotypes, and fine-tune without re-contacting biobanks.

Computational Efficiency

O(Cmk²) time complexity vs O(Cnmk² + Cmk³) for traditional GWAMA. Analyze 64 models in 2 hours instead of weeks of coordination.

PRS Fine-Tuning

Systematically evaluate covariate effects on polygenic risk scores. Our UK Biobank validation showed R² improvements from 0.02 to 0.05 through model tuning.

Multi-Biobank Ready

Seamlessly combine NSS from multiple biobanks. Validated across 19 UK Biobank cohorts and 4 NIPT biobanks with 149K+ individuals.

API-First Platform

RESTful API for programmatic NSS generation and analysis. Integrate with your existing pipelines, PLINK workflows, and PRS tools.

How nssgen Works

A simple three-step process that transforms how biobanks collaborate on genetic research.

01

Generate NSS

Biobanks run our tool on their individual-level data once. This produces a compact variance-covariance matrix (NSS) that captures all statistical relationships.

Time complexity: O(nmK + K³)
02

Share Publicly

NSS files are safe to share—they contain no individual-level data. Upload to our platform or distribute through your preferred channels.

File size: O(K × m) instead of O(n × m)
03

Analyze Locally

Researchers download NSS and run unlimited GWAS models locally. No biobank coordination needed. Fine-tune covariates, run meta-analyses, optimize PRS.

Time complexity: O(Cmk²)

NSS Data Structure

Compact variance-covariance matrices containing phenotypes and principal components. ~50GB for a full biobank.

NSS matrix structure

UKC Computation Engine

Efficient least-squares estimation directly from summary statistics without individual data.

UKC engine formula

The Math Behind NSS

NSS captures the complete variance-covariance structure needed for any linear regression model. One matrix, unlimited models.

β = Ω-1Λb   where   Ω = XTX

NSS-LSE: Naive Summary Statistics Least Squares Estimation

Proven at Scale

Validated across UK Biobank and real-world NIPT cohorts with biobank-scale sample sizes.

UK Biobank Validation

We validated nssgen across 19 UK Biobank assessment centers, analyzing 280K unrelated white British individuals with 10.5M SNPs.

19
Cohorts
280K
Individuals
10.5M
SNPs
>0.999
Correlation with PLINK
UK Biobank cohorts and PCA

Polygenic Risk Score Fine-Tuning

Systematically explored 64 GWAS models for Weight prediction. nssgen enabled efficient covariate evaluation without repeated biobank access.

Models tested 2 × 2⁵ = 64
R² improvement 0.02 → 0.05
Best model 5 PCs + Sex (no BMI/Height)
PRS fine-tuning results

Real-World NIPT Cohorts

Validated in a true multi-biobank setting with 4 independent NIPT cohorts from China, demonstrating practical applicability.

4
Independent Biobanks
149K
Individuals
855K
Shared SNPs
2
Traits Analyzed
NIPT Manhattan plot
Coming Soon

nssgen API & Cloud Platform

Generate NSS in the cloud, access pre-computed datasets, and integrate with your existing genomics pipelines through our RESTful API.

  • RESTful API for programmatic access
  • Pre-computed public biobank NSS datasets
  • Python & R SDK packages
  • PLINK-compatible output formats

Get Early Access

Join the waitlist for priority access to the nssgen platform and API.

No spam. We'll only email you about nssgen updates.