We’re live:now accepting pilots!

Stop fighting infrastructure
Start doing science

One command to provision high-memory compute, run any workflow, and automatically capture every result. Built for labs, not cloud engineers.

Trusted by
ETH Zurich
University of Zurich
Vilniaus UniversitetasVilniaus
Universitetas
+15 institutions
Built for bioinformaticians

Everything you need. Nothing you don’t.

Run any workflow, reproduce any result, share with anyone. No cloud expertise required.

Core differentiator

Reproducibility built in,
not bolted on

Every run automatically captures its exact environment, inputs, parameters, and infrastructure. Reproduce any result months later. Share complete analyses with reviewers, collaborators, or clients in one click.

[captured] pipeline: rnaseq-nf v3.14
[captured] environment: conda 24.1
[captured] compute: 128 GB RAM, 32 vCPU
[captured] dataset: GEO:GSE12345
Reproducibility artifact saved.
Share: thoa.io/share/a8f3e2

Instant compute, zero setup

VMs up to 12 TB RAM in seconds. No cluster queues, no cloud accounts, no DevOps.

Pay only when you run

No idle clusters. Scale up for large jobs, pay nothing when idle.

Share with one click

Share datasets and results with collaborators, reviewers, or clients. No ad-hoc file transfers.

Any tool, any workflow

Snakemake, Nextflow, Python, R, Docker, Conda. No migration required.

Compliance-ready

21 CFR Part 11-aligned audit trails. Role-based access. Isolated, ephemeral compute.

Lab to core facility

Scales from individual researchers to multi-user teams. Centralized control, per-project isolation.

Who is Thoa for

Your role. Your pain. Your solution.

Thoa solves different problems for different teams. Find yours.

Academic

Research lab PI / Postdoc

You’re a scientist who spends more time debugging computing environments than analyzing data. When a reviewer asks you for your compute environment, data, and software versions, it takes days.

See solution
Academic

Research lab PI / Postdoc

Thoa gives you

Reproducibility built in. Every analysis is automatically captured and shareable. Reviewer asks for reproduction? Send a link, not a methods section.

startup

Biotech / Bioinformatics

Your engineers spend 40% of their time on cloud infrastructure instead of science. Compute costs are unpredictable. Investors ask about data governance and you improvise.

See solution
startup

Biotech / Bioinformatics

Thoa gives you

Zero DevOps overhead. Predictable per-job pricing. Built-in audit trails that hold up under investor scrutiny.

University / institutional

Core Facility

You serve 30+ researchers across 15 projects simultaneously. Pipeline inconsistencies, onboarding bottlenecks, and HPC queue pressure eat your week.

See solution
University / institutional

Core Facility

Thoa gives you

Instant researcher onboarding. Standardized pipelines. Every project runs reproducibly regardless of who runs it. Publication-ready sharing built in.

MSc / PhD program

Bioinformatics

You lose 2 weeks every semester to student computing environment setup. Students break each other’s configs. Results vary across the cohort due to computing environment drift.

See solution
MSc / PhD program

Bioinformatics

Thoa gives you

Every student gets a ready-to-run computing environment with one click. Consistent results across the entire cohort. Centralized compute budget control.

What researchers say

Built with scientists, not just for them

“Beyond providing powerful computing infrastructure at a very reasonable price and being very easy to use, Thoa’s main value proposition is the handling of traceability and reproducibility of pipelines. For the computational biology community this is still a pain point, and Thoa is certainly poised to fix it.”

Dr. Giancarlo Russo

Dr. Giancarlo Russo

Head of Bioinformatics Core Facility, Vilnius University

“In my work, we often struggle to reduce budgets around data management and efficiency of data organization in a compliant manner. The way Thoa integrates data from heterogeneous sources in a reproducible manner is really helpful.”

Dr. George Kazantzidis

Dr. George Kazantzidis

Biostatistician, Roche

“Collaborating across organizations and institutions on large datasets in a manageable way is always a challenge. Thoa’s native collaboration features are something I have greatly appreciated.”

Falko Noe

Falko Noe

Bioinformatician, ETH Zurich

Supported Workflows

Everything from QC to Expression Analysis

Thoa supports the full bioinformatics pipeline, from raw data quality checks through to publication-ready outputs. No pipeline too complex, no dataset too large.

  • MultiQC quality control reports
  • RNA-seq and differential expression
  • Genome assembly and annotation
  • Single-cell RNA-seq workflows
  • Dataset management and sharing
Pipeline Engine

Run Any Pipeline, Reproduce Any Result

Thoa executes Snakemake and Nextflow DAGs natively, with full graph scheduling, isolated environments, and automatic output capture on every run.

SnakemakeNextflow
PythonRDockerConda
RAW_READSFASTQCTRIM_GALORESTAR_ALIGNSORT_BAMRSEQCFEATURECOUNTSDESEQ2MULTIQC
See it in action

One command. Done.

thoa workbench
18'000+
Tools integrated
23+
Research institutions
12+ TB
Data processed
$0.90
Average cost per job
Compliance-Ready

Embedded Security and Compliance

Built for regulated environments. Thoa enforces data integrity, full auditability, and access controls aligned with biotech and pharma compliance standards, without adding friction to your workflows.

Data Integrity by Design

Every file, dataset, and output is versioned and linked to its run. Reproducibility and traceability built in.

Audit-Ready Run History

Full job provenance with timestamped records of who ran what, when, and on which data, ready for regulatory review.

Isolated Compute Environments

Workloads run in short-lived sandboxed VMs. No cross-contamination between runs, no residual state.

21 CFR Part 11-aligned audit trails
Role-based access control
Isolated, ephemeral compute

How we compare

Feature
Thoa
Others
Setup time
Minutes
Hours–Days
Reproducibility
Snakemake / Nextflow
~
Public datasets
Cloud model
Built-in
BYOC
21 CFR audit trails

Full side-by-side comparison with Seqera, DNAnexus & SevenBridges available on desktop

Transparent pricing

Pay only for what you compute

All plans include zero-egress storage and automatic reproducibility artifacts.

~74% savings vs. Seqera + AWS
Free
$0/mo

Company domain required

  • 40 GB storage · 100 GB archive
  • 2'000 credits (one-time)
  • 1 concurrent job · 64 GB RAM · 8 CPU
  • 6 hr max runtime
  • Conda only
  • 3 downloads per dataset
Starter
$35/mo
  • 200 GB storage · 1 TB archive
  • 2'500 credits/mo
  • 2 concurrent jobs · 128 GB RAM · 16 CPU
  • 1 day max runtime
  • Docker · Conda · Singularity
  • Private datasets · 10% compute off
Most popular
Pro
$109/mo
  • 1 TB storage · 5 TB archive
  • 6'000 credits/mo
  • 5 concurrent jobs · 512 GB RAM · 64 CPU
  • 3 day max runtime
  • Dataset versioning · MCP server
  • 5 priority runs/mo · 20% off
Team
$480/mo

For research labs

  • 4 TB storage · 20 TB archive
  • 28'000 credits/mo
  • 20 concurrent jobs · 1 TB RAM · 96 CPU · GPU
  • 1 week max runtime
  • RBAC · Webhooks · Zenodo publish
  • 30% compute off · team dashboard
Only 3 pilot spots remaining for Q2 2026

Ready to stop managing
infrastructure?

Join the pilot program. Get free compute credits, dedicated support, and direct influence on the product.

Free compute credits 4–6 month collaboration Direct access to founders