Back to projects

respire

Python / React

A modular, open-source platform that centralizes publicly available lung disease data from scattered repositories into a single searchable interface. Researchers can filter across gene expression, microbiome, and other pulmonary datasets and download everything in a consistent, analysis-ready format.

Studies indexed

2,690 across 2 modules

Samples sourced

64,761

Data volume

103.5 GB, analysis-ready

Capability

First unified search for pulmonary expression data

Core registry and module API specGene expression moduleMicrobiome moduleSearch interfaceData ingestion pipelines
Project scoping and roadmapClient relationship ownershipProject managementProject architecture
  • React
  • Redux
  • FastAPI
  • PostgreSQL
  • Docker
  • Plugin architecture with a defined module API — new data types (proteomics, metabolomics) plug into the core without rebuilding it, so the system grows without re-engineering
  • Containerized microservices — the system needs to be deployable by other research groups on their own infrastructure, so portability was a hard requirement
  • Open-sourced and domain-agnostic core — the framework works for any disease area, not just pulmonary, which multiplies the value of the engineering investment