Data products and pipelines

Accelerating Research Expertise

This section brings together the reusable data products we have developed to support faster, more efficient research. It includes curation methods and pipelines curated assets, curated pipelines, and a reusable code library , designed to reduce duplication, improve consistency, and help researchers get started more quickly.

Explore the reusable research resources we have created to speed up the journey from data access to analysis. From prepared datasets and commonly used variables to ready-to-use workflows and shared code examples, these products are designed to reduce duplication and support high quality research.

Curated Assets

Option 1: Start analysis faster with ready-made, reusable tables built from electronic health record datasets and prepared for direct use in analysis

Option 2: Common research data, prepared once and reused across projects. Curated assets provide harmonised, high-quality well-documented, analysis-ready tables built from electronic health record datasets. They help research teams work more efficiently by removing the need to repeat the same data preparation in every project, while enabling more comparable and robust studies.

Data Curation Pipelines

Option 1: Reusable workflows that prepare raw electronic health record data for analysis, tailored to the needs of your project and cohort

Option 2: Our data curation pipelines provide structured, repeatable workflows for transforming raw electronic health record data into analysis-ready datasets. Within Secure Data Environments (SDEs) and Trusted Research Environments (TREs), they combine reusable code, defined methods, and quality checks into clear, transparent, end-to-end processes.