Avalon - Synthetic FHIR R4 patient data as OMOP CDM 5.4 views
Free OMOP CDM views on BigQuery for healthcare researchers who need PHI-free test data.

90x query cost reduction by flattening 459 nested FHIR fields to 15 columns.
Audience: healthcare developers, FHIR engineers, data analysts
Comparable datasets: CMS SynPUF · MIMIC-III · OHDSI datasets
8 resource types: Patient, Encounter, Observation, Condition, Procedure, Immunization, MedicationRequest, DiagnosticReport.
The raw Synthea output has 459 nested fields per resource, urn:uuid: references, and no column descriptions. We flatten it to clean views with ~15 columns each, pre-extracted IDs, and descriptions sourced from the FHIR R4 OpenAPI spec. Example:
-- Raw FHIR:
SELECT id, code.text FROM diagnostic_report WHERE subject.reference = CONCAT("urn:uuid:", patient_id)

-- Forge view:
SELECT report_name, patient_id FROM v_diagnostic_report

Data scanned per query drops ~90x (450 MB → 5 MB).
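To make the flattening idea concrete, here is a minimal Python sketch of what "pre-extracted IDs" could look like for a single DiagnosticReport. It is an illustration only, not the actual view definitions, which live in BigQuery SQL. The helper names `extract_id` and `flatten_diagnostic_report`, the `report_id` column, and the sample record are assumptions made for this example; only `report_name`, `patient_id`, `code.text`, and `subject.reference` come from the queries above.

```python
URN_PREFIX = "urn:uuid:"


def extract_id(reference):
    """Return the bare ID from a 'urn:uuid:<id>' reference, or None if missing."""
    if not reference:
        return None
    if reference.startswith(URN_PREFIX):
        return reference[len(URN_PREFIX):]
    return reference  # already a bare ID; leave unchanged


def flatten_diagnostic_report(resource):
    """Map a nested DiagnosticReport dict to a flat row.

    Column names other than report_name and patient_id are hypothetical.
    """
    return {
        "report_id": resource.get("id"),
        "report_name": resource.get("code", {}).get("text"),
        "patient_id": extract_id(resource.get("subject", {}).get("reference")),
    }


# Hypothetical sample record, shaped like the fields used in the raw query.
raw = {
    "resourceType": "DiagnosticReport",
    "id": "r-001",
    "code": {"text": "Lipid panel"},
    "subject": {"reference": "urn:uuid:p-123"},
}

row = flatten_diagnostic_report(raw)
print(row)
```

With the prefix stripped once at flattening time, downstream queries can join on `patient_id` directly instead of rebuilding the `urn:uuid:` string in every WHERE clause.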
Free to subscribe: https://console.cloud.google.com/bigquery/analytics-hub/exch...
Updated weekly. Useful if you're building anything against FHIR data and want a realistic test dataset without standing up your own Synthea pipeline.
Happy to answer questions about the normalization approach or FHIR data modeling tradeoffs.