RCA Extract
Document extraction for healthcare PDFs. Ingests discharge summaries, ED assessments, referral letters, imaging reports and pathology reports. Returns structured fields ready for downstream analytics, EMR ingest, or audit work.
From Document to Insight
RCA Extract runs natively in Snowflake, so documents never leave your account and extracted results can be queried as soon as they land.
Document Ingestion
Stage PDFs in a Snowflake internal or external stage. Multi-page PDF, TIFF, PNG and JPEG inputs are supported.
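As a rough illustration of the staging step, the sketch below filters local files to the input types listed above and builds a Snowflake PUT statement for an internal stage. The stage name `rca_docs_stage`, the helper names and the exact extension spellings are assumptions for illustration, not part of RCA Extract. PUT only applies to internal stages; files for an external stage are uploaded with the cloud provider's own tooling.

```python
from pathlib import PurePosixPath

# Input types named in the text; the extension spellings are an assumption.
SUPPORTED_SUFFIXES = {".pdf", ".tif", ".tiff", ".png", ".jpg", ".jpeg"}


def is_supported(path: str) -> bool:
    """Return True if the file extension matches a supported input type."""
    return PurePosixPath(path).suffix.lower() in SUPPORTED_SUFFIXES


def put_statement(local_path: str, stage: str) -> str:
    """Build a PUT statement that uploads one file to an internal stage.

    AUTO_COMPRESS=FALSE keeps the staged file uncompressed, so the
    document is stored as uploaded rather than gzipped.
    """
    posix = PurePosixPath(local_path).as_posix()
    return f"PUT file://{posix} @{stage} AUTO_COMPRESS=FALSE"


# Hypothetical local files; only the supported ones get a PUT statement.
candidates = ["/data/scans/ds_001.pdf", "/data/scans/notes.docx", "/data/scans/ed_014.TIFF"]
statements = [put_statement(p, "rca_docs_stage") for p in candidates if is_supported(p)]
```

The statements would then be executed through whichever Snowflake client you already use.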
Extraction Pipeline
RCA Extract runs as a Snowflake Native App function. Per-document extraction returns structured fields plus bounding boxes for the labeled fields.
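To make "structured fields plus bounding boxes" concrete, here is one way a per-document result could be modelled on the consuming side. This is a hypothetical shape, not the product's actual output format: the class names, field names and the `(x0, y0, x1, y1)` box convention are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Assumed convention: (x0, y0, x1, y1) in page coordinates.
BBox = Tuple[float, float, float, float]


@dataclass(frozen=True)
class ExtractedField:
    """One labeled field, with an optional location on the page."""
    name: str
    value: Optional[str]
    page: Optional[int] = None
    bbox: Optional[BBox] = None


@dataclass(frozen=True)
class ExtractionResult:
    """All fields extracted from a single document (hypothetical shape)."""
    document_id: str
    document_type: str
    fields: Tuple[ExtractedField, ...]

    def as_row(self) -> dict:
        """Flatten to one dict per document, dropping layout information."""
        row = {"document_id": self.document_id, "document_type": self.document_type}
        row.update({f.name: f.value for f in self.fields})
        return row


# Illustrative values only.
result = ExtractionResult(
    document_id="doc-001",
    document_type="ed_assessment",
    fields=(
        ExtractedField("triage_category", "2", page=1, bbox=(72.0, 140.0, 180.0, 156.0)),
        ExtractedField("disposition", "Admitted", page=2, bbox=(72.0, 310.0, 240.0, 326.0)),
    ),
)
```

Keeping the bounding box next to each value lets a reviewer trace a field back to its place on the page during audit, while `as_row` gives the flat form used for analytics.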
Structured Output
Results land directly in Snowflake tables that follow pre-defined healthcare schemas. FHIR-aligned output is available where the schema applies.
Analytics-Ready
Query results with standard SQL. Connect to any Snowflake-compatible BI tool. Joins against your existing patient or facility tables work as normal.
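The kind of join described above might look like the following sketch. It uses Python's built-in `sqlite3` as a local stand-in for Snowflake so the example runs anywhere; the table names, column names and rows are hypothetical, and a real deployment would run the same style of query through a Snowflake client or BI tool.

```python
import sqlite3

# In-memory SQLite stands in for Snowflake; all names and rows are made up.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE patients (patient_id INTEGER PRIMARY KEY, facility TEXT);
    CREATE TABLE discharge_summary_extract (
        patient_id INTEGER,
        principal_icd TEXT,
        length_of_stay_days INTEGER
    );
    INSERT INTO patients VALUES (1, 'North'), (2, 'North'), (3, 'South');
    INSERT INTO discharge_summary_extract VALUES
        (1, 'J18.9', 2), (2, 'I21.4', 4), (3, 'S72.0', 5);
    """
)

# Join extracted fields to an existing patient table and aggregate by facility.
QUERY = """
SELECT p.facility, AVG(e.length_of_stay_days) AS avg_los_days
FROM discharge_summary_extract AS e
JOIN patients AS p ON p.patient_id = e.patient_id
GROUP BY p.facility
ORDER BY p.facility
"""

rows = conn.execute(QUERY).fetchall()
conn.close()
```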
Supported document types
The current production set covers high-volume Australian healthcare document types. Each is evaluated end-to-end against the matching family in the RCA Medical Library, so extraction quality can be scored against known ground truth. Additional document types ship on request.
Discharge summary
Demographics, registrar, consultant, principal diagnosis, principal ICD code, dates, length of stay, medications, follow-up.
ED assessment
Triage category, presenting complaint, disposition, timings.
Referral letter
Referrer, recipient specialty, presenting problem, requested action.
Imaging report
Accession number, modality, body region, findings, impression.
Pathology report
Lab reference, specimen, test panel, result fields, abnormal flags.
Why a synthetic-first vendor
RCA Extract is built and tested against the same synthetic medical documents we sell as the RCA Medical Library. That gives us:
- A controlled test set across 40+ document types where ground truth is known by construction.
- A scanned-variant test set for photocopy and JPEG-noise robustness.
- Versioned releases. Each release of RCA Extract is pinned to a generator seed and library version.
- Transparent evaluation. If you want to verify our extraction quality before committing, we can ship you the same documents and you score against the same ground truth.
We will not quote blanket accuracy numbers until our benchmark methodology and results are published. If you need a benchmark for a specific document type, contact us.
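Scoring extraction output against known ground truth can be as simple as per-field exact-match accuracy. The sketch below is one possible scoring approach, not RCA's published methodology: the function names, the whitespace and case normalisation, and the sample field names are assumptions.

```python
from collections import defaultdict


def normalise(value):
    """Collapse whitespace and ignore case; None stays None."""
    if value is None:
        return None
    return " ".join(str(value).split()).casefold()


def field_accuracy(predictions, ground_truth):
    """Per-field exact-match accuracy over every document in ground_truth.

    Both arguments map document id -> {field name: value}. A document or
    field missing from predictions counts as a miss unless the true value
    is also None.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for doc_id, truth_fields in ground_truth.items():
        predicted = predictions.get(doc_id, {})
        for field, truth in truth_fields.items():
            totals[field] += 1
            if normalise(predicted.get(field)) == normalise(truth):
                hits[field] += 1
    return {field: hits[field] / totals[field] for field in totals}


# Illustrative values only.
truth = {
    "d1": {"triage_category": "2", "disposition": "Admitted"},
    "d2": {"triage_category": "3", "disposition": "Discharged"},
}
predicted = {
    "d1": {"triage_category": "2", "disposition": "admitted "},
    "d2": {"triage_category": "4", "disposition": "Discharged"},
}
scores = field_accuracy(predicted, truth)
```

Exact match suits coded fields such as triage category. Free-text fields such as findings or impression would likely need a softer comparison.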
What you get with a Snowflake Native deployment
RCA Extract runs inside your Snowflake account. The product inherits the security and audit capabilities of your own Snowflake environment.
- Runs entirely within your Snowflake account. Patient data never leaves your environment.
- Inherits your existing Snowflake RBAC, audit and access policies.
- No external API calls. No third-party data processors involved in extraction.
- Snowflake Marketplace listing: GZSUZU1HJP.
- Encryption at rest and in transit, provided by Snowflake.
Pricing
Contact us. Pricing depends on document volume, deployment shape and SLA. Pilot packs are available before commitment.
Browse RCA Extract on Snowflake Marketplace
Listing ID GZSUZU1HJP. Snowflake Native App. Runs in your account.