Skip to main content
Snowflake Native App

RCA Extract

Document extraction for healthcare PDFs. Ingests discharge summaries, ED assessments, referral letters, imaging reports and pathology reports. Returns structured fields ready for downstream analytics, EMR ingest, or audit work.

Snowflake
Native App deployment
AU
Healthcare conventions
10 Min
Marketplace install time
Zero
Data egress

From Document to Insight

Native Snowflake architecture means zero data movement and instant analytics on extracted results.

Document Ingestion

Stage PDFs in a Snowflake internal or external stage. Multi-page PDF, TIFF, PNG and JPEG inputs supported.

Extraction Pipeline

RCA Extract runs as a Snowflake Native App function. Per-document extraction returns structured fields plus bounding boxes for the labeled fields.

Structured Output

Results land directly in Snowflake tables. Pre-defined healthcare schemas. FHIR-aligned output available where the schema applies.

Analytics-Ready

Query results with standard SQL. Connect to any Snowflake-compatible BI tool. Joins against your existing patient or facility tables work as normal.

Supported document types

The current production set covers high-volume Australian healthcare document types. Each is evaluated end-to-end against the matching family in the RCA Medical Library, so extraction quality can be scored against known ground truth. Additional document types ship on request.

Discharge summary

Demographics, registrar, consultant, principal diagnosis, principal ICD, dates, length of stay, medications, follow-up.

ED assessment

Triage category, presenting complaint, disposition, timings.

Referral letter

Referrer, recipient specialty, presenting problem, requested action.

Imaging report

Accession number, modality, body region, findings, impression.

Pathology report

Lab reference, specimen, test panel, result fields, abnormal flags.

Why a synthetic-first vendor

RCA Extract is built and tested against the same synthetic medical documents we sell as the RCA Medical Library. That gives us:

  • A controlled test set across 40+ document types where ground truth is known by construction.
  • A scanned-variant test set for photocopy and JPEG-noise robustness.
  • Versioned releases. Each release of RCA Extract is pinned to a generator seed and library version.
  • Transparent evaluation. If you want to verify our extraction quality before committing, we can ship you the same documents and you score against the same ground truth.

We do not publish blanket accuracy numbers until we have published benchmark methodology and results. If you need a benchmark for a specific document type, contact us.

What you get with a Snowflake Native deployment

RCA Extract runs inside your Snowflake account. The product inherits the security and audit capabilities of your own Snowflake environment.

  • Runs entirely within your Snowflake account. Patient data never leaves your environment.
  • Inherits your existing Snowflake RBAC, audit and access policies.
  • No external API calls. No third-party data processors involved in extraction.
  • Snowflake Marketplace listing: GZSUZU1HJP.
  • Encryption at rest and in transit, provided by Snowflake.
Runs in Your Snowflake Account
No Third-Party Processors
FHIR Compatible Output
Zero Data Movement

Pricing

Contact us. Pricing depends on document volume, deployment shape and SLA. Pilot packs are available before commitment.

Browse RCA Extract on Snowflake Marketplace

Listing ID GZSUZU1HJP. Snowflake Native App. Runs in your account.