RCA Medical Library
Synthetic Australian medical training documents
40+ document types across hospital, ED, GP clinic, pathology, imaging and specialist correspondence. Ground truth, bounding boxes and scanned variants ship with every document.
40+ document types
The document types fall into three groups. The five covered by RCA Extract are tagged RCA Extract.
Hospital and ED
- Discharge summary (RCA Extract)
- ED assessment (RCA Extract)
- Admission checklist
- ICU daily plan
- Anaesthetic record
- Fluid order
- Progress note
- Patient safety checklist
- Transfusion compatibility report
- Haemodialysis flow sheet
- Infusion pump checklist
- Medication administration record
GP and primary care
- Referral letter (RCA Extract)
- Medical certificate
- Prescription
- Mental health care plan
- Mental health assessment
- Advance care directive
- Home care plan
- Treatment plan
- External correspondence
Pathology, imaging, specialist
- Pathology request
- Pathology report (RCA Extract)
- Imaging request
- Imaging report (RCA Extract)
- Bone density report
- ECG (12-lead and rhythm)
- Echo report
- Vascular ultrasound report
- Pacemaker report
- Ophthalmology assessment
- Audiology assessment
- Speech pathology assessment
- Physiotherapy assessment
- Endoscopy report
- HADS questionnaire
The full list, with document_type weights, is documented in the library's manifest.json.
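As a rough illustration of how document_type weights could be used, the sketch below turns a weight mapping into per-type document counts for a pack of a given size. The manifest layout shown here (a top-level `document_types` object mapping type names to weights), the type names, the weight values and the `allocate_counts` helper are all assumptions for illustration; check the manifest.json shipped with your library for the actual schema.

```python
import json

# Hypothetical manifest excerpt: the real manifest.json schema may differ.
MANIFEST_JSON = """
{
  "document_types": {
    "discharge_summary": 3.0,
    "ed_assessment": 2.0,
    "referral_letter": 2.0,
    "pathology_report": 2.0,
    "imaging_report": 1.0
  }
}
"""


def allocate_counts(weights: dict[str, float], total: int) -> dict[str, int]:
    """Split `total` documents across types in proportion to their weights.

    Uses largest-remainder rounding so the counts always sum to `total`.
    """
    weight_sum = sum(weights.values())
    if weight_sum <= 0:
        raise ValueError("weights must sum to a positive number")
    exact = {name: total * w / weight_sum for name, w in weights.items()}
    counts = {name: int(share) for name, share in exact.items()}
    leftover = total - sum(counts.values())
    # Hand the remaining documents to the types with the largest fractions.
    by_remainder = sorted(exact, key=lambda n: exact[n] - counts[n], reverse=True)
    for name in by_remainder[:leftover]:
        counts[name] += 1
    return counts


weights = json.loads(MANIFEST_JSON)["document_types"]
counts = allocate_counts(weights, total=25)
```

Largest-remainder rounding is one reasonable choice here because it keeps the pack size exact; random sampling by weight would also work if exact proportions do not matter.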
What you actually get
Below is an actual discharge summary from the RCA Medical Library: NSW hospital header, AU patient name conventions, Medicare format, NSW Local Health District, AU consultant postnominals, and a medications table. The same page is shown twice: rendered clean, and with every labelled field outlined.


25 to 35 representative documents. Same-day delivery on request. PDFs, ground truth, bboxes and scanned variants.
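Ground truth and bounding boxes make it possible to score an extraction model field by field. The sketch below is one way to do that, under stated assumptions: the ground-truth layout (a mapping from field name to a `value` string and a `bbox` of `(x0, y0, x1, y1)`), the field names and the function names are hypothetical and may not match the files in the pack.

```python
def normalise(text: str) -> str:
    """Collapse whitespace and ignore case before comparing values."""
    return " ".join(text.split()).casefold()


def bbox_iou(a: tuple, b: tuple) -> float:
    """Intersection over union of two (x0, y0, x1, y1) boxes."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    inter_w = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h
    union = (ax1 - ax0) * (ay1 - ay0) + (bx1 - bx0) * (by1 - by0) - inter
    return inter / union if union > 0 else 0.0


def score_document(predicted: dict, ground_truth: dict,
                   iou_threshold: float = 0.5) -> dict:
    """Return the share of ground-truth fields matched by value and by box.

    Fields missing from `predicted` count as misses for both measures.
    """
    value_hits = 0
    bbox_hits = 0
    for name, truth in ground_truth.items():
        pred = predicted.get(name)
        if pred is None:
            continue
        if normalise(pred["value"]) == normalise(truth["value"]):
            value_hits += 1
        if bbox_iou(pred["bbox"], truth["bbox"]) >= iou_threshold:
            bbox_hits += 1
    total = len(ground_truth)
    return {
        "value_accuracy": value_hits / total if total else 0.0,
        "bbox_accuracy": bbox_hits / total if total else 0.0,
    }
```

Exact match after whitespace and case normalisation is deliberately strict; dates, dosages and names may warrant field-specific comparison rules.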
AU-specific realism
- Patient names use AU-common first names and surnames drawn from broad surname pools (not a single ethnicity).
- Addresses use NSW postcodes that match the stated suburb. The postcode-to-suburb mapping is sourced from public ABS data; the addresses themselves are computer-generated and no real residential address is referenced.
- Medicare numbers follow the displayed AU format (10 digits plus IRN) but are computer-generated and do not validate against the real Medicare system.
- Provider numbers use the TRN-PROV-XXXXX format with the synthetic TRN prefix. The TRN prefix is deliberate so any pipeline that ingests these documents can filter out synthetic provider numbers.
- Clinician postnominals use AU specialty fellowships: FRACGP, FRACP, FRCPA, FRANZCR, FACEM, FRACS.
- Hospitals carry NSW Local Health District labels (synthetic, not real LHD names).
- Phone numbers use AU area codes.
These conventions are a common source of extraction failures for models trained primarily on US documents. Models that handle US date formats, ZIP codes and DEA numbers will frequently fail on AU postcodes, Medicare numbers and provider numbers without retraining.
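Because synthetic provider numbers carry the TRN prefix, an ingestion pipeline can screen them out with a simple pattern check. The sketch below is a minimal example under two assumptions that the page does not state: that the XXXXX placeholder stands for five letters or digits, and that the IRN is a single digit following the 10-digit Medicare number. The function names are illustrative. The Medicare check tests the displayed shape only, not validity.

```python
import re

# Assumption: "XXXXX" is five uppercase letters or digits.
SYNTHETIC_PROVIDER_RE = re.compile(r"TRN-PROV-[A-Z0-9]{5}")

# Assumption: 10 digits (optionally spaced 4-5-1) followed by a one-digit
# IRN, optionally separated by a space, slash or hyphen.
MEDICARE_SHAPE_RE = re.compile(r"\d{4} ?\d{5} ?\d[ /-]?\d")


def is_synthetic_provider_number(value: str) -> bool:
    """True if the value matches the synthetic TRN-PROV-XXXXX format."""
    return SYNTHETIC_PROVIDER_RE.fullmatch(value.strip().upper()) is not None


def has_medicare_shape(value: str) -> bool:
    """True if the value looks like 10 digits plus IRN (shape check only)."""
    return MEDICARE_SHAPE_RE.fullmatch(value.strip()) is not None


def drop_synthetic_providers(provider_numbers: list[str]) -> list[str]:
    """Keep only provider numbers that do not carry the synthetic format."""
    return [p for p in provider_numbers if not is_synthetic_provider_number(p)]
```

A pipeline that mixes synthetic and production data might call `drop_synthetic_providers` before any lookup against a provider registry, so synthetic identifiers never reach downstream systems.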
65+ curated clinical case archetypes
The Medical Library is built from hand-authored case archetypes. Each case has internally consistent demographics, presenting complaint, labs, treatments, follow-up plans and discharge instructions. A single case can be rendered as several different document types within the same library so the documents in a pack hang together as a plausible patient journey.
Adding a new case is roughly 50 lines of Python. We accept paid feature requests for new case archetypes. Common requests: paediatric ED, renal failure with dialysis, post-op infection, mental health crisis presentation.
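To make the idea of a case archetype concrete, here is a hypothetical sketch of what one might look like as a Python record, with the same facts shared across every document type rendered from it. This is not the library's actual case format: the `CaseArchetype` class, its field names, the `render_plan` helper and the example values are all invented for illustration.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class CaseArchetype:
    """Hypothetical case record; field names are illustrative only."""
    case_id: str
    age_years: int
    sex: str
    presenting_complaint: str
    labs: dict[str, str]
    treatments: tuple[str, ...]
    follow_up_plan: str
    discharge_instructions: str
    document_types: tuple[str, ...]


def render_plan(case: CaseArchetype) -> list[dict]:
    """List one render job per document type, all sharing the same case facts."""
    shared = asdict(case)
    shared.pop("document_types")
    return [{"document_type": d, "case": shared} for d in case.document_types]


# Illustrative values only; not taken from the library.
EXAMPLE_CASE = CaseArchetype(
    case_id="EXAMPLE-001",
    age_years=67,
    sex="F",
    presenting_complaint="Fever and productive cough",
    labs={"CRP": "elevated"},
    treatments=("IV antibiotics", "Oral antibiotics on discharge"),
    follow_up_plan="GP review in one week",
    discharge_instructions="Complete the antibiotic course; return if symptoms worsen",
    document_types=("ed_assessment", "discharge_summary", "referral_letter"),
)

plan = render_plan(EXAMPLE_CASE)
```

Keeping the clinical facts in one record and deriving each document from it is what lets the documents in a pack agree with each other.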
Diversity controls
Eight style profiles ship today.
Each document type has three named template families that vary header / footer / section ordering without changing field labels or ground truth values. Visible synthetic disclaimer placement varies per document: footer line, top banner, boxed notice, or pale strip.
Pricing
| Tier | Scale | Price | Delivery |
|---|---|---|---|
| Free review pack | 25 to 35 documents | Free for qualified prospects | Same day on request |
| QA library | 200 documents | On request | Scoped per order |
| Training library | 500 to 5,000+ documents | On request | Scoped per order |
| Pilot Pack | 100 to 200 docs scoped to your use case | On request | Scoped per order |
| Custom variants | New document types, new case archetypes | On request | Scoped per order |
Synthetic safety
Every PDF carries a visible synthetic disclaimer on every page. Patient names, dates of birth, Medicare numbers, MRNs, addresses, phone numbers, clinician names, provider numbers and hospital names are computer-generated and do not refer to any real person or organisation.
Not for clinical care, coding, billing, or regulatory use.
Get the free 25-doc review pack
25 to 35 representative medical documents. Five-minute review path. Free for qualified prospects.