Native Medical Services

OpenMed currently operates across three distinct planes:

  • the local operator runtime
  • the configured model-provider path for general agent reasoning
  • the native medical services that power extraction, de-identification, terminology, and HCC/RAF work

The point of this split is not locality for its own sake. It is to keep sensitive or high-volume clinical processing on dedicated medical-service planes that can be governed separately, respond faster on long unstructured inputs, and operate more cost-efficiently than sending every page through a frontier LLM path.

Reference architecture

Inspectable runtime. Protected medical services. Explicit boundary.

OpenMed keeps sessions, plans, workflow previews, and artifacts in the operator runtime while general agent reasoning uses the configured model provider. Clinical extraction and medical-coding services sit behind separate configurable endpoints, so teams do not need to route every PHI-heavy or page-heavy workload through the same frontier-model path.

Plane 01

Operator runtime

CLI, TUI, sessions, plans, provenance, and workflow artifacts remain on the operator machine, with the review loop visible before teams depend on outputs.

Local runtime · Review surface · Artifact control
Plane 02

Clinical extraction service

NER, PII detection, de-identification, and batch extraction run through a dedicated endpoint.

OPENMED_INFERENCE_URL · Private container · Accelerated hardware
Plane 03

Medical coding and terminology service

ICD-10, CPT, SNOMED, LOINC, RxNorm, MedlinePlus, PubMed, HCC mapping, and RAF scoring run through a separate service boundary.

Terminology service · LOINC / RxNorm / MedlinePlus · HCC / RAF · Protected endpoint

Service planes

Clinical extraction plane

The extraction plane is the endpoint behind OPENMED_INFERENCE_URL.

It powers native OpenMed tools such as:

  • extract_entities
  • extract_pii
  • deidentify_text
  • batch extraction workflows built on the same service
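Calling one of these tools ultimately means sending text to the endpoint behind OPENMED_INFERENCE_URL. A minimal sketch of what such a call could look like follows; OPENMED_INFERENCE_URL and OPENMED_INFERENCE_API_KEY are documented configuration, but the `/deidentify` path and the JSON body shape are illustrative assumptions, not the documented OpenMed wire format:

```python
import json
import os
import urllib.request

def build_deidentify_request(text, env=os.environ):
    """Build a de-identification request against the extraction plane.

    The /deidentify route and {"text": ...} payload are hypothetical;
    only the environment-variable names come from the configuration surface.
    """
    base = env["OPENMED_INFERENCE_URL"].rstrip("/")
    req = urllib.request.Request(
        f"{base}/deidentify",
        data=json.dumps({"text": text}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    api_key = env.get("OPENMED_INFERENCE_API_KEY")
    if api_key:
        req.add_header("Authorization", f"Bearer {api_key}")
    return req

# Sending it would honor the documented timeout knob:
#   timeout = float(os.environ.get("OPENMED_INFERENCE_TIMEOUT_SECONDS", "30"))
#   with urllib.request.urlopen(req, timeout=timeout) as resp: ...
```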

Operationally, this is where OpenMed handles:

  • clinical NER
  • billing-oriented entity extraction
  • PII detection
  • de-identification

Current posture:

  • during preview, OpenMed provisions this endpoint on private Hugging Face accelerated infrastructure
  • access can be protected with OPENMED_INFERENCE_API_KEY
  • access can also use OPENMED_INFERENCE_HF_TOKEN or HF_TOKEN
  • packaged binaries can resolve embedded service credentials without hard-coding plaintext defaults into source
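The credential options above imply a fallback order. A small resolver sketch, assuming the explicit API key wins over the service-specific HF token, which wins over the generic HF_TOKEN (the precedence is an assumption; the packaged binary's actual resolution logic is not shown here):

```python
import os

def resolve_inference_credential(env=os.environ):
    """Return (variable name, value) for the first configured credential.

    Assumed order: OPENMED_INFERENCE_API_KEY, then
    OPENMED_INFERENCE_HF_TOKEN, then HF_TOKEN. Returns (None, None) if
    nothing is set, where a packaged binary may fall back to its
    embedded credentials.
    """
    for name in ("OPENMED_INFERENCE_API_KEY",
                 "OPENMED_INFERENCE_HF_TOKEN",
                 "HF_TOKEN"):
        value = env.get(name)
        if value:
            return name, value
    return None, None
```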

Coding and terminology plane

The coding plane is the endpoint behind OPENMED_MED_CODES_API_URL.

It powers native OpenMed tools such as:

  • PubMed search and abstract retrieval
  • ICD-10, CPT, SNOMED, and LOINC search / lookup / validation
  • RxNorm medication normalization and related-concept lookup
  • MedlinePlus patient-education topic lookup
  • code crosswalks
  • HCC mapping
  • RAF score calculation

This is the service boundary that makes the HCC and revenue-integrity story concrete: OpenMed can orchestrate clinical note review, extract coding candidates, and then hand those codes to a separate terminology/HCC service instead of collapsing everything into one generic model call.
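That handoff can be sketched as follows. OPENMED_MED_CODES_API_URL is documented configuration; the `/hcc/map` route, candidate fields, and payload layout are hypothetical illustrations of passing extracted candidates to a separate coding service rather than back into a generic model call:

```python
import os

def build_coding_handoff(candidates, env=os.environ):
    """Package extracted coding candidates for the terminology/HCC plane.

    `candidates` is a list of dicts like {"text": "E11.9", "system": "ICD-10"};
    the payload and route are illustrative assumptions.
    """
    base = env["OPENMED_MED_CODES_API_URL"].rstrip("/")
    return {
        "url": f"{base}/hcc/map",  # hypothetical route
        "payload": {
            "candidates": [
                {"text": c["text"], "system": c.get("system", "ICD-10")}
                for c in candidates
            ],
        },
    }
```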

Current posture:

  • during preview, OpenMed provisions this endpoint on private Hugging Face accelerated infrastructure
  • access can be protected with OPENMED_MED_CODES_API_KEY
  • access can also use OPENMED_MED_CODES_HF_TOKEN or HF_TOKEN
  • packaged binaries can resolve embedded service credentials without hard-coding plaintext defaults into source

Why the split matters

This architecture gives OpenMed a stronger healthcare deployment story than a single undifferentiated model endpoint:

  • extraction and de-identification can scale independently from coding and terminology lookup
  • HCC and RAF operations can live behind a dedicated protected service boundary
  • sensitive or page-heavy clinical inputs do not need to consume frontier-model context for every extraction or coding pass
  • teams can reserve the model-provider path for general reasoning while keeping protected clinical processing on dedicated services
  • the service tier can be swapped without changing the operator workflow surface

These service endpoints are native OpenMed backends. They are not remote MCP servers.

Deployment patterns

Preview reference
  Runtime: Local operator machine
  Extraction / PII: Private Hugging Face accelerated endpoint operated by OpenMed
  Coding / HCC: Separate protected med-codes endpoint operated by OpenMed

Customer cloud
  Runtime: Local operator machine or managed desktop
  Extraction / PII: Private container in VPC / private cloud
  Coding / HCC: Private terminology and HCC service in the same environment

On-prem / edge
  Runtime: Managed workstation
  Extraction / PII: Local GPU or internal inference cluster
  Coding / HCC: Internal terminology / HCC API

Lab / dev
  Runtime: Local operator machine
  Extraction / PII: Local or sandbox endpoint
  Coding / HCC: Local or sandbox endpoint

Private Hugging Face hosting is the current preview deployment, not a hard dependency. The real product boundary is the pair of configurable endpoint URLs.
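Since the boundary is the pair of URLs, moving from the preview deployment to a customer-managed environment amounts to re-pointing them; the hostnames below are placeholders, not real OpenMed endpoints:

```shell
# Customer-cloud pattern: same operator workflow, services private to the VPC.
export OPENMED_INFERENCE_URL="https://extraction.internal.example.com"
export OPENMED_MED_CODES_API_URL="https://terminology.internal.example.com"
```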

Security and access model

OpenMed itself does not claim that every deployment is automatically compliant because one part of the product is local. The defensible statement is narrower and more useful:

  • the operator runtime and review loop are inspectable
  • the model-provider path and the medical-service tier are separate, explicit boundaries
  • the actual privacy and compliance posture depends on where those services are hosted and governed
  • service access can be protected with API keys and optional bearer tokens
  • packaged binaries can carry embedded service credentials instead of shipping plaintext defaults
  • no product telemetry is built into the runtime

During preview, OpenMed serves the native medical-service tier from private Hugging Face infrastructure so evaluators do not need to deploy it themselves. The same workflow surface can later target customer-managed cloud or on-prem environments while preserving the OpenMed runtime and review loop.

Configuration surface

export OPENMED_INFERENCE_URL="https://<private-inference-endpoint>"
export OPENMED_INFERENCE_API_KEY="..."
export OPENMED_INFERENCE_TIMEOUT_SECONDS="30"

export OPENMED_MED_CODES_API_URL="https://<private-med-codes-endpoint>"
export OPENMED_MED_CODES_API_KEY="..."
export OPENMED_MED_CODES_TIMEOUT_SECONDS="10"

export OPENMED_SERVICE_MAX_RETRIES="3"
export OPENMED_SERVICE_RETRY_BACKOFF="1"
export OPENMED_SERVICE_CIRCUIT_OPEN_SECONDS="120"
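The three service-resilience knobs above suggest a retry-then-circuit-breaker pattern. A minimal sketch, assuming exponential backoff and one breaker per caller (the variable names are real; the exact semantics are an illustrative guess, not OpenMed's implementation):

```python
import os
import time

class ServiceCaller:
    """Retry + circuit-breaker sketch around the documented knobs."""

    def __init__(self, env=os.environ, clock=time.monotonic, sleep=time.sleep):
        self.max_retries = int(env.get("OPENMED_SERVICE_MAX_RETRIES", "3"))
        self.backoff = float(env.get("OPENMED_SERVICE_RETRY_BACKOFF", "1"))
        self.open_seconds = float(env.get("OPENMED_SERVICE_CIRCUIT_OPEN_SECONDS", "120"))
        self.open_until = 0.0  # breaker closed initially
        self.clock = clock
        self.sleep = sleep

    def call(self, fn):
        if self.clock() < self.open_until:
            raise RuntimeError("circuit open: service calls suppressed")
        last_exc = None
        for attempt in range(self.max_retries + 1):
            try:
                return fn()
            except Exception as exc:
                last_exc = exc
                if attempt < self.max_retries:
                    self.sleep(self.backoff * (2 ** attempt))  # exponential backoff
        # every attempt failed: open the circuit for the configured window
        self.open_until = self.clock() + self.open_seconds
        raise last_exc
```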

Optional bearer-token auth:

export OPENMED_INFERENCE_HF_TOKEN="..."
export OPENMED_MED_CODES_HF_TOKEN="..."

See Configuration for the full environment-variable reference and Privacy & Security for the runtime-boundary explanation.