Native Medical Services¶
OpenMed currently operates across three relevant boundaries:
- the local operator runtime
- the configured model-provider path for general agent reasoning
- the native medical services that power extraction, de-identification, terminology, and HCC/RAF work
The point of this split is not locality for its own sake. It is to keep sensitive or high-volume clinical processing on dedicated medical-service planes that can be governed separately, respond faster on long unstructured inputs, and operate more cost-efficiently than sending every page through a frontier LLM path.
OpenMed Agent is currently in preview. To evaluate the binary and hosted medical-service tier, request preview access.
Inspectable runtime. Protected medical services. Explicit boundary.
OpenMed keeps sessions, plans, workflow previews, and artifacts in the operator runtime while general agent reasoning uses the configured model provider. Clinical extraction and medical-coding services sit behind a separate native service plane, so teams do not need to route every PHI-heavy or page-heavy workload through the same frontier-model path.
Operator runtime
CLI, TUI, sessions, plans, provenance, and workflow artifacts remain on the operator machine, with the review loop visible before teams depend on outputs.
Clinical extraction service
NER, PII detection, de-identification, and batch extraction run through the native service plane.
Medical coding and terminology service
ICD-10, CPT, SNOMED, LOINC, RxNorm, MedlinePlus, PubMed, HCC mapping, and RAF scoring run through a separate service boundary.
Service planes¶
Clinical extraction plane¶
The extraction plane is the native service boundary for clinical text processing.
It powers native OpenMed tools such as:
extract_entitiesextract_piideidentify_text- batch extraction workflows built on the same service
Operationally, this is where OpenMed handles:
- clinical NER
- billing-oriented entity extraction
- PII detection
- de-identification
Current posture:
- preview builds use the OpenMed-hosted native service by default
- custom deployments will be able to move this boundary to customer-managed infrastructure
- the service boundary can move without changing the local workflow surface
Coding and terminology plane¶
The coding plane is the native service boundary for terminology, education, and risk adjustment.
It powers native OpenMed tools such as:
- PubMed search and abstract retrieval
- ICD-10, CPT, SNOMED, and LOINC search / lookup / validation
- RxNorm medication normalization and related-concept lookup
- MedlinePlus patient-education topic lookup
- code crosswalks
- HCC mapping
- RAF score calculation
This is the service boundary that makes the HCC and revenue-integrity story concrete: OpenMed can orchestrate clinical note review, extract coding candidates, and then hand those codes to a separate terminology/HCC service instead of collapsing everything into one generic model call.
Current posture:
- preview builds use the OpenMed-hosted native service by default
- custom deployments will be able to move this boundary to customer-managed infrastructure
- the service boundary can move without changing the local workflow surface
Why the split matters¶
This architecture gives OpenMed a stronger healthcare deployment story than a single undifferentiated model endpoint:
- extraction and de-identification can scale independently from coding and terminology lookup
- HCC and RAF operations can live behind a dedicated protected service boundary
- sensitive or page-heavy clinical inputs do not need to consume frontier-model context for every extraction or coding pass
- teams can reserve the model-provider path for general reasoning while keeping protected clinical processing on dedicated services
- the service tier can be swapped without changing the operator workflow surface
These service planes are native OpenMed backends. They are not remote MCP servers.
Deployment patterns¶
| Pattern | Runtime | Extraction / PII | Coding / HCC |
|---|---|---|---|
| Preview reference | Local operator machine | OpenMed-hosted native extraction endpoint | OpenMed-hosted native coding and terminology endpoint |
| Customer cloud | Local operator machine or managed desktop | Private container in VPC / private cloud | Private terminology and HCC service in the same environment |
| On-prem / edge | Managed workstation | Local GPU or internal inference cluster | Internal terminology / HCC API |
| Lab / dev | Local operator machine | Local or sandbox endpoint | Local or sandbox endpoint |
The real product boundary is the native service plane, not one fixed hosting environment.
Security and access model¶
OpenMed itself does not claim that every deployment is automatically compliant because one part of the product is local. The defensible statement is narrower and more useful:
- the operator runtime and review loop are inspectable
- the model-provider path and the medical-service tier are separate, explicit boundaries
- the actual privacy and compliance posture depends on where those services are hosted and governed
- no product telemetry is built into the runtime
During preview, OpenMed serves the native medical-service tier so evaluators do not need to deploy it themselves. The same workflow surface can later target customer-managed cloud or on-prem environments while preserving the OpenMed runtime and review loop.
If you need an evaluator build, request preview access.
Configuration surface¶
Preview binaries use OpenMed's native services without extra local service setup. Customer-managed deployments are planned for teams that need to run the same service plane in their own cloud, VPC, or on-prem environment.
See Privacy & Security for the runtime-boundary explanation.