SCS System Architecture
Version: v1.0 (Draft) | Status: Draft | Authoritative: Yes
Purpose
This document defines how SCS (Symphony Core System) realizes the SCP protocol as a concrete, deployable system.
It answers:
- how the physical node network is structured and what runs where
- how services map to the SCP 3 Planes + 2 Services architecture
- how nodes communicate with each other and with external users
- what data each node type stores and how persistence is organized
- how smart contracts on Aptos realize staking, escrow, and settlement
- how TEE environments are integrated for privacy-preserving computation
- how the system handles failure, reconciliation, and operational concerns
For the normative protocol semantics that this system must preserve, see:
Design Principles
SCS realizes protocol truth defined by SCP, not redefines it. This means:
- SCP defines lifecycle meaning, actor duties, semantic rules, and economic invariants
- SCS defines the runtime services, storage, APIs, and operational mechanisms that preserve those meanings in production
- a deployment may combine or separate services differently, as long as the same protocol semantics are preserved
- SCS adds no protocol concepts — every SCS behavior must trace back to an SCP requirement
Part 1: Deployment Topology
Node Network
The SCS network consists of two node types operating in a permissioned topology:
┌──────────────────────────────────────┐
│ Master Node Cluster │
│ (3 nodes, BFT consensus, Symphony) │
│ │
│ ┌─────────┐ ┌─────────┐ ┌─────────┐│
│ │Master-1 │ │Master-2 │ │Master-3 ││
│ └────┬────┘ └────┬────┘ └────┬────┘│
│ └──────┬────┴────────────┘ │
│ │ BFT Consensus │
└──────────────┼───────────────────────┘
│
┌──────────────┼───────────────────────┐
│ Inter-Node Communication Layer │
│ (mTLS, authenticated, encrypted) │
└──────┬───────┼──────────┬────────────┘
│ │ │
┌────────────┴┐ ┌───┴────────┐ ┌┴────────────┐
│Enterprise-A │ │Enterprise-B│ │Enterprise-C │
│ (Vault-A) │ │ (Vault-B) │ │ (Vault-C) │
└─────────────┘ └────────────┘ └─────────────┘
Master Node Cluster
The 3 master nodes are operated by the Symphony Foundation. They collectively run:
- Admission Plane services — API gateway, identity, consent verification, semantic resolution, privacy budget reservation, TaskEnvelope assembly
- Settlement Plane services — result verification, challenge lifecycle, reward accounting, penalty enforcement, payout execution
- Governance Service — policy version management, epoch management, actor registry, governance proposals
- BFT consensus engine — protocol finality requires agreement from at least 2 of 3 master nodes
Master nodes do NOT store private user data. They see only commitments, metadata, and protocol-visible artifacts.
Each master node is a full replica:
- every master node runs the complete set of master services
- any single master node failure does not interrupt protocol operation
- state is synchronized through the BFT consensus protocol
- each master node independently validates all protocol transitions
Enterprise Nodes
Enterprise nodes are operated by approved enterprises. Each enterprise node runs:
- Vault services (Data Sovereignty Service) — encrypted record storage, consent management, privacy budget ledger, authorization enforcement
- Execution services (Execution Plane subset) — Vault-internal computation, optional TEE execution, slice result production
- Node agent — handles inter-node communication, heartbeat, and coordination protocol with the master cluster
Enterprise nodes do NOT run admission, verification, settlement, or governance services. Those remain on the master cluster.
Each enterprise node is independently operated:
- the enterprise controls its own infrastructure, storage, and operational practices
- the enterprise's data never leaves the Vault boundary except as commitment references or TEE-processed results
- the enterprise manages its own consent records for data subjects within its Vault
- enterprise nodes may optionally contribute compute capacity beyond their own Vault data
End Users
End users (Data Producers and Task Submitters) interact with the system through the master cluster's API gateway. They do not run nodes. Their data flows:
- Data upload: user → API gateway → master node routes to assigned enterprise Vault → parse execution within Vault
- Task submission: user → API gateway → Admission Plane on master → TaskEnvelope → dispatched to relevant nodes
Part 2: Service Architecture
Service-to-Node Mapping
| Service | Runs On | SCP Component |
|---|---|---|
| API Gateway | Master nodes | Admission Plane entry |
| Identity & Auth Service | Master nodes | Admission Plane |
| Consent Verification Service | Master nodes (invokes Consent Manager on enterprise nodes via RPC) | Admission Plane |
| Semantic Registry Service | Master nodes | Admission Plane |
| Privacy Budget Reservation Service | Master nodes (invokes Privacy Budget Ledger on enterprise nodes via RPC) | Admission Plane |
| TaskEnvelope Assembly Service | Master nodes | Admission Plane |
| Task Orchestrator | Master nodes | Execution Plane (coordination) |
| Multi-Vault Coordinator | Master nodes | Execution Plane (coordination) |
| Vault Storage Engine | Enterprise nodes | Data Sovereignty Service |
| Consent Manager | Enterprise nodes | Data Sovereignty Service |
| Privacy Budget Ledger | Enterprise nodes | Data Sovereignty Service |
| Computation Runtime | Enterprise nodes (+ optional TEE) | Execution Plane (computation) |
| Aggregation Runtime | Master nodes (in TEE) | Execution Plane (aggregation) |
| Result Verification Service | Master nodes | Settlement Plane |
| Challenge Manager | Master nodes | Settlement Plane |
| Reward Accounting Service | Master nodes | Settlement Plane |
| Payout Service | Master nodes | Settlement Plane |
| Policy Version Manager | Master nodes | Governance Service |
| Epoch Manager | Master nodes | Governance Service |
| Actor Registry | Master nodes | Governance Service |
| BFT Consensus Engine | Master nodes | Cross-cutting |
Master Node Services Detail
API Gateway
- single entry point for all external requests (user uploads, task submissions, result queries)
- TLS termination, rate limiting, DDoS protection
- routes requests to the appropriate internal service based on request type
- does NOT perform protocol logic — it is a routing and protection layer only
Task Orchestrator
The central state machine driver:
- receives a validated TaskEnvelope from the Admission Plane
- drives the task through the canonical state transitions (accepted → resolving → dispatched → verifying → awaiting_settlement → completed), as sketched after this list
- for multi-Vault tasks: delegates to the Multi-Vault Coordinator
- for composite tasks: manages sub-task sequencing and iteration control
- emits protocol events for every state transition (auditable, replayable)
- enforces timeouts at every stage
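The transition guard can be sketched as follows. This is a minimal illustration using the canonical state names above; the real orchestrator's service interfaces, event schema, and persistence are defined elsewhere, and the function and field names here are assumptions for illustration only.

```python
from enum import Enum

class TaskState(Enum):
    ACCEPTED = "accepted"
    RESOLVING = "resolving"
    DISPATCHED = "dispatched"
    VERIFYING = "verifying"
    AWAITING_SETTLEMENT = "awaiting_settlement"
    COMPLETED = "completed"

# Canonical forward edges; anything else is an illegal transition.
ALLOWED = {
    TaskState.ACCEPTED: {TaskState.RESOLVING},
    TaskState.RESOLVING: {TaskState.DISPATCHED},
    TaskState.DISPATCHED: {TaskState.VERIFYING},
    TaskState.VERIFYING: {TaskState.AWAITING_SETTLEMENT},
    TaskState.AWAITING_SETTLEMENT: {TaskState.COMPLETED},
}

def transition(task_id: str, current: TaskState, target: TaskState, events: list) -> TaskState:
    """Apply one transition, emitting an auditable, replayable protocol event."""
    if target not in ALLOWED.get(current, set()):
        raise ValueError(f"illegal transition {current.value} -> {target.value} for task {task_id}")
    events.append({"task_id": task_id, "from": current.value, "to": target.value})
    return target
```

Every successful call appends one event, so the full lifecycle of any task can be replayed from the event log alone.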
Multi-Vault Coordinator
- expands the coordination envelope into per-Vault execution assignments
- sends execution assignments to target enterprise nodes
- tracks per-Vault slice progress and enforces quorum
- triggers aggregation when sufficient slices complete
- handles partial completion and timeout logic
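The quorum and partial-completion logic above can be sketched as a pure decision function. The status vocabulary ('pending', 'complete', 'timed_out') and the function name are illustrative assumptions, not the SCS API.

```python
def coordinate(slice_status: dict, quorum: int) -> str:
    """Decide the coordinator's next step from per-Vault slice states.
    slice_status maps vault_id -> 'pending' | 'complete' | 'timed_out'.
    """
    complete = sum(1 for s in slice_status.values() if s == "complete")
    pending = sum(1 for s in slice_status.values() if s == "pending")
    if complete >= quorum:
        return "trigger_aggregation"           # quorum met, possibly with partial coverage
    if complete + pending < quorum:
        return "reject_or_retry_under_policy"  # quorum can no longer be reached
    return "wait"                              # still reachable; keep tracking
```

Timed-out Vaults never count toward quorum, which is what makes the partial-coverage and rejection branches deterministic.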
Aggregation Runtime
- executes inside the aggregation environment declared in the coordination envelope. SCP permits three such environments: (a) an attested TEE, (b) a secure multi-party computation protocol, or (c) a homomorphic encryption scheme. SCS v1 ships (a) as the default implementation on master nodes; (b) and (c) are supported as pluggable runtimes that may be enabled per policy without changing the outer service contract
- when the runtime is an attested TEE, it is provisioned on a master node but operates under a dedicated aggregator actor identity, distinct from the master node's Settlement/Governance identities, so that the aggregator is bound by Executor-class staking, penalty, and governance rules as required by SCP. The aggregator actor must never coincide with the task submitter
- receives per-Vault slice results (commitment references and privacy-safe outputs)
- executes the declared aggregation method (union, intersection, secure_sum, federated_average)
- enforces cardinality thresholds at the aggregate level
- produces the aggregate result with cryptographic proof and (for TEE runtimes) an AttestationReport
- does NOT retain per-Vault inputs after producing the aggregate
Because the Aggregation Runtime and the Result Verification Service may physically co-locate on the master cluster, SCS enforces logical separation: the aggregator signs with an Executor-class key, verification runs in a separate service process with its own key material, and challenge evidence always distinguishes the aggregator actor from the verifier actor. A deployment that exposes Aggregation Runtime to an independent third-party Executor is permitted and encouraged once the actor registry supports external aggregators.
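For illustration, here is a sketch of one declared aggregation method, federated_average, with the aggregate-level cardinality threshold enforced before release. The field names (n, mean, vault_id) and the threshold check are assumptions about the slice shape, not the SCS schema.

```python
def federated_average(slices: list, min_cardinality: int) -> dict:
    """Weighted average over privacy-safe per-Vault outputs."""
    total_n = sum(s["n"] for s in slices)  # per-Vault sample counts
    if total_n < min_cardinality:
        raise ValueError("aggregate below cardinality threshold; refusing to release")
    mean = sum(s["n"] * s["mean"] for s in slices) / total_n
    return {"method": "federated_average", "n": total_n, "mean": mean}

result = federated_average(
    [{"vault_id": "A", "n": 120, "mean": 4.2},
     {"vault_id": "B", "n": 80,  "mean": 3.9}],
    min_cardinality=100,
)  # -> mean 4.08 over 200 samples; per-Vault inputs are not retained
```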
Semantic Registry Service
- stores domain hierarchy, canonical attributes, query attributes, derivation rules
- handles semantic resolution requests during task admission
- manages attribute lifecycle transitions (proposed → registered → active → deprecated → retired)
- supports local attribute candidate aggregation and promotion workflows
Reward Accounting Service
- receives finalized settlement contexts
- calculates per-actor reward shares using the fee distribution formula and the active policy version. The full set of recipient classes is: Data Contributors (Data Producers), Vault Operators, Executors, Aggregators, and Verifiers. The sum of all per-Vault shares plus the aggregator and verifier shares must equal the total task reward budget, with no unaccounted remainder (an SCP economics invariant)
- applies the staking multiplier (1.0 for staked, 0.5 for unstaked Data Producers) and routes the unrewarded 50% back to the epoch reward pool via an explicit pool-return record, so that the economics invariant above still holds after the multiplier is applied (see the sketch after this list)
- calculates data quality rewards for parsed records
- calculates data usage dividends for record contributions
- produces reward records and submits payout instructions to the Payout Service
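A minimal sketch of the split and its invariant follows. The share fractions in the example are hypothetical placeholders, not SCP policy values; the point is that payouts plus the pool return always sum exactly to the budget.

```python
from decimal import Decimal

def split_rewards(budget, producer_shares, aggregator_share, verifier_share, staked):
    """Shares must sum to 1; returns per-actor payouts plus the pool return."""
    payouts, pool_return = {}, Decimal(0)
    for producer, share in producer_shares.items():
        amount = budget * share
        if not staked.get(producer, False):  # unstaked Data Producer: 0.5x multiplier
            pool_return += amount / 2        # explicit pool-return record
            amount -= amount / 2
        payouts[producer] = amount
    payouts["aggregator"] = budget * aggregator_share
    payouts["verifier"] = budget * verifier_share
    # SCP economics invariant: no unaccounted remainder.
    assert sum(payouts.values()) + pool_return == budget
    return {"payouts": payouts, "pool_return": pool_return}

split = split_rewards(
    Decimal(100),
    {"producer-1": Decimal("0.6"), "producer-2": Decimal("0.2")},
    Decimal("0.12"), Decimal("0.08"),
    staked={"producer-1": True, "producer-2": False},
)
# producer-1: 60, producer-2: 10 (10 returned to the epoch pool),
# aggregator: 12, verifier: 8; total 100, the full budget.
```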
Payout Service
- transforms finalized reward records into a PayoutInstructionSet for the Aptos chain
- manages signer isolation: payout and reward distribution use keys distinct from the Treasury multi-sig and from the BFT consensus keys of the master nodes, so that a compromise of one key class does not automatically grant authority over another
- batches payout instructions by epoch
- after Settlement Plane finalization, anchors the corresponding SettlementRoot hash on Aptos as part of the same epoch batch, tracks confirmation, and retries on reorg without mutating the finalized settlement context
- handles submission, retry, and confirmation tracking
- records payout success or failure without mutating finalized reward records
Enterprise Node Services Detail
Vault Storage Engine
- stores canonical private records in encrypted form
- maintains Commitment = SHA256(Serialize(CR)) for every record (sketched below)
- enforces encryption at rest and in transit
- supports record retrieval only through authenticated, authorized channels
- no plaintext leaves the Vault except into an attested TEE
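The commitment computation can be sketched as follows, under the assumption that Serialize(CR) is a canonical, deterministic JSON encoding; the real SCS serialization format is defined by the protocol and may differ.

```python
import hashlib, json

def commitment(canonical_record: dict) -> str:
    """Commitment = SHA256(Serialize(CR)) over a deterministic encoding."""
    serialized = json.dumps(canonical_record, sort_keys=True,
                            separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(serialized).hexdigest()
```

Determinism of the serialization is what makes the commitment integrity-linked: any master node or verifier can recompute it without ever seeing the plaintext leave the Vault.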
Consent Manager
- stores per-data-subject consent records within the Vault
- enforces consent checks before any data access
- handles consent grant, withdrawal, and expiry
- consent state never exported as plaintext — only consent verification results (yes/no) are shared
Privacy Budget Ledger
- maintains per-data-subject, per-usage-scope budget tracking
- processes reserve operations from the master cluster during task admission
- processes commit operations during execution
- processes release operations when tasks fail before execution
- enforces that committed consumption is append-only
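A minimal sketch of the two-phase ledger, assuming a single per-subject, per-scope budget; entry shapes and method names are illustrative. Committed entries are append-only, while a release simply cancels a reservation that never committed.

```python
class PrivacyBudgetLedger:
    def __init__(self, budget: float):
        self.budget = budget
        self.committed = []   # append-only once written
        self.reserved = {}    # reservation_id -> epsilon

    def available(self) -> float:
        return self.budget - sum(self.committed) - sum(self.reserved.values())

    def reserve(self, reservation_id: str, epsilon: float) -> bool:
        if epsilon > self.available():
            return False      # admission must fail; never over-reserve
        self.reserved[reservation_id] = epsilon
        return True

    def commit(self, reservation_id: str) -> None:
        self.committed.append(self.reserved.pop(reservation_id))

    def release(self, reservation_id: str) -> None:
        self.reserved.pop(reservation_id, None)  # task failed before execution
```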
Computation Runtime
- executes authorized computation on Vault-internal records
- for parse tasks: runs OCR, extraction, and structured output generation
- for query/compute/train slices: evaluates against authorized records and produces a SliceResultBundle
- may delegate to a local TEE for cross-Vault tasks where policy requires
- produces commitment-linked, deterministic outputs
Part 3: Inter-Node Communication
Communication Model
All inter-node communication uses mutual TLS (mTLS) with certificate-based authentication.
Master-to-Master
- BFT consensus messages: state synchronization, block proposals, vote messages
- Service replication: configuration, registry, and accounting state
- protocol: internal consensus protocol (gRPC-based)
Master-to-Enterprise
- Execution assignments: TaskEnvelope + per-Vault execution parameters
- Consent verification requests: "does data subject X have consent for usage scope Y?"
- Privacy budget operations: reserve, commit, release
- Slice result collection: enterprise returns a SliceResultBundle after execution
- Heartbeat and health: enterprise node liveness monitoring
- protocol: gRPC over mTLS, with message signing
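A sketch of establishing such a channel with the grpcio library follows. The file paths and target address are placeholders; a real deployment sources key material from its secret store rather than from local files.

```python
import grpc

def enterprise_channel(target: str) -> grpc.Channel:
    """Open an mTLS gRPC channel from a master node to an enterprise node."""
    with open("ca.pem", "rb") as f:
        root_ca = f.read()            # trust anchor for the peer's certificate
    with open("master.key", "rb") as f:
        private_key = f.read()        # this node's identity (mutual TLS)
    with open("master.pem", "rb") as f:
        cert_chain = f.read()
    creds = grpc.ssl_channel_credentials(
        root_certificates=root_ca,
        private_key=private_key,
        certificate_chain=cert_chain,
    )
    return grpc.secure_channel(target, creds)
```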
Enterprise-to-Master
- Slice results: completed execution output with proof references
- Privacy budget responses: reservation confirmations, commit acknowledgments
- Consent verification responses: yes/no results
- Health reports: storage capacity, compute availability, uptime metrics
User-to-Master
- Data upload: user uploads raw data (receipt images, documents) via REST/gRPC API
- Task submission: user submits task requests via REST/gRPC API
- Result retrieval: user queries task status and retrieves settled results
- Account management: staking, unstaking, reward queries
- protocol: HTTPS REST or gRPC, with JWT/OAuth2 authentication
Part 4: Data Architecture
Persistence by Node Type
Master Node Persistence
Master nodes store protocol state, not private data:
| Store | Contents | Properties |
|---|---|---|
| Task State Store | Task lifecycle events, TaskEnvelopes, state transitions | Append-only, replicated across 3 masters |
| Semantic Registry | Domains, canonical attributes, query attributes, derivation rules | Version-aware, replicated |
| Settlement Store | Settlement contexts (candidate and finalized), SettlementRoot records | Append-only, replicated |
| Reward Ledger | Per-actor reward records, adjustments, payout intents | Append-only, replicated |
| Challenge Store | Challenge lifecycle records, evidence references | Append-only, replicated |
| Epoch Store | Epoch definitions, window metadata, aggregation scopes | Replicated |
| Policy Store | Policy version definitions, governance proposals | Replicated |
| Actor Registry | Actor identities, role bindings, staking state | Replicated |
| Audit Log | All protocol events with timestamps | Append-only, immutable |
Enterprise Node Persistence
Enterprise nodes store private data and local protocol state:
| Store | Contents | Properties |
|---|---|---|
| Vault Record Store | Encrypted canonical private records | Encrypted at rest, access-controlled |
| Commitment Store | Commitment references for all records | Integrity-linked to records |
| Consent Store | Per-data-subject consent records | Never exported as plaintext |
| Privacy Budget Ledger | Per-subject, per-scope budget entries | Append-only for committed entries |
| Local Attribute Pool | Local semantic candidates and stability metrics | Vault-scoped |
| Execution Transcript Store | Per-task execution evidence and proof references | Append-only |
Replay and Idempotency
The system must preserve:
- replay-critical version and epoch context on all persisted records
- idempotent create behavior for externally retried requests (deduplication by request ID)
- deterministic linkage between tasks, semantic resolution, execution artifacts, and verification decisions
- append-only finalized accounting — corrections through explicit adjustment records, never silent mutation
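The idempotent-create behavior can be sketched as follows, assuming deduplication by a client-supplied request ID; the in-memory dict stands in for the replicated Task State Store.

```python
_results: dict = {}  # request_id -> first response (stands in for replicated storage)

def idempotent_create(request_id: str, create_fn) -> dict:
    """Externally retried requests return the original result, with no new effect."""
    if request_id in _results:
        return _results[request_id]
    result = create_fn()
    _results[request_id] = result
    return result
```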
Part 5: Smart Contract Architecture
Contracts on Aptos
All protocol-managed SYM is held by smart contracts on the Aptos blockchain:
Staking Contract
- accepts SYM deposits for S_master, S_enterprise, and S_producer tiers
- enforces minimum stake requirements per tier, read from the Policy Store at the active policy_version
- processes slashing instructions from the Settlement Plane
- manages cooldown periods for unstaking. Cooldown durations are governance parameters, not hard-coded constants; the current policy values (e.g., 30 days for nodes, 7 days for Data Producers) are loaded from the Policy Store and may be revised through a governance proposal
- emits staking state changes as on-chain events
Escrow Contract
- locks SYM from Task Submitters at task admission
- holds locked funds until settlement finalization
- releases funds to recipients upon finalized reward distribution
- returns funds to submitter for rejected-before-execution tasks
- supports partial release for tasks that fail during execution
Reward Contract
- receives epoch-aggregated reward distributions from the Payout Service
- distributes SYM to recipients: Data Producers (Data Contributors), Vault Operators, Executors, Aggregators, and Verifiers. Every payout batch carries per-class amounts so that the on-chain record preserves the SCP invariant that per-Vault shares plus aggregator and verifier shares equal the total task reward budget, with no unaccounted remainder
- applies staking multiplier: full distribution to staked producers, 50% to unstaked; the unrewarded 50% is explicitly routed back to the epoch reward pool through a pool-return instruction rather than left as a silent remainder
- maintains per-epoch reward records on-chain
Treasury Contract
- holds the Protocol Treasury and Ecosystem Incentives allocations
- releases funds only through governance-approved proposals
- receives the Protocol Treasury share of task fees
- funds data quality rewards and operational costs during bootstrap
- multi-sig controlled by a dedicated Treasury signer set that is disjoint from the keys used by the master nodes for BFT consensus, Settlement Plane operations, and Payout submission. This signer isolation ensures that a compromise of the operational master cluster does not automatically confer authority to move treasury funds. Treasury signers are bound by governance policy, and any release additionally requires an on-chain governance proposal reference
Inflation Controller
- manages the declining bootstrap inflation schedule. The current schedule values (e.g., 8% → 5% → 3% → 1.5% → 0%) and the bootstrap duration (expected 3-5 years) are governance parameters loaded from the Policy Store, not hard-coded constants. Governance may revise the schedule or shorten/extend the bootstrap phase within the bounds set by SCP economics
- mints new SYM per epoch according to the active schedule
- distributes minted SYM to the reward pool and treasury
- automatically disables when the governance-defined terminal epoch is reached (by default at the end of the bootstrap phase)
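A sketch of per-epoch minting under such a schedule follows. The rates and epoch boundaries mirror the example values in the text but are placeholders loaded from the Policy Store in practice, and the one-epoch-per-day cadence is an assumption for illustration.

```python
SCHEDULE = [  # (first_epoch, annual_rate); governance parameters, not constants
    (0, 0.08), (365, 0.05), (730, 0.03), (1095, 0.015), (1460, 0.0),
]
EPOCHS_PER_YEAR = 365  # assumption: one epoch per day

def mint_for_epoch(epoch: int, circulating_supply: float) -> float:
    """Return the SYM to mint this epoch; 0.0 after the terminal epoch."""
    rate = 0.0
    for first_epoch, annual_rate in SCHEDULE:
        if epoch >= first_epoch:
            rate = annual_rate
    return circulating_supply * rate / EPOCHS_PER_YEAR
```

Once the governance-defined terminal epoch is reached, the active rate is 0.0 and the controller mints nothing, which is the automatic-disable behavior described above.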
On-Chain vs Off-Chain Boundary
| Concern | On-Chain (Aptos) | Off-Chain (SCS services) |
|---|---|---|
| Staking state | Current stake, slashing records | Staking UI, deposit workflow |
| Task fees | Locked in escrow, distributed on settlement | Fee calculation, admission validation |
| Rewards | Final distribution amounts | Reward calculation, multi-Vault distribution |
| Settlement | SettlementRoot hash anchored on-chain | Full settlement context stored off-chain |
| Governance | Proposal voting results | Proposal creation, discussion, execution |
The on-chain layer provides finality and transparency. The off-chain layer provides computation, privacy, and throughput.
Part 6: TEE Integration
TEE Provisioning
- master nodes maintain a TEE environment for secure aggregation
- enterprise nodes may maintain local TEE environments for cross-Vault execution
- TEE enclaves are provisioned with pre-approved computation code (measurement hash registered in the governance registry)
- enclave identity and measurement are verified at provisioning time
TEE Execution Flow
For a cross-Vault computation:
- master node's Multi-Vault Coordinator assigns execution to TEE
- enterprise Vaults encrypt record material with the TEE enclave's public key
- encrypted data is transferred to the TEE environment
- TEE decrypts data inside the enclave, performs computation
- TEE produces a result with an AttestationReport (enclave identity, measurement hash, platform identity, freshness nonce)
- result and attestation are returned to the master node
- TEE purges all record material after execution
TEE Attestation Verification
- the Result Verification Service on master nodes validates every AttestationReport
- verification checks: enclave measurement matches registered code, platform is not revoked, freshness nonce is valid (see the sketch after this list)
- failed attestation results in task rejection
- TEE platform revocations are tracked in the governance registry
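The three checks can be sketched as a single predicate. Report field names and the registry lookups are illustrative assumptions about shapes, not the SCS schema.

```python
def verify_attestation(report: dict, registered_measurements: set,
                       revoked_platforms: set, expected_nonce: str) -> bool:
    if report["measurement_hash"] not in registered_measurements:
        return False   # enclave code not governance-approved
    if report["platform_id"] in revoked_platforms:
        return False   # platform revoked in the governance registry
    if report["nonce"] != expected_nonce:
        return False   # stale or replayed report
    return True        # any failed check results in task rejection
```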
Part 7: Operational Concerns
Monitoring and Alerting
The system should monitor:
- Node health: master and enterprise node liveness, resource utilization
- Task throughput: admission rate, execution latency, settlement time
- Privacy budget: budget utilization rates per Vault, approaching-exhaustion warnings
- Staking: stake levels, approaching-minimum warnings, slashing events
- Consensus: BFT round times, missed votes, view changes
- On-chain: transaction confirmation times, gas costs, contract state
Failure Handling
Master Node Failure
- single master failure: the remaining 2 masters maintain consensus and protocol operation
- double master failure: protocol halts until at least 2 masters are available (safety over liveness)
- failed master rejoins through state catch-up from the surviving nodes
Enterprise Node Failure
- enterprise node failure during task execution: affected slices time out
- if quorum is still met with remaining Vaults: task proceeds with partial coverage
- if quorum is broken: task is rejected or retried under policy
- persistent enterprise node failure: governance may suspend the node and freeze task dispatch to its Vault. Consent records are never exported from the suspended Vault in plaintext (this would violate the Consent Manager invariant that consent state is Vault-local). Instead, affected data subjects are notified and may re-authorize their data under a different approved enterprise node through a fresh consent grant, or the records remain quarantined until the original node is restored. Any migration of private records between Vaults proceeds through a TEE-mediated, re-encrypted transfer authorized by a governance proposal, not through direct plaintext copy
On-Chain Failure
- Aptos chain congestion: payout batches queue until confirmation
- contract failure: payout retried without mutating off-chain reward records
- chain reorganization: reconciliation service detects and re-submits affected transactions
Reconciliation
The system must detect and handle:
- duplicate submissions (idempotent deduplication)
- partial downstream effects (incomplete payout batches)
- missing confirmations (payout submitted but not confirmed)
- reconciliation drift (off-chain reward records vs on-chain payout state)
- epoch-window accounting inconsistencies
- privacy budget two-phase drift: because budget reservation is initiated by master nodes while the authoritative ledger lives on enterprise nodes, the reserve → commit → release sequence is a distributed transaction. SCS runs a periodic reconciliation between master-side task state and enterprise-side Privacy Budget Ledger entries and detects: (a) reservations on master with no matching enterprise reserve, (b) enterprise reserves with no task to commit or release them, (c) committed enterprise entries whose task was rejected or timed out on master, (d) released master reservations that were never released on the enterprise ledger. Every reserve/commit/release event is linked into the task evidence chain by task_id and request_id, so the reconciliation is deterministic
- SettlementRoot anchoring drift: the on-chain anchored SettlementRoot must match the off-chain finalized settlement context; any mismatch triggers a re-anchor through the Payout Service without mutating the settlement context
Corrections are always explicit through adjustment records, never through silent mutation.
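The privacy-budget reconciliation pass described above can be sketched as follows. Record shapes are illustrative assumptions: both sides are reduced to a map from (task_id, request_id) to the latest recorded phase, which is exactly the evidence-chain linkage that makes the comparison deterministic.

```python
def reconcile(master: dict, enterprise: dict, master_status: dict) -> dict:
    """master/enterprise map (task_id, request_id) -> latest phase
    ('reserve' | 'commit' | 'release'); master_status maps task_id -> state."""
    drift = {"a": [], "b": [], "c": [], "d": []}
    for key, phase in master.items():
        if phase == "reserve" and key not in enterprise:
            drift["a"].append(key)   # (a) no matching enterprise reserve
        if phase == "release" and enterprise.get(key) != "release":
            drift["d"].append(key)   # (d) never released on the enterprise ledger
    for key, phase in enterprise.items():
        if key not in master:
            drift["b"].append(key)   # (b) orphan enterprise entry, no task to settle it
        elif phase == "commit" and master_status.get(key[0]) in ("rejected", "timed_out"):
            drift["c"].append(key)   # (c) committed entry for a rejected/timed-out task
    return drift                     # resolved via explicit adjustment records
```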
Upgrade and Migration
- service upgrades on master nodes: rolling upgrade with one node at a time, maintaining 2-of-3 consensus throughout
- smart contract upgrades: through governance proposal, with upgrade proxy pattern on Aptos
- protocol version transitions: at epoch boundaries, with one-epoch advance notice
- enterprise node software upgrades: coordinated with master cluster, with grace period for version compatibility
- TEE enclave measurement upgrades: when an aggregation or execution enclave image changes, the new measurement hash must be registered in the governance registry through a governance proposal before any task may be dispatched to it. During the transition, old and new measurements may be temporarily allowed in parallel so that in-flight tasks complete under the measurement they were admitted with; tasks admitted after the cutover use only the new measurement. Revocation of a measurement (for example, after a side-channel disclosure) blocks new dispatch immediately while still allowing challenge evaluation of already-completed tasks, per SCP's TEE failure handling rules
Part 8: Implementation Assurance
Required System Capabilities
SCS must support at least:
- Replay: execution and verification artifacts can be replayed to reproduce the same result under the same inputs
- Settlement-root reproducibility: the SettlementRoot can be independently computed from the same settlement context (see the sketch after this list)
- Semantic-resolution auditability: the semantic resolution path for any task can be reconstructed
- Epoch-window reconstruction: the exact membership of any epoch can be reproduced
- Reward-accounting reconciliation: reward records can be reconciled against on-chain payout records
- Privacy budget auditability: per-subject budget consumption can be audited through the task evidence chain
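As one way to satisfy settlement-root reproducibility, here is a sketch assuming the root is a SHA-256 Merkle root over canonically serialized settlement entries; the actual tree layout and serialization are defined by SCP, not by this sketch.

```python
import hashlib, json

def leaf(entry: dict) -> bytes:
    canonical = json.dumps(entry, sort_keys=True, separators=(",", ":")).encode()
    return hashlib.sha256(canonical).digest()

def settlement_root(entries: list) -> str:
    """Recompute the root from the same finalized settlement context."""
    if not entries:
        raise ValueError("empty settlement context")
    level = [leaf(e) for e in entries]
    while len(level) > 1:
        if len(level) % 2:   # duplicate the last node on odd-sized levels
            level.append(level[-1])
        level = [hashlib.sha256(level[i] + level[i + 1]).digest()
                 for i in range(0, len(level), 2)]
    return level[0].hex()
```

Because the construction is deterministic, any party holding the finalized settlement context can recompute the root and compare it against the hash anchored on Aptos.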
Contract Packaging
The Markdown specification set remains the canonical semantic source.
SCS must additionally produce machine-readable implementation artifacts:
- API schemas (OpenAPI/gRPC protobuf) for all external and internal interfaces
- event schemas for all protocol events (task transitions, settlement, rewards, penalties)
- version annotations for schema_version, semantic_version, and policy_version
- error catalogs mapping protocol error families to HTTP/gRPC status codes
- smart contract ABIs for all Aptos contracts
- TEE enclave measurement hashes for all approved computation code
Relationship to SCP
This document describes implementation realization only.
For the normative meaning of protocol architecture, task lifecycle, semantic model, data protection, settlement, and economics, see: