Sokrates Roadmap

From current implementation to full product vision. Organized by dependency chain, not calendar.

See also: Sokrates Implementation Status for what’s shipped today. Technical Architecture Whitepaper for the full vision. Product Description and Feature Set for the product-level spec.


How to Read This

Epics are grouped into phases. Phases have dependency relationships — Phase 1 work generally depends on Phase 0 foundations. Within a phase, epics are independent unless noted. Each epic links to the wiki page(s) that define the target architecture and to the code location where implementation will land.


Phase 0: Foundational Infrastructure

Prerequisites that unlock everything downstream. These are the load-bearing gaps between “what’s built” and “what the product needs.”

E0.1: DMCG Schema Ingestion Pipeline

The single highest-leverage unbuilt system. Everything in the product vision — automated data hygiene, schema healing, zero-configuration integration — depends on this.

WhatOpenAPI spec in, typed Hyle BaseNode subclasses out, registered in NodeRegistry, queryable immediately
Wiki refsTechnical Architecture Whitepaper SS2, Hyle Graph-Native ORM with Dynamic Schema Registry, Hyle Design Lineage (Grimoire to Hyle)
Code targeteidos/src/eidos/infrastructure/schema_ingestion/ (new)
Depends onNothing — BaseNode, HyleMeta, NodeRegistry, @node decorator are all shipped
UnlocksE0.2 (schema healing), E1.1 (customer onboarding), E2.1 (semantic enrichment), E3.2 (fleet basis learning)

Scope:

  1. Add datamodel-code-generator dependency to eidos/pyproject.toml
  2. Build ingestion service: accepts OpenAPI spec (string, URL, or file path) calls DMCG Python API with --base-class hyle.BaseNode flags writes module to disk with version-namespaced path
  3. Build loader: importlib loads generated module HyleMeta fires during class creation types auto-register in NodeRegistry
  4. Build API endpoint: POST /ingest accepts OpenAPI spec, returns registered type names and count
  5. Integration test: round-trip from OpenAPI spec registered types Neo4j persistence query via unified DSL

E0.2: Schema Healing (Hot-Swap)

Makes DMCG ingestion continuous rather than one-shot.

WhatDetect schema changes, regenerate models, hot-swap via importlib.reload(), update Neo4j constraints
Wiki refsTechnical Architecture Whitepaper SS2.3, Hyle Graph-Native ORM with Dynamic Schema Registry
Code targeteidos/src/eidos/infrastructure/schema_ingestion/healing.py (new)
Depends onE0.1 (DMCG pipeline)
UnlocksSelf-healing data integration — the core product promise

Scope:

  1. Schema change detection: poll or webhook for OpenAPI spec changes, diff against stored spec hash
  2. Version-namespaced regeneration: hyle.generated.customer_v{N+1} module naming
  3. Hot-swap: remove old module from sys.modules, load new via importlib, HyleMeta re-registers
  4. Old version preserved on disk for audit and rollback

E0.3: Eidos Observer Wiring

Connect the NodeRegistry.on_register() hooks to actual Eidos infrastructure.

WhatWhen a new node type registers, Eidos auto-creates Neo4j constraints/indexes, updates schema cache, optionally triggers relationship inference
Wiki refsTechnical Architecture Whitepaper SS1.2
Code targeteidos/src/eidos/main.py (lifespan), eidos/src/eidos/infrastructure/neo4j/
Depends onNothing (hook mechanism is shipped in hyle.py:74)
UnlocksE0.1 becomes truly zero-configuration (types register and Neo4j is ready automatically)

Scope:

  1. In Eidos lifespan startup, subscribe to _default_registry.on_register()
  2. Callback creates Neo4j unique constraint on id for the new label
  3. Callback creates any _graph_indexes declared by the @node decorator
  4. Callback updates an in-memory schema cache for the unified query DSL
  5. If the type has an embedding field, ensure it’s included in the vector index

E0.4: Curator End-to-End Pipeline

Wire the existing curator agent into a fully operational loop.

WhatCron-triggered curator runs CurationReport Telegram/Discord approval action execution
Wiki refsEidos Curator Agent and Hyle Ontological Framework
Code targetagents/src/agents/curator/, hermes-plugin/sokrates/, Hermes cron config
Depends onNothing (curator agent, models, EidosClient, and Hermes plugin all exist)
UnlocksAutonomous graph maintenance — the Sókrates Agent can trust Eidos data quality

Scope:

  1. Configure Hermes cron job to invoke run_curator daily
  2. Surface CurationReport.summary via Hermes channel (Telegram/Discord)
  3. Implement approval flow: human replies approve/reject proposals
  4. Build action executor: executes approved merges, archives, deletions via Eidos API
  5. Log all actions as Observation nodes in the graph (audit trail)

Phase 1: Product Launch Readiness

Features needed for first customer engagement. Assumes Phase 0 foundations are in place.

E1.1: Customer Onboarding Pipeline

The end-to-end flow for connecting a new customer’s systems.

WhatAutomated flow: customer provides API credentials DMCG ingests their schemas Eidos indexes Sokrates Agent has a queryable graph of their operations
Wiki refsProduct Description and Feature Set SS4.1, Go-to-Market Strategy SS6.1
Code targeteidos/src/eidos/services/onboarding/ (new)
Depends onE0.1 (DMCG), E0.3 (observer wiring)
UnlocksFirst customer deployment, EDIH-IS trial, the “14-day audit” GTM motion

E1.2: Inefficiency Invoice (Monthly Value Report)

Automated monthly report quantifying what the Sokrates Agent found and fixed.

WhatMonthly report: structural friction discovered, estimated cost, remediation status. PDF or structured markdown, delivered via Hermes channel.
Wiki refsGemini Deep Think analysis (scratchpad.md), Revenue Model and Pricing (churn prevention)
Code targetagents/src/agents/reporter/ (new)
Depends onE0.4 (curator producing observations), metalayer or at minimum basic topology queries
UnlocksChurn prevention, value demonstration to CEO buyer, expansion revenue conversations

Scope:

  1. Monthly aggregation of curator observations, anomalies, and remediation actions
  2. Cost estimation heuristics (configurable per customer: average hourly rates, workflow frequencies)
  3. Template-driven report generation (structured markdown PDF via weasyprint or similar)
  4. Delivery via Hermes channel to customer’s designated contact
  5. Historical report storage as Observation nodes in customer’s Eidos graph

E1.3: Auto-Generated Claude Skills from Operational Topology

The Sokrates Agent discovers workflows and writes Claude Skills that encode them.

WhatAgent observes operational topology via Eidos graph generates SKILL.md files deploys to customer’s Claude workspace
Wiki refsProduct Description and Feature Set (plugin design mode), Gemini Deep Think analysis
Code targetagents/src/agents/skill_generator/ (new)
Depends onE0.1 (DMCG — needs typed nodes to reason about), E0.4 (curator — needs clean graph)
UnlocksThe core product loop: Sokrates Agent discovers friction generates automation customer uses it in Claude

E1.4: Hardware Tiering (Standard / Sovereign)

Two deployment targets based on sovereignty needs.

WhatStandard (GMKtec/Beelink, API-routed inference) and Sovereign (DGX Spark, fully local inference) NixOS configurations
Wiki refsTechnical Architecture Whitepaper SS5, Gemini Deep Think analysis
Code targetnixos/ (extend existing configs)
Depends onE2.2 for Sovereign tier (Gemma must run locally); Standard tier depends on nothing new
UnlocksBroader addressable market — sovereignty-sensitive and cost-sensitive customers both served

Note: Standard tier is essentially what’s already shipped (GMKtec dev box + API inference). Sovereign tier requires the Gemma integration work in Phase 2.


Phase 2: Intelligence Layer

On-device inference, semantic enrichment, and the metalayer. The transition from “knowledge graph with connectors” to “autonomous organizational intelligence.”

E2.1: Gemma 4 Semantic Enrichment

On-device model for schema enrichment, ontological classification, and entity resolution.

WhatGemma 4 31B Dense model running locally on DGX Spark, performing semantic field enrichment, auto-classification into four primitives, and cross-source entity resolution
Wiki refsTechnical Architecture Whitepaper SS3
Code targeteidos/src/eidos/infrastructure/enrichment/ (new)
Depends onE0.1 (DMCG — needs generated types to enrich), DGX Spark hardware
UnlocksE2.3 (metalayer — needs enriched types), fully automated onboarding

E2.2: Local Inference Stack (DGX Spark)

NixOS configuration for running Gemma on the DGX Spark.

WhatNixOS module for DGX Spark hardware, model serving (vLLM or similar), integration with Eidos
Wiki refsTechnical Architecture Whitepaper SS5.1-5.2
Code targetnixos/modules/, nixos/hardware/
Depends onDGX Spark hardware availability
UnlocksE1.4 Sovereign tier, fully sovereign deployment, on-device LoRA fine-tuning

E2.3: Hypergraph Metalayer (MVP)

Generating queries that compute organizational topology.

WhatNamed queries whose result sets define hyperedge membership. Materialized/virtual distinction. Basic stratification.
Wiki refsTechnical Architecture Whitepaper SS4, Hyle Graph-Native ORM with Dynamic Schema Registry (hypergraph semantics)
Code targeteidos/src/eidos/domain/metalayer/ (new)
Depends onE0.1 (DMCG — needs ground facts), E2.1 (enrichment — needs classified nodes) for full value; can prototype against existing hand-crafted nodes
UnlocksE1.2 (Inefficiency Invoice — topology-driven cost estimation), the “always-on organizational intelligence” product promise

Scope:

  1. Metalayer DSL: define generating queries with MATERIALIZED/VIRTUAL annotation
  2. Evaluation engine: execute generating queries, cache materialized results, recompute virtual on demand
  3. Stratification: prevent circular dependencies between generating queries
  4. Integration with unified query DSL: hyperedge results queryable through existing /query endpoint
  5. Prototype differential evaluation (recompute only affected hyperedges when ground facts change)

E2.4: GQL Query Builder

Portable query compilation targeting both Cypher and GQL.

WhatAST layer in the query builder that serializes to Cypher (Neo4j) or GQL (ISO 39075) via dialect drivers
Wiki refsTechnical Architecture Whitepaper SS4.6, Hyle Graph-Native ORM with Dynamic Schema Registry
Code targeteidos/src/eidos/infrastructure/neo4j/query_builder/ (extend existing)
Depends onNothing — existing Cypher builder provides the foundation
UnlocksMicrosoft Fabric portability, future database migration flexibility

Phase 3: Scale and Fleet Intelligence

Cross-deployment learning, fleet management, and the compounding flywheel.

E3.1: Self-Evolution Harness

DSPy/GEPA integration for systematic skill and prompt optimization.

WhatWrap Hermes skills as DSPy modules, apply GEPA optimization, evaluate against real session history, deploy improved versions
Wiki refsTechnical Architecture Whitepaper SS5.3
Code targetagents/src/agents/evolution/ (new)
Depends onSufficient session history from real customer engagements
UnlocksAutonomous agent improvement, metalayer query evolution

E3.2: Cross-Fleet Basis Learning

Pattern sharing across deployments without sharing customer data.

WhatStructural intelligence (schema patterns, ontological classifications, workflow templates) extracted from customer deployments and aggregated into the shared basis
Wiki refsTechnical Architecture Whitepaper SS9, Revenue Model and Pricing (basis as moat)
Code targetFleet management infrastructure (TBD)
Depends onE2.1 (enrichment — patterns must be abstracted from customer data), multiple live deployments
UnlocksThe flywheel: each deployment makes the next faster and more accurate

E3.3: Fleet Command (DGX Station GB300)

Central infrastructure for cross-fleet model training and basis aggregation.

WhatCentral DGX Station running larger model, producing improved LoRA adapters, distributing basis updates to edge boxes
Wiki refsTechnical Architecture Whitepaper SS5.6
Depends onFleet scale (30+ deployments), E3.2 (basis learning)
UnlocksTrillion-parameter MoE for complex inference, centralized training

Research Track

No committed timeline. Informs future architecture but no dependency chain leads here.

TopicWiki RefNotes
Cognitive architecture / strange loopTechnical Architecture Whitepaper SS5.4Patterns observing and selecting for better patterns
Metacognitive daemonsTechnical Architecture Whitepaper SS5.4Failure-mode detection from execution traces
Negative space compressionTechnical Architecture Whitepaper SS5.3Efficiently eliminating unproductive regions of idea-space
Riemannian Intent ManifoldRiemannian Intent Manifold From Philosophy to Empirical Research ProgramMathematical framework for intent-space geometry

Dependency Graph

Phase 0 (Foundation)
  E0.1 DMCG Pipeline ─────────┬──→ E0.2 Schema Healing
  E0.3 Observer Wiring ────┐  │
  E0.4 Curator Pipeline    │  │
                           │  │
Phase 1 (Product Launch)   │  │
  E1.1 Onboarding ←───────┴──┘
  E1.2 Inefficiency Invoice ←── E0.4
  E1.3 Auto-Generated Skills ←── E0.1, E0.4
  E1.4 Hardware Tiering (Standard = ready; Sovereign = needs E2.2)

Phase 2 (Intelligence)
  E2.1 Gemma Enrichment ←── E0.1, DGX Spark hardware
  E2.2 Local Inference Stack ←── DGX Spark hardware
  E2.3 Metalayer MVP ←── E0.1, E2.1 (or prototype without)
  E2.4 GQL Query Builder ←── nothing (extend existing)

Phase 3 (Scale)
  E3.1 Self-Evolution ←── session history from live deployments
  E3.2 Fleet Basis Learning ←── E2.1, multiple live deployments
  E3.3 Fleet Command ←── E3.2, fleet scale

Critical Path

The fastest route to first customer deployment:

  1. E0.1 DMCG Pipeline + E0.3 Observer Wiring (parallel)
  2. E0.4 Curator Pipeline (parallel with above — no dependencies)
  3. E1.1 Customer Onboarding (depends on E0.1 + E0.3)
  4. E1.2 Inefficiency Invoice + E1.3 Auto-Generated Skills (parallel, depend on E0.1 + E0.4)

Everything else is valuable but not on the critical path to first revenue.