Skip to content

Releases: ThirdKeyAI/Symbiont

v0.4.0

28 Aug 20:42
Compare
Choose a tag to compare

[0.4.0] - 2025-08-28

Added

🧠 SLM-First Architecture (New)

  • Policy-Driven Routing Engine: Intelligent routing between Small Language Models (SLMs) and Large Language Models (LLMs)
  • Task Classification System: Automatic categorization of requests for optimal model selection
    • Task-aware routing with capability matching
    • Pattern recognition and keyword analysis for task classification
  • Confidence-Based Quality Control: Adaptive learning system for model performance tracking
    • crates/runtime/src/routing/confidence.rs: Confidence monitoring and threshold management
    • Real-time quality assessment with configurable confidence thresholds
    • Automatic fallback on low-confidence responses

⚡ Performance & Reliability

  • Thread-Safe Operations: Full async/await support with proper concurrency handling
  • Error Recovery: Graceful fallback mechanisms with exponential backoff retry logic
  • Runtime Configuration: Dynamic policy updates and threshold adjustments without restart
  • Comprehensive Logging: Detailed audit trail of routing decisions and performance metrics

Improved

Routing & Model Management

  • Model Catalog Integration: Deep integration with existing model catalog for SLM selection
  • Resource Management: Intelligent resource allocation and constraint handling
  • Load Balancing: Multiple strategies for distributing requests across available models
  • Scheduler Integration: Seamless integration with the existing agent scheduler

Developer Experience

  • Comprehensive Testing: Complete test coverage for all routing components with mock implementations
  • Documentation: Extensive design documents and implementation guides
  • Configuration Validation: Enhanced validation of routing policies and model configurations

Fixed

  • Module Exports: Fixed routing module structure in crates/runtime/src/routing/mod.rs
    • Added missing pub mod config; and pub mod policy; declarations
    • Added corresponding pub use statements for proper re-exports
  • Task Type Updates: Replaced deprecated TaskType::TextGeneration with TaskType::CodeGeneration
    • Updated routing engine references throughout codebase
    • Fixed task type usage in test modules and policy evaluation
  • Import Resolution: Resolved compilation errors in routing components
    • Updated ModelLogger constructor calls to match current API
    • Fixed import paths in test modules for proper dependency resolution
  • Code Quality: Applied clippy suggestions and resolved all warnings
    • Improved code patterns and removed unused imports
    • Enhanced error handling and async operation safety

Performance Improvements

  • Routing Throughput: Optimized routing decision performance with efficient policy evaluation
  • Memory Efficiency: Reduced memory overhead in confidence monitoring and statistics tracking
  • Async Operations: Enhanced async runtime efficiency for concurrent request handling
  • Configuration Loading: Optimized configuration parsing and validation performance

Breaking Changes

  • Routing API: New routing engine interface with SLM-first architecture
  • Task Classification: Updated task type enumeration with CodeGeneration replacing TextGeneration
  • Configuration Schema: Enhanced routing configuration structure with policy-driven settings

v0.3.1

10 Aug 21:27
Compare
Choose a tag to compare

Added

🔒 Security Enhancements

  • Centralized Configuration Management: New config.rs module for secure configuration handling
    • Environment variable abstraction layer with validation
    • Multiple secret key providers (environment, file, external services)
    • Centralized configuration access patterns
  • Enhanced CI/CD Security: Automated security scanning in GitHub Actions
    • Daily cargo audit vulnerability scanning
    • Clippy security lints integration
    • Secret leak detection in build pipeline

📋 API Documentation

  • SwaggerUI Integration: Interactive API documentation for HTTP endpoints
    • Auto-generated OpenAPI specifications
    • Interactive API testing interface
    • Complete endpoint documentation with examples

Security Fixes

🛡️ Critical Vulnerability Resolutions

  • RUSTSEC-2022-0093: Fixed ed25519-dalek Double Public Key Signing Oracle Attack
    • Updated from v1.0.1 → v2.2.0
  • RUSTSEC-2024-0344: Resolved curve25519-dalek timing variability vulnerability
    • Updated from v3.2.0 → v4.1.3 (transitive dependency)
  • RUSTSEC-2025-0009: Fixed ring AES panic vulnerability
    • Updated from v0.16 → v0.17.12
  • Timing Attack Prevention: Implemented constant-time token comparison
    • Replaced vulnerable string comparison in authentication middleware
    • Added subtle crate for constant-time operations
    • Enhanced authentication logging and error handling

Improved

Configuration Management

  • Environment Variable Security: Eliminated direct env::var usage throughout codebase
  • Secret Handling: Secure configuration management with validation
  • Error Handling: Enhanced configuration error reporting and validation

Authentication & Security

  • Middleware Security: Updated authentication middleware to use configuration management
  • Request Logging: Enhanced security logging for authentication failures
  • Token Validation: Improved bearer token validation with timing attack prevention

Dependencies

Security Updates

  • Updated: ed25519-dalek from v1.0.1 to v2.2.0 (critical security fix)
  • Updated: reqwest from v0.11 to v0.12 (security and performance)
  • Updated: ring from v0.16 to v0.17.12 (AES panic fix)
  • Added: subtle v2.5 for constant-time cryptographic operations

Documentation & Tooling

  • Added: utoipa and utoipa-swagger-ui for API documentation generation
  • Enhanced: CI/CD security workflow with automated vulnerability scanning

v0.3.0

09 Aug 02:50
Compare
Choose a tag to compare

Added

🚀 HTTP API Server (New)

  • Complete API Server: Full-featured HTTP server implementation using Axum framework
    • RESTful endpoints for agent management, execution, and monitoring
    • Authentication middleware with bearer token and JWT support
    • CORS support and comprehensive security headers
    • Request tracing and structured logging
    • Graceful shutdown with active request completion
  • Agent Management API: Create, update, delete, and monitor agents via HTTP
    • Agent status tracking with real-time metrics
    • Agent execution history and performance data
    • Agent configuration updates without restart
  • System Monitoring: Health checks, metrics collection, and system status endpoints
    • Real-time system resource utilization
    • Agent scheduler statistics and performance metrics
    • Comprehensive health check with component status

🧠 Advanced Context & Knowledge Management (New)

  • Hierarchical Memory System: Multi-layered memory architecture for agents
    • Working Memory: Variables, active goals, attention focus for immediate processing
    • Short-term Memory: Recent experiences and temporary information
    • Long-term Memory: Persistent knowledge and learned experiences
    • Episodic Memory: Structured experience episodes with events and outcomes
    • Semantic Memory: Concept relationships and domain knowledge graphs
  • Knowledge Base Operations: Comprehensive knowledge management capabilities
    • Facts: Subject-predicate-object knowledge with confidence scoring
    • Procedures: Step-by-step procedural knowledge with error handling
    • Patterns: Learned behavioral patterns with occurrence tracking
    • Knowledge Sharing: Inter-agent knowledge sharing with trust scoring
  • Context Persistence: File-based and configurable storage backend
    • Automatic context archiving and retention policies
    • Compression and encryption support for sensitive data
    • Migration utilities for legacy storage formats
  • Vector Database Integration: Semantic search and similarity matching
    • Qdrant integration for high-performance vector operations
    • Embedding generation and storage for context items
    • Batch operations for efficient data processing
  • Context Examples: Comprehensive context_example.rs demonstration

⚡ Production-Grade Agent Scheduler (New)

  • Priority-Based Scheduling: Multi-level priority queue with resource-aware scheduling
    • Configurable priority levels and scheduling algorithms
    • Resource requirements tracking and allocation
    • Load balancing with multiple strategies (round-robin, resource-based)
  • Task Management: Complete lifecycle management for agent tasks
    • Task health monitoring and failure detection
    • Automatic retry logic with exponential backoff
    • Timeout handling and graceful termination
  • System Monitoring: Real-time scheduler metrics and health monitoring
    • Agent performance tracking (CPU, memory, execution time)
    • System capacity monitoring and utilization alerts
    • Comprehensive scheduler statistics and dashboards
  • Graceful Shutdown: Production-ready shutdown with active task completion
    • Resource cleanup and allocation tracking
    • Metrics persistence and system state preservation
    • Configurable shutdown timeouts and force termination

📊 Enhanced Documentation & Examples

  • Production Examples: Real-world usage patterns and best practices
    • RAG engine integration with rag_example.rs
    • Context persistence and management workflows
    • Agent lifecycle and resource management
  • API Reference: Complete HTTP API documentation with examples
    • OpenAPI-compatible endpoint specifications
    • Authentication and authorization guides
    • Integration examples for common use cases

Improved

Runtime Stability & Performance

  • Memory Management: Optimized memory usage with configurable limits
  • Error Handling: Enhanced error propagation and recovery mechanisms
  • Async Performance: Improved async runtime efficiency and task scheduling
  • Resource Utilization: Better CPU and memory resource management

Configuration & Deployment

  • Feature Flags: Granular feature control for different deployment scenarios
    • http-api: HTTP server and API endpoints
    • http-input: Webhook input processing
    • vector-db: Vector database integration
    • embedding-models: Local embedding model support
  • Directory Structure: Standardized data directory layout
    • Separate directories for state, logs, prompts, and vector data
    • Automatic directory creation and permission management
    • Legacy migration utilities for existing deployments

Developer Experience

  • Examples: Comprehensive example implementations for all major features
  • Testing: Enhanced test coverage with integration tests
  • Logging: Structured logging with configurable verbosity levels
  • Debugging: Improved debugging capabilities with detailed metrics

Fixed

  • Scheduler Deadlocks: Resolved potential deadlock conditions in agent scheduling
  • Memory Leaks: Fixed memory leaks in context management and vector operations
  • Graceful Shutdown: Improved shutdown reliability under high load
  • Configuration Validation: Enhanced validation of configuration parameters
  • Error Recovery: Better error recovery in network and storage operations

Dependencies

  • Added: Axum 0.7 for HTTP server implementation
  • Added: Tower and Tower-HTTP for middleware and CORS support
  • Added: Governor for rate limiting capabilities
  • Added: Qdrant-client 1.14.0 for vector database operations
  • Updated: Tokio async runtime optimizations
  • Updated: Enhanced serialization with serde improvements

Breaking Changes

  • Context API: Updated context management API with hierarchical memory model
  • Scheduler Interface: New scheduler trait with enhanced lifecycle management
  • Configuration Format: Updated configuration structure for directory management

Performance Improvements

  • Scheduler Throughput: Up to 10x improvement in agent scheduling performance
  • Memory Efficiency: 40% reduction in memory usage for large context operations
  • Vector Search: Optimized vector database operations with batch processing
  • HTTP Response Time: Sub-100ms response times for standard API operations

Security Enhancements

  • Authentication: Multi-factor authentication support for HTTP API
  • Encryption: Enhanced encryption for data at rest and in transit
  • Access Control: Improved permission management for context operations
  • Data Protection: Secure handling of sensitive agent data and configurations

Installation

Docker

docker pull ghcr.io/thirdkeyai/symbi:v0.3.0

Cargo (with all features)

cargo install symbi-runtime --features full

Cargo (minimal installation)

cargo install symbi-runtime --features minimal

From Source

git clone https://github.com/thirdkeyai/symbiont.git
cd symbiont
git checkout v0.3.0
cargo build --release --features full

Quick Start - HTTP API

use symbi_runtime::api::{HttpApiServer, HttpApiConfig};

let config = HttpApiConfig {
    bind_address: "0.0.0.0".to_string(),
    port: 8080,
    enable_cors: true,
    enable_tracing: true,
};

let server = HttpApiServer::new(config);
server.start().await?;

Quick Start - Context Management

use symbi_runtime::context::{StandardContextManager, ContextManagerConfig};

let config = ContextManagerConfig {
    max_contexts_in_memory: 1000,
    enable_auto_archiving: true,
    enable_vector_db: true,
    ..Default::default()
};

let context_manager = StandardContextManager::new(config, "system").await?;
let session_id = context_manager.create_session(agent_id).await?;

Full Changes: v0.1.2...v0.3.0

v0.1.2

31 Jul 21:50
Compare
Choose a tag to compare

Release Notes - v0.1.2

🎯 HTTP Input Module (New)

Webhook Server: Full HTTP server implementation for triggering agents via webhooks
Authentication: Bearer token and JWT authentication support
Request Routing: Flexible routing rules to direct requests to specific agents
CORS & Security: Built-in CORS support, request size limits, and audit logging
Concurrency Control: Configurable rate limiting and concurrent request handling

🤖 Agent Examples & Documentation

Webhook Handler Agent: Complete webhook_handler.dsl example for security alert processing
HTTP Input Guide: Comprehensive documentation at docs/http-input.md
Integration Examples: Ready-to-use configuration examples for GitHub, Slack, and Stripe webhooks

🧪 Testing & Quality

Integration Tests: Full HTTP input test suite with webhook simulation
Configuration Testing: Comprehensive testing of authentication, CORS, and routing
Error Handling: Robust error handling with appropriate HTTP status codes

🔧 Infrastructure Improvements

OSS Sync Enhancement: Added agents/ directory to GitHub release pipeline
Example Distribution: Agent examples now included in open-source releases
Documentation Updates: Enhanced API reference and getting started guides

📦 Installation

# Docker
docker pull ghcr.io/thirdkeyai/symbi:v0.1.2

# Cargo (with HTTP input feature)
cargo install symbiont-runtime --features http-input

🚀 Quick Start - HTTP Input

use symbiont_runtime::http_input::{HttpInputConfig, start_http_input};

let config = HttpInputConfig {
    port: 8081,
    path: "/webhook",
    auth_header: Some("Bearer your-token".to_string()),
    ..Default::default()
};

start_http_input(config, None, None).await?;

Full Changes: v0.1.1...v0.1.2

v0.1.1

26 Jul 22:03
Compare
Choose a tag to compare

The comprehensive secure secrets management system has been successfully implemented with:

Core New Features:

  • Vault compatibility with OpenBao or Hashicorp Vault
  • Encrypted file backend using AES-256-GCM encryption
  • CLI subcommands for encrypt/decrypt/edit operations via symbiont secrets
  • OS keychain integration for secure master key storage
  • Comprehensive auditing and configuration system