Community & Releases

Release Changelog

Release tags mapping completed features and visual layouts.

#Changelog — ObservabilityOS

All notable changes to the ObservabilityOS project are documented here, grouped by release tags.

#[v1.4.1] — AI Gateway Consolidation & Dynamic Routing

#Added

  • Third-Party AI Gateway Integration: Added support for the AICredits gateway endpoint (https://aicredits.in/v1/chat/completions) as the primary LLM provider.
  • Dynamic Model Selection: Enabled setting and failing over custom models dynamically via process.env.AICREDITS_MODEL, with standard defaults for Claude 3.5 Haiku and GPT-4o mini.
  • Sandbox Protection & Cost Safeguard: Integrated automatic playground/sandbox detection via trace_playground_ trace IDs to bypass external LLM calls and fallback directly to Local Mock Heuristics.
  • Plan-based Limit Controls: Enforced Free Developer plan limits by automatically bypassing live LLM calls and defaulting to Local Mock Heuristics for projects on the free tier.
  • API Health Check Reporting: Integrated AICredits active status checks inside /api/health.

#[v1.4.0] — SRE Resilience & Fault Tolerance

#Added

  • Resilience Core Utilities (resilience.ts): Implemented zero-dependency timeout control, exponential backoff retries with jitter, and a stateful CircuitBreaker pattern supporting CLOSED, OPEN, and HALF_OPEN states.
  • Central API Health Route (/api/health): Created SRE health endpoint returning JSON health status reports for MongoDB, Redis, and configured AI services.
  • React Client Error Boundaries: Integrated custom error boundary overlays at root and dashboard levels, preventing page blanking and facilitating client-side retries.

#Changed

  • Database Hardening: Refactored Mongoose connection module to use database connection pooling (maxPoolSize: 10, minPoolSize: 2), custom connect/socket timeouts, automatic retries, and graceful process shutdown listeners.
  • Redis Failover Layer: Refactored caching module to monitor connection states and transparently fall back to local in-memory Map cache layers during Redis outages.
  • AI Integration Safeguards: Configured provider failover chains (Anthropic ➡️ OpenAI ➡️ Mock) wrapped in timeouts and rate limit retries, along with regex JSON validation parsing.
  • Alert Webhook Circuit Breaker: Wrapped Slack, Discord, and Teams notification calls in circuit breakers to avoid blocking event loops.
  • In-Memory Rate Limiting Fallback: Modified rate limiting validation to drop to a local in-memory sliding-window token bucket when Redis is down.

#[v1.3.0] — Documentation Platform, Licensing & SEO Optimization

#Added

  • Documentation Portal (apps/docs): Built a standalone statically-rendered Next.js documentation platform with dynamic routing support.
  • Marked Markdown Compiler Link Resolution: Developed custom marked compiler extensions rewriting relative markdown and licensing references to target web paths or remote GitHub links dynamically.
  • Hybrid Licensing Framework: Standardized commercial and open-source boundaries, releasing packages/sdk under MIT and the core platform (apps/web, apps/worker) under a custom Source Available license.
  • Advanced SEO / Discoverability: Programmed sitemap generators, robots configurations, meta metadata, and JSON-LD structured schemas to facilitate search ranking and AI crawler discoverability.
  • Branding Refinements: Added heartbeat-style favicon, app icons, and OpenGraph/Twitter social media preview visuals for both platforms.

#Fixed

  • Layout Scroll Correction: Resolved Next.js client-side layout jumping warnings by embedding scroll attributes.
  • Scroll & Navigation Overflows: Hardened header containment heights, mobile responsiveness layouts, and container scroll containment.

#[v1.2.0] — Security Hardening, Rate Limiting & PII Scrubbing

#Added

  • PII Redaction Scrubber: Recursive local Winston/Pino logger data scrubbing (scrubber.ts) for authorization, tokens, email addresses, credit cards, and MongoDB connection URIs.
  • Redis rate limiter: sliding-window rate limit configuration on public ingest API paths (/api/ingest, /api/metrics/ingest).
  • SSE log streaming: Live stdout/stream logging route (/api/logs/stream) with dashboard toggle selectors.
  • SHA-256 API Hashing: Secured API keys inside MongoDB via SHA-256 validation.

#Changed

  • Centralized Auth: Consolidated user session loading in getAuthenticatedUser() helper function to prevent token redundancy.

#[v1.1.0] — UI Refactoring & shadcn UI Integration

#Added

  • shadcn/ui Integration: Replaced raw HTML/CSS inputs, dropdowns, and button containers with Radix primitives (Button, Card, Switch, Input, Select).
  • Tailwind CSS v4 CSS variables: Refactored styling tokens into Tailwind CSS variables inside globals.css.

#[v1.0.0] — Production search indexes, Cache, & Core Ingest

#Added

  • Lucene Full-text Search: Lucene-backed MongoDB Atlas Search index configuration for rapid log query performance.
  • Redis Metrics Cache: Caches dashboard aggregates and CPU/RAM/Latency aggregates, invalidating cached data dynamically upon new anomaly detection triggers.

#[v0.9.0] — Data Exports, Saved Queries & Audit Trail

#Added

  • Log Exporter: CSV/JSON download streams on logs queries.
  • Saved Queries: Custom search console configurations pinned to the dashboard sidebar.
  • Administrative Audit Trail: Setting console mutation trails capturing SLO deletions and webhook edits.

#[v0.8.0] — Incident Collaboration & Post-Mortem Export

#Added

  • Incident Comments: Threaded commenting on active microservice incident cards.
  • Runbook Links: Links diagnostic alerts directly to runbooks and troubleshooting checklists.
  • Markdown Post-Mortem Exporter: One-click markdown export files for incident summaries.

#[v0.7.0] — SLO Alerting, Webhooks & Onboarding

#Added

  • SLO budgets: Dynamic calculations of error budgets entirely within index-based MongoDB counts.
  • Multi-channel Alerts: Dispatch alerts automatically to Slack, Discord, and Teams webhooks.
  • Onboarding Wizard: 3-step project onboarding wizard flow.