Monitoring

Health Check Endpoints

th0th provides multiple endpoints for monitoring system health and performance.

Basic Health Check

The simplest health check endpoint returns service status:

curl http://localhost:3333/health

{
  "status": "ok",
  "service": "th0th-tools-api",
  "version": "1.0.0",
  "timestamp": "2026-03-09T12:34:56.789Z"
}

This endpoint is used by Docker health checks and load balancers. It has minimal overhead and should respond within 100ms.

Comprehensive Health Check

For detailed local-first service health:

curl http://localhost:3333/api/v1/system/health/local

Sample Response

{
  "status": "healthy",
  "checks": {
    "ollama": {
      "status": "healthy",
      "latency": 45,
      "baseUrl": "http://localhost:11434",
      "model": "nomic-embed-text:latest"
    },
    "databases": {
      "status": "healthy",
      "vector": true,
      "cache": true,
      "keyword": true,
      "memory": true
    },
    "dataDirectory": {
      "status": "healthy",
      "path": "/home/user/.rlm",
      "writable": true,
      "diskSpace": "45GB"
    }
  },
  "recommendations": [],
  "timestamp": "2026-03-09T12:34:56.789Z"
}

Checks performed:

Ollama API connectivity and model availability
SQLite database file existence and accessibility
Data directory write permissions
Disk space availability

System Status

Get service-level status information:

curl http://localhost:3333/api/v1/system/status

{
  "status": "healthy",
  "services": {
    "memories": true,
    "vectorStore": true,
    "searchCache": true,
    "analytics": true,
    "keywordSearch": true,
    "embeddingCache": true
  },
  "timestamp": "2026-03-09T12:34:56.789Z"
}

If status is degraded, one or more critical databases are missing or inaccessible. Check the services object to identify the issue.

System Metrics

System Information

Get detailed system and resource information:

curl http://localhost:3333/api/v1/system/info

Sample Response

{
  "version": "1.0.0",
  "service": "th0th-tools-api",
  "node": "v20.11.0",
  "platform": "linux",
  "arch": "x64",
  "uptime": 3600,
  "memory": {
    "total": 16777216000,
    "free": 8388608000,
    "used": 8388608000,
    "process": {
      "rss": 157286400,
      "heapTotal": 52428800,
      "heapUsed": 41943040,
      "external": 2097152
    }
  },
  "dataDir": "/home/user/.rlm",
  "databases": {
    "sizes": {
      "memories.db": "2.3 MB",
      "vector-store.db": "456 MB",
      "search-cache.db": "89 MB",
      "search-analytics.db": "12 MB",
      "keyword-search.db": "234 MB",
      "embedding-cache.db": "178 MB"
    },
    "total": "971.3 MB",
    "totalBytes": 1018368000
  },
  "timestamp": "2026-03-09T12:34:56.789Z"
}

Key metrics:

uptime: Process uptime in seconds
memory.process: Node.js process memory usage
databases.total: Total disk space used by all databases

Performance Metrics

Get aggregated performance and usage metrics:

curl http://localhost:3333/api/v1/system/metrics

{
  "system": {
    "dataSize": "971.3 MB",
    "dataSizeBytes": 1018368000,
    "uptime": 3600,
    "memory": {
      "heapUsed": "40 MB",
      "heapTotal": "50 MB",
      "rss": "150 MB"
    }
  },
  "timestamp": "2026-03-09T12:34:56.789Z"
}

Ollama Status

Check Ollama service and available models:

curl http://localhost:3333/api/v1/system/ollama

{
  "status": "healthy",
  "latency": 42,
  "models": [
    "nomic-embed-text:latest",
    "bge-m3:latest",
    "llama3.2:latest"
  ],
  "configuredModel": "nomic-embed-text:latest",
  "baseUrl": "http://localhost:11434"
}

Analytics and Usage

Track search performance, cache efficiency, and usage patterns:

curl -X POST http://localhost:3333/api/v1/analytics \
  -H "Content-Type: application/json" \
  -d '{"type": "summary"}'

Analytics Types

curl -X POST http://localhost:3333/api/v1/analytics \
  -H "Content-Type: application/json" \
  -d '{"type": "summary"}'

Docker Health Checks

When running via Docker Compose, automatic health checks are configured:

healthcheck:
  test: ["CMD", "curl", "-sf", "http://localhost:3333/health"]
  interval: 30s
  timeout: 3s
  start_period: 40s
  retries: 3

Configuration:

interval: Check every 30 seconds
timeout: Fail if no response within 3 seconds
start_period: Allow 40 seconds for startup
retries: Mark unhealthy after 3 consecutive failures

View health status:

docker ps
# Look for "(healthy)" or "(unhealthy)" in STATUS column

Inspect health history:

docker inspect th0th-api | grep -A 20 Health

Monitoring Integrations

Prometheus

Export metrics for Prometheus scraping:

# prometheus.yml
scrape_configs:
  - job_name: 'th0th'
    static_configs:
      - targets: ['localhost:3333']
    metrics_path: '/api/v1/system/metrics'
    scrape_interval: 30s

Uptime Monitoring

Use services like UptimeRobot, Pingdom, or StatusCake:

Monitor URL: http://your-server:3333/health
Interval: 5 minutes
Method: GET
Expected: 200 OK
Keyword: "ok"

Custom Health Checks

Create custom monitoring scripts:

#!/bin/bash
RESPONSE=$(curl -sf http://localhost:3333/health)
if [ $? -eq 0 ] && echo "$RESPONSE" | grep -q '"status":"ok"'; then
  echo "th0th is healthy"
  exit 0
else
  echo "th0th is unhealthy"
  exit 1
fi

Log Monitoring

API Logs

View real-time logs:

# Docker deployment
docker compose logs -f api

# Source deployment
bun run start:api  # Logs to stdout

Log Levels

Configure log verbosity via LOG_LEVEL environment variable:

LOG_LEVEL=debug  # Verbose debugging
LOG_LEVEL=info   # Default operational logs
LOG_LEVEL=warn   # Warnings and errors only
LOG_LEVEL=error  # Errors only

Startup Health Check

On startup, th0th automatically runs a local-first health check:

th0th Tools API running at http://localhost:3333
Swagger docs at http://localhost:3333/swagger
Local-first health: ALL SERVICES HEALTHY

If services are degraded:

Local-first health: DEGRADED
  -> Ollama is not responding at http://localhost:11434
  -> Run: ollama serve
  -> Or update OLLAMA_BASE_URL environment variable

Performance Benchmarks

Monitor key performance indicators:

Response Time Targets

Endpoint	Target	Acceptable
`/health`	< 50ms	< 200ms
`/api/v1/system/status`	< 100ms	< 500ms
`/api/v1/search/project`	< 2s	< 5s
`/api/v1/project/index`	N/A	Depends on size

Cache Hit Rates

Monitor cache performance:

curl -X POST http://localhost:3333/api/v1/analytics \
  -H "Content-Type: application/json" \
  -d '{"type": "cache"}'

Target metrics:

L1 cache hit rate: > 60%
L2 cache hit rate: > 80%
Embedding cache hit rate: > 90% (for repeated content)

Resource Usage

Memory:

Base memory: ~150MB
With active caches: ~250-500MB
Under heavy load: < 1GB

Disk:

Fresh install: < 50MB
After indexing 10 projects: ~500MB - 2GB
Embedding cache grows with unique content

Alerting

Set up alerts for critical conditions:

Service Down

Alert when /health returns non-200 status

Ollama Unavailable

Alert when /api/v1/system/ollama reports unhealthy status

Disk Space

Alert when database total size exceeds threshold (e.g., 10GB)

High Memory

Alert when process RSS exceeds 2GB

Next Steps

Deployment

Deployment configuration and setup

Troubleshooting

Common issues and solutions

Get Started

Integration

Core Concepts

Configuration

Operations

Health Check Endpoints

Basic Health Check

Comprehensive Health Check

System Status

System Metrics

System Information

Performance Metrics

Ollama Status

Analytics and Usage

Analytics Types

Docker Health Checks

Monitoring Integrations

Prometheus

Uptime Monitoring

Custom Health Checks

Log Monitoring

API Logs

Log Levels

Startup Health Check

Performance Benchmarks

Response Time Targets

Cache Hit Rates

Resource Usage

Alerting

Next Steps

Deployment

Troubleshooting

Get Started

Integration

Core Concepts

Configuration

Operations

Documentation Index

​Health Check Endpoints

​Basic Health Check

​Comprehensive Health Check

​System Status

​System Metrics

​System Information

​Performance Metrics

​Ollama Status

​Analytics and Usage

​Analytics Types

​Docker Health Checks

​Monitoring Integrations

​Prometheus

​Uptime Monitoring

​Custom Health Checks

​Log Monitoring

​API Logs

​Log Levels

​Startup Health Check

​Performance Benchmarks

​Response Time Targets

​Cache Hit Rates

​Resource Usage

​Alerting

​Next Steps

Deployment

Troubleshooting

Health Check Endpoints

Basic Health Check

Comprehensive Health Check

System Status

System Metrics

System Information

Performance Metrics

Ollama Status

Analytics and Usage

Analytics Types

Docker Health Checks

Monitoring Integrations

Prometheus

Uptime Monitoring

Custom Health Checks

Log Monitoring

API Logs

Log Levels

Startup Health Check

Performance Benchmarks

Response Time Targets

Cache Hit Rates

Resource Usage

Alerting

Next Steps