MCP Servers

A collection of Model Context Protocol servers, templates, tools and more.

G
Gpu Cluster Operator MCP Agent

An autonomous Site Reliability Engineer (SRE) agent powered by the Model Context Protocol (MCP) and Gemini. This agent connects to your Cherry Studio (or other MCP clients), intelligently diagnoses Linux server issues, performs network scans, and executes remediation steps using an OODA Loop (Observe, Orient, Decide, Act) strategy.

Created 1/6/2026
Updated 2 days ago
Repository documentation and setup instructions

🤖 Linux SRE MCP Agent

An autonomous GPU cluster Reliability Engineer (SRE) agent powered by the Model Context Protocol (MCP) and Gemini. This agent connects to your MCP clients (such as Cherry Studio), intelligently diagnoses Linux server issues, performs network scans, and executes remediation steps using an OODA Loop (Observe, Orient, Decide, Act) strategy.

✨ Features

  • 🧠 Batch Diagnostics: Uses Gemini to analyze multiple system states (Processes, Logs, Network) in a single pass.
  • 🚀 SSH Multiplexing: Implements ControlMaster for millisecond-latency executions.
  • 🛡️ Hybrid Execution: Automatically detects if the target is Local or Remote.
  • 🔑 Key-Based Auth: Secure, password-less operation using SSH keys and sudo NOPASSWD.
  • 🕵️ Network Awareness: Capable of scanning local subnets.
  • 🔄 Self-Healing: Detects errors and autonomously digs for root causes.

🚀 Installation

  1. Install Dependencies

    pip install -r requirements.txt
    
  2. Configure Environment

    cp .env.example .env
    # Edit .env with your API keys
    
  3. Run

    python src/server.py
    
Quick Setup
Installation guide for this server

Install Package (if required)

uvx gpu-cluster-operator-mcp-agent

Cursor configuration (mcp.json)

{ "mcpServers": { "skyphy-gpu-cluster-operator-mcp-agent": { "command": "uvx", "args": [ "gpu-cluster-operator-mcp-agent" ] } } }