coder · ThomasK33 · Nov 21, 2025 · Nov 24, 2025 · Nov 24, 2025 · Nov 25, 2025
diff --git a/.cmux/scripts/demo b/.cmux/scripts/demo
@@ -0,0 +1,21 @@
+#!/usr/bin/env bash
+# Description: Demo script to showcase the script execution feature. Accepts no arguments.
+set -euo pipefail
+
+# Progress messages to stderr (shown to user, not sent to agent)
+echo "Running demo script..." >&2
+echo "Current workspace: $(pwd)" >&2
+echo "Timestamp: $(date)" >&2
+
+# Structured output to stdout (sent to agent)
+cat <<'EOF'
+## 🎉 Script Execution Demo
+
+✅ Script executed successfully!
+
+**Output Semantics:**
+- `stdout`: Sent to the agent as tool result
+- `stderr`: Shown to user only (progress/debug info)
+
+The demo script completed. You can create workspace-specific scripts to automate tasks.
+EOF
diff --git a/.cmux/scripts/echo b/.cmux/scripts/echo
@@ -0,0 +1,35 @@
+#!/usr/bin/env bash
+# Description: Echo arguments demo. Accepts any number of arguments (strings) which will be echoed back.
+set -euo pipefail
+
+# Check if arguments were provided
+if [ $# -eq 0 ]; then
+  cat <<'EOF'
+## ⚠️ No Arguments Provided
+
+Usage: `/s echo <message...>`
+
+Example: `/s echo hello world`
+EOF
+  exit 0
+fi
+
+# Structured output to stdout (sent to agent)
+cat <<EOF
+## 🔊 Echo Script
+
+**You said:** $@
+
+**Arguments received:**
+- Count: $# arguments
+- First arg: ${1:-none}
+- Second arg: ${2:-none}
+- All args: $@
+
+**Individual arguments:**
+EOF
+
+# Loop through each argument
+for i in $(seq 1 $#); do
+  echo "- Arg $i: ${!i}"
+done
diff --git a/.cmux/scripts/wait_pr_checks b/.cmux/scripts/wait_pr_checks
@@ -0,0 +1,7 @@
+#!/usr/bin/env bash
+# Description: Wait for PR checks to pass on GitHub. Use this after pushing changes to origin, to catch CI failures. Accepts no arguments.
+set -euo pipefail
+
+BRANCH=$(git branch --show-current)
+NUMBER=$(gh pr list --head "$BRANCH" --json number | jq -cr '.[0].number')
+./scripts/wait_pr_checks.sh "$NUMBER"
diff --git a/.github/workflows/codex-comment-watch.yml b/.github/workflows/codex-comment-watch.yml
@@ -0,0 +1,47 @@
+name: Codex Comment Watch
+
+on:
+  issue_comment:
+    types:
+      - created
+  pull_request_review_comment:
+    types:
+      - created
+  pull_request_review:
+    types:
+      - submitted
+
+permissions:
+  contents: read
+  pull-requests: read
+
+concurrency:
+  group: codex-comment-watch-${{ github.event.issue.number || github.event.pull_request.number || github.run_id }}
+  cancel-in-progress: true
+
+jobs:
+  check-codex-comments:
+    name: Check Codex Comments
+    runs-on: ${{ github.repository_owner == 'coder' && 'depot-ubuntu-22.04-16' || 'ubuntu-latest' }}
+    if: >
+      contains(fromJson('["chatgpt-codex-connector","chatgpt-codex-connector[bot]"]'), github.event.sender.login)
+      && (github.event_name != 'issue_comment' || github.event.issue.pull_request != null)
+    steps:
+      - name: Checkout code
+        uses: actions/checkout@v4
+        with:
+          fetch-depth: 0 # Required for git describe to find tags
+
+      - name: Determine PR number
+        id: determine-pr
+        run: |
+          if [[ "${{ github.event_name }}" == "issue_comment" ]]; then
+            echo "value=${{ github.event.issue.number }}" >> "$GITHUB_OUTPUT"
+          else
+            echo "value=${{ github.event.pull_request.number }}" >> "$GITHUB_OUTPUT"
+          fi
+
+      - name: Check for unresolved Codex comments
+        env:
+          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+        run: ./scripts/check_codex_comments.sh ${{ steps.determine-pr.outputs.value }}
diff --git a/docs/SUMMARY.md b/docs/SUMMARY.md
@@ -13,6 +13,7 @@
   - [SSH](./ssh.md)
   - [Forking](./fork.md)
   - [Init Hooks](./init-hooks.md)
+  - [Workspace Scripts](./scripts.md)
 - [VS Code Extension](./vscode-extension.md)
 - [Models](./models.md)
 - [Keyboard Shortcuts](./keybinds.md)

diff --git a/docs/scripts.md b/docs/scripts.md
@@ -0,0 +1,187 @@
+# Workspace Scripts
+
+Execute custom scripts from your workspace using slash commands or let the AI Agent run them as tools.
+
+## Overview
+
+Scripts are stored in `.mux/scripts/` within each workspace. They serve two purposes:
+
+1. **Human Use**: Executable via `/script <name>` or `/s <name>` in chat.
+2. **Agent Use**: Automatically exposed to the AI as tools (`script_<name>`), allowing the agent to run complex workflows you define.
+
+Scripts run in the workspace directory with full access to project secrets and environment variables.
+
+**Key Point**: Scripts are workspace-specific. Each workspace has its own custom toolkit defined in `.mux/scripts/`.
+
+## Creating Scripts
+
+1. **Create the scripts directory**:
+
+   ```bash
+   mkdir -p .mux/scripts
+   ```
+
+2. **Add an executable script**:
+
+   ```bash
+   #!/usr/bin/env bash
+   # Description: Deploy to staging. Accepts one optional argument: 'dry-run' to simulate.
+
+   if [ "${1:-}" == "dry-run" ]; then
+     echo "Simulating deployment..."
+   else
+     echo "Deploying to staging..."
+   fi
+   ```
+
+   **Crucial**: The `# Description:` line is what the AI reads to understand the tool. Be descriptive about what the script does and what arguments it accepts.
+
+3. **Make it executable**:
+
+   ```bash
+   chmod +x .mux/scripts/deploy
+   ```
+
+## Agent Integration (AI Tools)
+
+Every executable script in `.mux/scripts/` is automatically registered as a tool for the AI Agent.
+
+- **Tool Name**: `script_<name>` (e.g., `deploy` -> `script_deploy`, `run-tests` -> `script_run_tests`)
+- **Tool Description**: Taken from the script's header comment (`# Description: ...`).
+- **Arguments**: The AI can pass an array of string arguments to the script.
+
+### Optimization for AI
+
+To make your scripts effective AI tools:
+
+1. **Clear Descriptions**: Explicitly state what the script does and what arguments it expects.
+
+   ```bash
+   # Description: Fetch logs. Requires one argument: the environment name (dev|prod).
+   ```
+
+2. **Robustness**: Use `set -euo pipefail` to ensure the script fails loudly if something goes wrong, allowing the AI to catch the error.
+3. **Clear Output**: Write structured output to stdout so the agent can understand results and take action.
+
+## Usage
+
+### Basic Execution
+
+Type `/s` or `/script` in chat to see available scripts with auto-completion:
+
+```
+/s deploy
+```
+
+### With Arguments
+
+Pass arguments to scripts:
+
+```
+/s deploy --dry-run
+/script test --verbose --coverage
+```
+
+Arguments are passed directly to the script as `$1`, `$2`, etc.
+
+## Execution Context
+
+Scripts run with:
+
+- **Working directory**: The workspace directory.
+- **Environment**: Full workspace environment + project secrets + special cmux variables.
+- **Timeout**: 5 minutes by default.
+- **Streams**: stdout/stderr are captured.
+  - **Human**: Visible in the chat card.
+  - **Agent**: Returned as the tool execution result.
+
+### Standard Streams
+
+Scripts follow Unix conventions for output:
+
+- **stdout**: Sent to the agent as the tool result. Use this for structured output the agent should act on.
+- **stderr**: Shown to the user in the UI but **not** sent to the agent. Use this for progress messages, logs, or debugging info that doesn't need AI attention.
+
+This design means scripts work identically whether run inside mux or directly from the command line.
+
+#### Example: Test Runner
+
+```bash
+#!/usr/bin/env bash
+# Description: Run tests and report failures for the agent to fix
+
+set -euo pipefail
+
+# Progress to stderr (user sees it, agent doesn't)
+echo "Running test suite..." >&2
+
+if npm test > test.log 2>&1; then
+  # Success message to stdout (agent sees it)
+  echo "✅ All tests passed"
+else
+  # Structured failure info to stdout (agent sees and can act on it)
+  cat << EOF
+❌ Tests failed. Here is the log:
+
+\`\`\`
+$(cat test.log)
+\`\`\`
+
+Please analyze this error and propose a fix.
+EOF
+  exit 1
+fi
+```
+
+**Result**:
+
+1. User sees "Running test suite..." progress message.
+2. On failure, agent receives the structured error with test log and instructions.
+3. Agent can immediately analyze and propose fixes.
+
+## Example Scripts
+
+### Deployment Script
+
+```bash
+#!/usr/bin/env bash
+# Description: Deploy application. Accepts one arg: environment (default: staging).
+set -euo pipefail
+
+ENV=${1:-staging}
+echo "Deploying to $ENV..."
+# ... deployment logic ...
+echo "Deployment complete!"
+```
+
+### Web Fetch Utility
+
+```bash
+#!/usr/bin/env bash
+# Description: Fetch a URL. Accepts exactly one argument: the URL.
+set -euo pipefail
+
+if [ $# -ne 1 ]; then
+    echo "Usage: $0 <url>"
+    exit 1
+fi
+curl -sL "$1"
+```
+
+## Script Discovery
+
+- Scripts are discovered automatically from `.mux/scripts/` in the current workspace.
+- Discovery is cached for performance but refreshes intelligently.
+- **Sanitization**: Script names are sanitized for tool use (e.g., `my-script.sh` -> `script_my_script_sh`).
+
+## Troubleshooting
+
+**Script not appearing in suggestions or tools?**
+
+- Ensure file is executable: `chmod +x .mux/scripts/scriptname`
+- Verify file is in `.mux/scripts/` directory.
+- Check for valid description header.
+
+**Agent using script incorrectly?**
+
+- Improve the `# Description:` header. Explicitly tell the agent what arguments to pass.
diff --git a/src/browser/App.stories.tsx b/src/browser/App.stories.tsx
@@ -85,6 +85,12 @@ function setupMockAPI(options: {
           success: true,
           data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
         }),
+      listScripts: () => Promise.resolve({ success: true, data: [] }),
+      executeScript: () =>
+        Promise.resolve({
+          success: true,
+          data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
+        }),
     },
     projects: {
       list: () => Promise.resolve(Array.from(mockProjects.entries())),
@@ -1255,6 +1261,12 @@ main
                   data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
                 });
               },
+              listScripts: () => Promise.resolve({ success: true, data: [] }),
+              executeScript: () =>
+                Promise.resolve({
+                  success: true,
+                  data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
+                }),
             },
           },
         });
@@ -1463,6 +1475,12 @@ These tables should render cleanly without any disruptive copy or download actio
                   success: true,
                   data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
                 }),
+              listScripts: () => Promise.resolve({ success: true, data: [] }),
+              executeScript: () =>
+                Promise.resolve({
+                  success: true,
+                  data: { success: true, output: "", exitCode: 0, wall_duration_ms: 0 },
+                }),
             },
           },
         });