Skip to content

Conversation

@tsbhangu
Copy link
Contributor

Summary

  • Add extensible system prompt caching support for Anthropic and Bedrock providers
  • Create CacheConfig and ProviderCapabilities dataclasses for provider-agnostic configuration
  • Anthropic: uses cache_control with ephemeral type per their API spec
  • Bedrock: uses cachePoint with default type per AWS Converse API
  • Cohere: logs warning when caching requested (no native support)

Test plan

  • Unit tests for CacheConfig and ProviderCapabilities dataclasses
  • Unit tests for Anthropic provider caching format
  • Unit tests for Bedrock provider caching format
  • Unit tests for Cohere warning log
  • Unit tests for factory passthrough
  • All existing LLM tests pass
  • Lint and type checks pass

🤖 Generated with Claude Code

tsbhangu and others added 2 commits November 24, 2025 16:36
Add extensible prompt caching support for Anthropic and Bedrock providers:
- Create CacheConfig and ProviderCapabilities dataclasses
- Anthropic: uses cache_control with ephemeral type
- Bedrock: uses cachePoint with default type
- Cohere: logs warning (no native caching support)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
Test coverage for:
- CacheConfig and ProviderCapabilities dataclasses
- Anthropic provider caching with ephemeral cache_control
- Bedrock provider caching with cachePoint
- Cohere warning log when caching unsupported
- Factory cache_config passthrough

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
@vercel
Copy link
Contributor

vercel bot commented Nov 24, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Preview Updated (UTC)
dev.ferndocs.com Ready Ready Preview Nov 24, 2025 11:46pm
fern-dashboard Ready Ready Preview Nov 24, 2025 11:46pm
fern-dashboard-dev Ready Ready Preview Nov 24, 2025 11:46pm
prod-assets.ferndocs.com Ready Ready Preview Nov 24, 2025 11:46pm
prod.ferndocs.com Ready Ready Preview Nov 24, 2025 11:46pm
1 Skipped Deployment
Project Deployment Preview Updated (UTC)
fern-platform Ignored Ignored Nov 24, 2025 11:46pm

@github-actions
Copy link
Contributor

🚀 FAI Chat Lambda Preview Deployed

Your Lambda function has been deployed to a preview environment!

🔗 Preview URL: https://1k3721ejqg.execute-api.us-east-1.amazonaws.com/dev2

📝 Available Endpoints:

  • Health: GET https://1k3721ejqg.execute-api.us-east-1.amazonaws.com/dev2/health
  • Chat (streaming): POST https://1k3721ejqg.execute-api.us-east-1.amazonaws.com/dev2/chat

📋 Example Usage:

# Test health endpoint
curl "https://1k3721ejqg.execute-api.us-east-1.amazonaws.com/dev2/health"

# Test chat endpoint (currently returns hardcoded response)
curl -X POST "https://1k3721ejqg.execute-api.us-east-1.amazonaws.com/dev2/chat"

🏷️ Stack Name: fai-chat-preview-5475

ℹ️ Note: This preview will be automatically destroyed when the PR is closed or merged.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants