|
24 | 24 |
|
25 | 25 | - **OpenAI Compatible API**: Fully compatible with OpenAI format `/v1/chat/completions` endpoint |
26 | 26 | - **TTS Speech Generation**: Supports Gemini 2.5 TTS models for single/multi-speaker audio generation |
| 27 | +- **Image Generation**: Supports Imagen 3 and Gemini 2.5 Flash (Nano Banana) image generation |
| 28 | +- **Video Generation**: Supports Veo 2 video generation, including image-to-video |
27 | 29 | - **Smart Model Switching**: Dynamically switch models in AI Studio via the `model` field |
28 | 30 | - **Anti-Fingerprint Detection**: Uses Camoufox browser to reduce detection risk |
29 | 31 | - **GUI Launcher**: Feature-rich **web** launcher for simplified configuration and management |
@@ -268,6 +270,7 @@ AIStudio2API/ |
268 | 270 | │ ├── config/ # Configuration management |
269 | 271 | │ ├── models/ # Data models |
270 | 272 | │ ├── tts/ # TTS Speech Generation modules |
| 273 | +│ ├── media/ # Media Generation modules (Imagen/Veo/Nano) |
271 | 274 | │ ├── proxy/ # Streaming proxy |
272 | 275 | │ └── static/ # Static resources |
273 | 276 | ├── data/ # Runtime data directory |
@@ -346,10 +349,13 @@ Issues and Pull Requests are welcome! |
346 | 349 | ## 📅 Development Roadmap |
347 | 350 |
|
348 | 351 | - ✅ **TTS Support**: Adapted `gemini-2.5-flash/pro-preview-tts` speech generation models |
| 352 | +- ✅ **Media Generation**: Supports Imagen 3, Veo 2, Nano Banana image/video generation |
| 353 | +- **Unified Click Logic**: Extract `_safe_click` method to global `operations.py`, unify click operations across all controllers |
349 | 354 | - **Documentation**: Update and optimize documentation in `docs/` directory |
350 | 355 | - **One-Click Deployment**: Provide fully automated install and launch scripts for Windows/Linux/macOS |
351 | 356 | - **Docker Support**: Provide standard Dockerfile and Docker Compose orchestration files |
352 | 357 | - **Go Refactoring**: Migrate core proxy service to Go for improved concurrency and reduced resource usage |
353 | 358 | - **CI/CD Pipeline**: Establish GitHub Actions automated testing and build release process |
354 | 359 | - **Unit Testing**: Increase test coverage for core modules (especially browser automation) |
355 | 360 | - **Load Balancing**: Support multi-Google account rotation pool for higher concurrency limits |
| 361 | + |
0 commit comments