Add streaming support in OpenAI client test script and update 0.2.4 changelog to note FORCE_ANONYMISATION mode.

Paweł Kędzia · Paweł Kędzia · commit 815adad309af · 2025-11-17T03:31:26.000+01:00
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -12,4 +12,4 @@
 | 0.2.1   | Fix stream: OpenAI->Ollama, Ollama->OpenAI. Add Redis caching of availability of model providers (when using `first_available` strategy). Add `llm_router_web` module with simple flask-based frontend to manage llm-router config files.                                                   |
 | 0.2.2   | Update dockerfile and requirements. Fix routing with vLLM.                                                                                                                                                                                                                                  |
 | 0.2.3   | New web configurator: Handling projects, configs for each user separately. First Available strategy is more powerful, a lot of improvements to efficiency.                                                                                                                                  |
-| 0.2.4   | Anonymizer module, integration anonymization with any endpoint (using dynamic payload analysis and full payload anonymisation), dedicated `/api/anonymize_text` endpoint as memory only anonymization.                                                                                      |
+| 0.2.4   | Anonymizer module, integration anonymization with any endpoint (using dynamic payload analysis and full payload anonymisation), dedicated `/api/anonymize_text` endpoint as memory only anonymization. Whole router may be run in `FORCE_ANONYMISATION` mode.                               |
diff --git a/tests/openai-client.py b/tests/openai-client.py
@@ -8,9 +8,21 @@
     base_url="http://192.168.100.65:8080/v1",
 )
 
+use_stream = True
+
+model_1 = "google/gemma-3-12b-it"
+model_2 = "gpt-oss:120b"
+
 response = client.chat.completions.create(
-    model="google/gemma-3-12b-it",
-    messages=[{"role": "user", "content": "Hello world"}],
+    model=model_1,
+    messages=[{"role": "user", "content": "Write simple somethig"}],
+    stream=use_stream,
 )
 
-print(response.choices)
+if use_stream:
+    for s in response:
+        if not s:
+            break
+        print(s.choices[0].delta.content, end="")
+else:
+    print(response.choices)