llmtesting

Here are 4 public repositories matching this topic...

avi350751 / test-llm-with-deepeval

A hands-on exploration of DeepEval, an open-source framework for evaluating and red-teaming large language models (LLMs). This repository documents my journey of testing, benchmarking, and improving LLM reliability using custom prompts, metrics, and pipelines.

evals deepeval llmtesting

Updated Nov 2, 2025
Jupyter Notebook

avi350751 / bfsi-red-team

Red teaming a banking and finance LLM assistant.

yaml cybersecurity redteam promptfoo aitesting llmtesting

Updated Nov 19, 2025

avi350751 / autogen-playground

This repo is my playground for experimenting with AutoGen and using it to converse, build pipelines, and do LLM testing.

mcp multiagent autogen llmtesting

Updated Oct 30, 2025
Python

avi350751 / promptfoo-cicd

Integrating promptfoo into CI/CD pipelines to automatically evaluate prompts, test for security vulnerabilities, and ensure quality before deployment.

promptfoo llmtesting

Updated Oct 24, 2025

