rt-inod-jailbreaking

Red Teaming Innodata Jailbreaking

Introduced in Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations (2024)

About this Dataset

The Innodata Red Teaming Prompts dataset aims to rigorously assess models' factuality and safety. Because it was created manually and covers a broad range of topics, it supports a comprehensive examination of LLM performance across diverse scenarios.

Source: Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations

Dataset Variants

rt-inod-jailbreaking

Papers (1)

Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations

Tasks

Dialogue Safety Prediction

Similar Datasets

MNIST

CelebA

GLUE

Statistics

Papers: 1

Tasks: 2

Introduced: 2024

License: CC BY-SA 4.0