Response signature
AI agents have a similar response signature to the pre-built classifiers, with the addition of arule
field.
The label of the rule that was triggered. Returns NEUTRAL if no rules were
triggered.
The score of the most probable label. LLama Guard returns a probability score
between 0 and 1, whereas GPT-4o-mini returns a binary classification 1 or 0.
An object containing all the rules with either 0 or 1 as the score.
The description of the rule that was triggered. Returns null if no rules were
triggered.