Shape
Each guideline has two fields:| Field | Limit | What it is |
|---|---|---|
key | 100 chars | A unique identifier for the rule on this channel (e.g. no-spam, be-respectful). |
instructions | 750 chars | A natural-language description of what is not allowed. |
In API responses
Every guideline shows up as its own entry inresponse.policies. The id is guideline/<key>, the type is classifier, and you read probability / flagged like any other classifier:
Writing good instructions
A few patterns that work well:- Lead with the violation, not the policy.
Posts that ask users to DM, follow on another platform, or share contact infobeatsAnti-spam rule for off-platform redirection. - Be specific about edge cases. If “promoting other products” should allow users mentioning tools they use, say so:
Promoting other products or services. Mentioning tools a user personally uses is allowed. - One concept per guideline. Split unrelated rules into separate guidelines so you can flag them independently and tune thresholds per rule.
- Keep it under ~300 characters. Longer instructions don’t make the LLM more accurate — they make it slower and harder to debug.
Configuring guidelines
Guidelines are configured in the dashboard, per channel — open the channel’s Policies tab and add them under Guidelines. They run on text and audio content.Project context
Guidelines are evaluated with your project context in scope. That’s the description you set under Configure → Context in the dashboard — typically a short paragraph about what your platform is, who uses it, and the kind of content you expect. It’s passed to the LLM alongside your guideline instructions so the rules are interpreted in the right setting. For example, a guideline likeNo promoting other platforms will behave differently for a developer forum (where mentioning GitHub is normal) than for a dating app (where directing users off-platform is a red flag). A well-written project context lets the LLM make that call without you spelling out every exception in the guideline itself.
See Understanding API responses for the full response shape, and Optimizing accuracy for guidance on tuning thresholds.