LLM Safety Filters

Security & Ethics

phi-2 prompt injection detector (https://huggingface.co/ysy970923/phi-2-prompt-injection-QLoRA)
Profanity Filters with Zero-Shot LLM Prompting
PII (personal identifiable information) checking via Microsoft Presidio Analyzer

Information value estimate

RAG based LLM prompting with existing responses in the vector DB.
1. check plagiarism
2. estimate information value

PreviousKnowledge Staking via Discussion NextPay to Chat

Last updated 2 years ago