Back to Projects
Zero-Shot Profanity Filter
CompletedPythonFlaskTransformers+6 more

Zero-Shot Profanity Filter

AI-powered profanity filtering for text, images, and Telegram moderation using zero-shot classification.

Timeline

3 months

Role

Full Stack

Team

Solo

Status
Completed

Technology Stack

Python
Flask
Transformers
PyTorch
HTML
CSS
JavaScript
Telegram Bot API
Pillow

Key Challenges

  • Model Accuracy
  • Threshold Tuning
  • Real-time Moderation
  • Image Moderation
  • Telegram Bot Permissions
  • False Positives

Key Learnings

  • Zero-Shot Classification
  • Flask API Design
  • Content Moderation Workflow
  • Model Threshold Calibration
  • Telegram Bot Automation
  • NSFW Image Classification

Zero-Shot Profanity Filter: AI moderation for text and images

Overview

Zero-Shot Profanity Filter is an AI moderation tool that detects and filters profane text, checks image safety, and supports Telegram group moderation.

What Users Can Do

  • Check Profanity: Check whether input text is profane with confidence scores using a zero-shot classifier.
  • Filter Text: Filter and censor profane text using multiple modes (full, word-level/sentence-level, aggressive).
  • Upload Images: Upload images to detect NSFW/profane visual content.
  • Telegram Bot: Use a Telegram bot that removes profane messages, applies strike rules, and bans repeat offenders.
  • Monitor API: Monitor API health and integrate endpoints into moderation workflows.

Why I built this

I built this platform to solve a fundamental issue I faced while studying:

  • Unmoderated online spaces often become unsafe due to abusive or profane language.
  • Most simple keyword-based filters miss context and multilingual variations.
  • Communities also need basic image safety checks, not only text filtering.
  • Group moderation should be automated to reduce manual admin effort.

Tech Stack

  • Python
  • Flask
  • Transformers
  • PyTorch
  • Pillow
  • HTML
  • CSS
  • JavaScript
  • python-telegram-bot
  • python-dotenv

After launch & Impact

  • Built a unified moderation workflow for both text and images in one project.
  • Added flexible filtering modes and threshold control to adapt to different safety requirements.
  • Enabled practical Telegram moderation with strike-based enforcement and automatic banning.
  • Improved understanding of real-time AI-assisted moderation and deployment trade-offs.
  • Established a reusable API foundation for integrating moderation into other apps.

Future Plans

  • Improve model performance and threshold calibration for better accuracy across different languages.
  • Add user-level moderation analytics and moderation history dashboards.
  • Add support for custom moderation policies and category-based filtering.
  • Optimize inference performance for lower latency and production-scale usage.
  • Deploy a hosted version with authentication and usage limits.

anvesh.dev

© 2026 Anvesh Mishra. All rights reserved.