703 / 795

Roblox introduces real-time AI-powered chat rephraser for inappropriate language

TL;DR

Roblox is replacing its blunt #### chat censorship with a real-time AI rephraser that rewrites inappropriate messages into cleaner alternatives.

Key Points

  • Previously, policy violations were silently blocked, making chats hard to follow. Now, all participants see the rephrased version plus a note that the message was edited.
  • The sender also sees exactly which language was changed, adding transparency to the moderation process.
  • Profanity is the starting point: 'Hurry TF up' becomes 'Hurry up!' – repeat offenders still face platform penalties regardless.

Nauti's Take

This is one of the rare cases where AI moderation actually feels like an upgrade – #### frustrates everyone and communicates nothing useful. That said, the core risk is real: who defines 'more appropriate,' and how often does the model get it wrong?

Misrephrased messages can distort meaning, which is especially concerning on a platform full of minors. Starting with profanity is the sensible call – it's a narrow, well-defined problem.

The real test comes when edge cases and viral fails inevitably surface.

Context

Roblox's young user base makes chat moderation higher-stakes than on most platforms. The shift from blunt censorship to context-aware rewriting shows how real-time AI can make moderation feel more human without killing conversational flow. The sender transparency is a notable design choice: users learn what was flagged rather than just hitting a wall of hashtags.

Sources