AI Text Cleaner

Paste text to detect and remove invisible characters and typography artifacts left by AI models and copy-paste. Everything runs in your browser.

Your text is inspected and cleaned locally in your browser and is never uploaded, but avoid pasting confidential content into any online tool.

Seeing garbled accents like CafÃ©? Try the Mojibake Fixer.

About AI Text Cleaner

This AI Text Cleaner is a simple inspector and sanitizer for text that carries characters you cannot see. Paste anything a language model generated, or content you copied from a doc, PDF, or web page, and it counts and removes the hidden format characters that slip in: zero-width spaces and joiners, the narrow and regular no-break spaces, soft hyphens, bidirectional controls, invisible math operators, and emoji variation selectors. Optional toggles straighten curly quotes, turn em and en dashes into plain hyphens, collapse runs of spaces, and apply Unicode NFC normalization. You get a removed-by-type summary plus before and after character and UTF-8 byte counts, then copy or download the cleaned result. The whole check runs in your browser, so the text you paste stays on your device.
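As a rough illustration of the count-and-remove pass described above, here is a minimal Python sketch. The `HIDDEN` table, the category names, and the `clean` function are hypothetical names chosen for this example; the tool itself runs in the browser, and its actual character list and implementation may differ. The sketch also assumes that no-break spaces are swapped for ordinary spaces rather than deleted, so that words do not run together.

```python
from collections import Counter

# Hypothetical category table for illustration; the real tool's list may differ.
HIDDEN = {
    "zero-width": "\u200b\u200c\u200d\u2060\ufeff",
    "soft hyphen": "\u00ad",
    "bidi control": "\u061c\u200e\u200f\u202a\u202b\u202c\u202d\u202e"
                    "\u2066\u2067\u2068\u2069",
    "invisible math": "\u2061\u2062\u2063\u2064",
    "variation selector": "".join(chr(c) for c in range(0xFE00, 0xFE10)),
}
NO_BREAK = "\u00a0\u202f"  # no-break space, narrow no-break space

# Reverse lookup: character -> category name.
CATEGORY = {ch: name for name, chars in HIDDEN.items() for ch in chars}


def clean(text):
    """Return (cleaned_text, removed_by_type) for the given string."""
    removed = Counter()
    out = []
    for ch in text:
        if ch in CATEGORY:
            removed[CATEGORY[ch]] += 1   # drop the character entirely
        elif ch in NO_BREAK:
            removed["no-break space"] += 1
            out.append(" ")              # assumption: swap for a plain space
        else:
            out.append(ch)
    return "".join(out), removed


cleaned, summary = clean("10\u00a0km\u200b away\u00ad")
print(cleaned)         # 10 km away
print(dict(summary))   # {'no-break space': 1, 'zero-width': 1, 'soft hyphen': 1}
```

Keeping the categories in one table makes the removed-by-type summary a by-product of the same loop that does the cleaning.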

How to use the AI Text Cleaner

  1. Paste or type your text into the input box on the left.
  2. Read the banner and the removed-by-type list to see which hidden characters were found.
  3. Toggle any typography options you want, such as straightening quotes or collapsing spaces.
  4. Check the before and after character and byte counts to confirm what changed.
  5. Copy the cleaned text or download it as cleaned.txt.
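The before and after counts in step 4 can be reproduced by hand. The sketch below is only an approximation: the helper name `counts` is hypothetical, and it assumes "characters" means Unicode code points, whereas a browser implementation might count UTF-16 code units instead.

```python
def counts(text):
    """Character count (code points) and UTF-8 byte count for a string."""
    return {"chars": len(text), "bytes": len(text.encode("utf-8"))}


before = "The\u200b model"            # contains one zero-width space (U+200B)
after = before.replace("\u200b", "")  # same text with it removed

print(counts(before))  # {'chars': 10, 'bytes': 12}
print(counts(after))   # {'chars': 9, 'bytes': 9}
```

The single invisible character accounts for one code point but three UTF-8 bytes, which is why the byte count can drop faster than the character count.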

Example

Input

The​ model said “this is clean”—or is it?

Output

The model said "this is clean"-or is it?

A zero-width space is removed; with the quote and dash options on, curly quotes become straight and the em dash becomes a hyphen.
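The same transformation can be sketched in a few lines of Python. The function name `clean_example` and its `quotes` and `dashes` flags are hypothetical stand-ins for the tool's toggles, not its actual code.

```python
# Translation tables for the two optional typography toggles.
QUOTES = str.maketrans({"\u201c": '"', "\u201d": '"', "\u2018": "'", "\u2019": "'"})
DASHES = str.maketrans({"\u2014": "-", "\u2013": "-"})  # em dash, en dash


def clean_example(text, quotes=False, dashes=False):
    """Drop zero-width spaces; optionally straighten quotes and dashes."""
    text = text.replace("\u200b", "")   # always remove the zero-width space
    if quotes:
        text = text.translate(QUOTES)
    if dashes:
        text = text.translate(DASHES)
    return text


source = "The\u200b model said \u201cthis is clean\u201d\u2014or is it?"
print(clean_example(source, quotes=True, dashes=True))
# The model said "this is clean"-or is it?
```

With both flags left off, only the zero-width space is removed and the curly quotes and em dash are kept as they were.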

Frequently asked questions

What is a zero-width space?
A zero-width space (U+200B) is a Unicode character with no visible width. It can create invisible word breaks, throw off exact-match searches, and change byte counts even though nothing appears on screen. This tool removes it along with related zero-width joiners and no-break spaces.
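A short Python sketch shows why this matters for exact-match searches. The strings are made up for illustration.

```python
import unicodedata

visible = "password"
tainted = "pass\u200bword"   # looks the same on screen, but hides U+200B

print(visible == tainted)           # False
print("password" in tainted)        # False
print(len(visible), len(tainted))   # 8 9
print(unicodedata.category("\u200b"), unicodedata.name("\u200b"))
# Cf ZERO WIDTH SPACE
```

The general category `Cf` (format) is what marks the character as one that affects layout without drawing anything.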
Why does AI-generated text sometimes contain hidden characters?
Models are trained on web and document text that already includes non-breaking spaces, smart quotes, soft hyphens, and other formatting characters, and they can reproduce those characters in their output. Copy-pasting from rich editors, PDFs, and chat apps adds more. The result is text that looks clean but carries characters you cannot see.
Which characters does the AI Text Cleaner remove?
It removes the zero-width space (U+200B), zero-width non-joiner (U+200C), zero-width joiner (U+200D), and word joiner (U+2060); the byte-order mark (U+FEFF); no-break (U+00A0) and narrow no-break (U+202F) spaces; soft hyphens (U+00AD); bidirectional controls; invisible math operators (U+2061–U+2064); and variation selectors (U+FE00–U+FE0F), which include the emoji presentation selector U+FE0F. Quote, dash, spacing, and NFC changes are optional toggles.
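To give a feel for the optional NFC and spacing toggles, here is a minimal Python sketch. The `normalize` function and its flag names are hypothetical, and it assumes "collapsing spaces" means reducing runs of ordinary spaces to one; the tool's own rules may be broader.

```python
import re
import unicodedata


def normalize(text, nfc=False, collapse_spaces=False):
    """Apply the optional NFC and space-collapsing steps."""
    if nfc:
        text = unicodedata.normalize("NFC", text)   # compose e + U+0301 into é
    if collapse_spaces:
        text = re.sub(r" {2,}", " ", text)          # runs of spaces -> one space
    return text


decomposed = "Cafe\u0301   au lait"   # "e" + combining acute, then three spaces
result = normalize(decomposed, nfc=True, collapse_spaces=True)

print(result == "Caf\u00e9 au lait")   # True
print(len(decomposed), len(result))    # 15 12
```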
What are bidirectional control characters and why remove them?
Bidi controls such as the right-to-left override (U+202E) change the order in which text is displayed. They are legitimate in mixed-script content but can also be used to disguise file names or code. This tool flags and strips them so that what you see matches the underlying characters.
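The file-name disguise can be demonstrated with a small Python sketch. The `BIDI_CONTROLS` set and the two helper functions are hypothetical names for this example, and the file name is invented.

```python
import unicodedata

# Marks, embeddings, overrides, and isolates defined by the Unicode bidi algorithm.
BIDI_CONTROLS = set(
    "\u061c\u200e\u200f"
    "\u202a\u202b\u202c\u202d\u202e"
    "\u2066\u2067\u2068\u2069"
)


def find_bidi(text):
    """Return (index, code point, Unicode name) for each bidi control found."""
    return [
        (i, f"U+{ord(ch):04X}", unicodedata.name(ch))
        for i, ch in enumerate(text)
        if ch in BIDI_CONTROLS
    ]


def strip_bidi(text):
    """Remove every bidi control character from the text."""
    return "".join(ch for ch in text if ch not in BIDI_CONTROLS)


name = "invoice\u202efdp.exe"   # may render like "invoiceexe.pdf"
print(find_bidi(name))    # [(7, 'U+202E', 'RIGHT-TO-LEFT OVERRIDE')]
print(strip_bidi(name))   # invoicefdp.exe
```

Once the override is stripped, the displayed name matches the stored one and the real `.exe` extension becomes visible.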
Does this tool change my visible words or rewrite the text?
No. It is an inspector and sanitizer: by default it only deletes characters you cannot see and reports what it removed. It never paraphrases, adds, or rewrites your words, and the typography options only run when you turn them on.
Is my text uploaded anywhere when I clean invisible characters?
No. The detection and cleanup run entirely in your browser, so the text you paste is processed on your device and is not sent to a server.
