Ever had your beautifully crafted HTML fail validation thanks to text pasted from a Word document the client provided? Tired of manually tracking down symbols and accented characters? Weary from inserting line and paragraph breaks?
Character Cleaner is a little web-based utility that converts pasted-in text to clean markup, optionally encoding certain characters and creating line breaks and paragraphs automatically. It’s contained entirely in a single file, so you can easily save it for offline use.
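To give a feel for the line-break and paragraph conversion, here is a minimal Python sketch. It is not Character Cleaner's actual code: the function name `to_paragraphs`, the rule that a blank line starts a new paragraph, and the choice of `<br>` for single line breaks are all assumptions made for illustration.

```python
import re


def to_paragraphs(text: str) -> str:
    """Wrap pasted plain text in <p> elements (hypothetical helper).

    Assumption: one or more blank lines separate paragraphs, and a single
    newline inside a paragraph becomes a <br>.
    """
    # Normalise Windows and old Mac line endings to "\n".
    text = text.replace("\r\n", "\n").replace("\r", "\n").strip()
    if not text:
        return ""

    paragraphs = []
    # Split on runs of blank (or whitespace-only) lines.
    for block in re.split(r"\n\s*\n", text):
        lines = [line.strip() for line in block.split("\n")]
        paragraphs.append("<p>" + "<br>\n".join(lines) + "</p>")
    return "\n".join(paragraphs)


if __name__ == "__main__":
    print(to_paragraphs("One\nTwo\n\nThree"))
```

Any character encoding would need to happen before the tags are added, otherwise the generated `<p>` and `<br>` would be encoded too.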
Version 2 uses only decimal entities, to simplify any later conversion or manipulation, and offers three levels of encoding: minimal encoding, for when you’re working with fully Unicode-compliant documents, systems and editors; encoding anything that’s not 8-bit-safe, for when you’re working with ISO-8859-1 (Latin-1), for example; and a really cautious mode that makes the markup ASCII-safe, so that anything other than basic characters gets encoded.
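The three encoding levels can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the utility's real logic: the name `encode_text` and the mode names are hypothetical, "minimal" is assumed to mean only the markup-significant characters `&`, `<` and `>`, "8-bit-safe" is taken as any code point up to 255, and "ASCII-safe" as any code point up to 127.

```python
# Highest code point left unencoded in each mode (None = no upper limit).
# The mode names are hypothetical, chosen for this sketch.
LIMITS = {"minimal": None, "latin1": 0xFF, "ascii": 0x7F}

# Assumption: these are always encoded so the text cannot break the markup.
MARKUP_CHARS = "&<>"


def encode_text(text: str, mode: str = "minimal") -> str:
    """Replace characters with decimal entities according to the mode."""
    if mode not in LIMITS:
        raise ValueError(f"unknown mode: {mode!r}")
    limit = LIMITS[mode]

    out = []
    for ch in text:
        code_point = ord(ch)
        too_high = limit is not None and code_point > limit
        if ch in MARKUP_CHARS or too_high:
            out.append(f"&#{code_point};")  # decimal entity, e.g. &#233;
        else:
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    sample = "café “hi” & <b>"
    for mode in LIMITS:
        print(mode, "->", encode_text(sample, mode))
```

With this sketch, `é` survives the minimal and Latin-1 modes but becomes `&#233;` in the ASCII-safe mode, while Word's curly quotes are encoded in both the Latin-1 and ASCII-safe modes.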
Get in touch if you find it useful or want to report any bugs.
