ftfy - fix unicode that's broken in various ways

Paste in some unicode text that appears to be broken and this tool will use the ftfy Python library to try and fix it.

import ftfy.bad_codecs  # enables sloppy- codecs
from ftfy.fixes import fix_partial_utf8_punct_in_1252
s = 'Arsenal v Wolfsburg: pre-season friendly âÂ\x80âÂ\x80Â\x9c live!'
s = s.encode('latin-1')
s = s.decode('utf-8')
s = fix_partial_utf8_punct_in_1252(s)
s = s.encode('sloppy-windows-1252')
s = s.decode('utf-8')
print(s)

Examples

Web app source code on GitHub