Paste in some Unicode text that appears to be broken, and this tool will use the ftfy Python library to try to fix it.
import ftfy.bad_codecs  # enables sloppy- codecs
from ftfy.fixes import fix_partial_utf8_punct_in_1252

s = 'Arsenal v Wolfsburg: pre-season friendly âÂ\x80âÂ\x80Â\x9c live!'
s = s.encode('latin-1')
s = s.decode('utf-8')
s = fix_partial_utf8_punct_in_1252(s)
s = s.encode('sloppy-windows-1252')
s = s.decode('utf-8')
print(s)
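Each encode and decode pair in the steps above reverses one layer of wrong decoding. As a minimal sketch of that idea, using only Python's standard library rather than ftfy itself, the following breaks a string on purpose and then repairs it. The helper names `mangle` and `unmangle` and the sample text are illustrative assumptions, not part of ftfy.

```python
def mangle(text: str) -> str:
    """Simulate the breakage: UTF-8 bytes misread as Windows-1252."""
    return text.encode("utf-8").decode("windows-1252")


def unmangle(text: str) -> str:
    """Reverse one layer of breakage by undoing the two steps in order."""
    return text.encode("windows-1252").decode("utf-8")


# Hypothetical sample text; \u2013 is an en dash.
original = "pre-season friendly \u2013 live!"

broken = mangle(original)
repaired = unmangle(broken)

print(broken)    # pre-season friendly â€“ live!
print(repaired)  # pre-season friendly – live!
```

Python's strict `windows-1252` codec leaves a few bytes (such as 0x81 and 0x9D) undefined, so this simple round trip raises an error on some text; that gap is what the sloppy- codecs enabled by `ftfy.bad_codecs` are meant to cover.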
Web app source code on GitHub