Paste in some unicode text that appears to be broken and this tool will use the ftfy Python library to try and fix it.
from ftfy.fixes import fix_partial_utf8_punct_in_1252 s = "Direzione Pd, ok âÂ\x80Â\x9csenza modificheÂ\x94 all'Italicum." s = s.encode('latin-1') s = s.decode('utf-8') s = fix_partial_utf8_punct_in_1252(s) print(s)
Web app source code on GitHub