This lightweight tool converts non-UTF-encoded (such as GB2312, GBK, BIG5 encoded) files to UTF-8 encoded files. It can either be executed from command line (CLI), or imported into other Python code.
This repository contains a notebook demonstrating how to detect and convert character encodings of files using Python. The notebook provides functions to identify file encodings and convert them to ...