I received a ZIP file from a Japanese customer.
When I try to unzip it, the file and folder names are garbled:
$ unzip ~/Downloads/【新入荷ECM】資料.zip
...
inflating: БyРVУ№Й╫ECMБzОСЧ┐/123_ГЖБ[ГXГPБ[ГX.xlsx
What is the problem, and how can I avoid it?
The problem is that most ZIPs circulating in Japan have their file names encoded as Shift JIS, which unzip on Ubuntu does not decode correctly by default.
The solution is to pass the -O shift-jis option to unzip:
$ unzip -O shift-jis ~/Downloads/【新入荷ECM】資料.zip
This way, the extracted file and folder names are perfectly readable in Ubuntu.
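To check that Shift JIS really is the culprit, a garbled name can be round-tripped through iconv. This is only a sketch: fix_name is a hypothetical helper, and it assumes the raw bytes were displayed as CP866 (which would explain the Cyrillic letters in the output in the question) and that your iconv supports both CP866 and SHIFT_JIS.

```shell
# Hypothetical helper: recover the raw bytes of a name that was wrongly
# decoded as CP866, then decode those bytes as Shift JIS instead.
fix_name() {
    printf '%s' "$1" | iconv -f UTF-8 -t CP866 | iconv -f SHIFT_JIS -t UTF-8
}

# The last four garbled characters of the folder name from the question.
fix_name 'ОСЧ┐'; echo   # prints 資料
```

If the result is readable Japanese, the archive almost certainly stores its names as Shift JIS.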
A simple answer: use unar.
unar can automatically detect which encoding is used. It only extracts archives; it cannot create them.
If the names are still wrong after extraction, use 'convmv' to convert their encoding.
convmv also works in the other direction, if you'd like to convert file names from UTF-8 to Shift JIS for Windows.
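As a rough sketch of how these two tools could be used, assuming both are installed (they are separate packages) and that the function names below are hypothetical wrappers made up for illustration:

```shell
# Hypothetical wrappers around unar and convmv; neither tool is part of
# a default install, so both are assumed to be available already.

# Extract an archive, forcing Shift JIS for file names in case the
# automatic detection guesses wrong. Plain "unar FILE" relies on detection.
extract_sjis() {
    unar -e shift_jis "$1"
}

# Rename everything under a directory from Shift JIS names to UTF-8 names.
# convmv only prints what it would do unless --notest is given.
sjis_names_to_utf8() {
    convmv -f shift_jis -t utf8 -r --notest "$1"
}

# The reverse direction, for names that need to be Shift JIS.
utf8_names_to_sjis() {
    convmv -f utf8 -t shift_jis -r --notest "$1"
}
```

It is worth running convmv once without --notest first, since that is a dry run and shows the planned renames before anything is changed.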
Alternatively, the Ubuntu Japanese Team provides a build of 'unzip' that detects the encoding automatically, but you have to add their repository.