I was actually thinking the size has to do with the character encoding (Unicode): text in an Asian language needs a completely different encoding, which takes more bytes per character (thus, a bigger size).
To prove this, open Notepad and type some random letters. Save the file once normally (which would be an ANSI/ASCII save), then resave it with...
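If you'd rather not click through Notepad, here is a rough Python sketch of the same kind of comparison. The sample strings, the `encoded_sizes` helper, and the choice of `cp1252` (standing in for an "ANSI" save on a Western Windows setup), `utf-8`, and `utf-16` are all my own assumptions for illustration, not necessarily what Notepad does on your machine.

```python
# Hypothetical sample strings; any text would do.
latin_text = "hello world"
asian_text = "こんにちは世界"


def encoded_sizes(text, encodings=("cp1252", "utf-8", "utf-16")):
    """Return {encoding: byte count}, or None where the text cannot be encoded."""
    sizes = {}
    for enc in encodings:
        try:
            sizes[enc] = len(text.encode(enc))
        except UnicodeEncodeError:
            # This encoding has no way to represent some of the characters.
            sizes[enc] = None
    return sizes


print(encoded_sizes(latin_text))
print(encoded_sizes(asian_text))
```

With these assumptions, the plain Latin text is the same size in `cp1252` and `utf-8` but roughly doubles in `utf-16` (which in Python also prepends a 2-byte byte order mark), while the Japanese text can't be stored in `cp1252` at all and needs several bytes per character in the Unicode encodings.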