Add ability to control number of byte in a page summary, fix a bug in char encoding detection, a=chris