files:encoding

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
files:encoding [2018/01/10 19:41] – external edit 127.0.0.1files:encoding [2018/10/04 17:14] (current) – external edit 127.0.0.1
Line 3: Line 3:
 ===== Preconditions ===== ===== Preconditions =====
  
-  - The file can have one encoding (same as code page). Encoding can be as Unicode ( UTF16 LE, BE (1200, 1201), UTF8 (65000) ) as not Unicode (for example 252 (Western European) etc). +  - The file can have one encoding (same as code page). Encoding can be Unicode ( UTF16 LE, BE (1200, 1201), UTF8 (65000) ) and not Unicode (for example 252 (Western European) etc). 
   - There are several places, where encoding conversion can be applied to document: Open, Save As, New, Search and Replace   - There are several places, where encoding conversion can be applied to document: Open, Save As, New, Search and Replace
-  - The encoding can be selected/changed inFile Open/Save dialog, via context menu or status bar, in Project Settings, in Tools->Options->Document settings, in syntax specification (here you can set as preferred encoding, as forced encoding). In addition to this HippoEDIT does an auto detection of the encoding using different algorithms (Check BOM bytes, statistics test for UTF16 LE/BE, statistics test for UTF8, check by encoding strings and same checks as IE uses).  +  - The encoding can be selected/changed in File Open/Save dialog, via context menu or status bar, in Project Settings, in Tools->Options->Document settings, in syntax specification (here you can set as preferred encoding, as forced encoding). In addition to this HippoEDIT does an auto-detection of the encoding using different algorithms (Check BOM bytes, statistics test for UTF16 LE/BE, statistics test for UTF8, check by encoding strings and same checks as IE uses).  
-  - If encoding for document once changed by the user, this preference has priority over all the rest of settings. Preferences are machine specific but can be reset, if HippoEDIT temp files would be deleted or format of them would change in new version.+  - If encoding for document once changed by the user, this preference has priority over all the rest of settings. Preferences are machine specific but can be reset, if HippoEDIT temp files would be deleted or format of them would change in the new version.
  
 So, how all this works together (or designed to work  ) : So, how all this works together (or designed to work  ) :
Line 32: Line 32:
   * Encoding selected in File Save dialog   * Encoding selected in File Save dialog
   * Current document encoding   * Current document encoding
-  * During save, HippoEDIT checks the consistency of current document encoding and encoding found with encoding strings (XML, HTML etc). If encoding does not match, user would be asked to select which encoding to use +  * During save, HippoEDIT checks the consistency of current document encoding and encoding found with encoding strings (XML, HTML etc). If encoding does not match, the user would be asked to select which encoding to use 
-  * Because HippoEDIT internally works with Unicode representation of text (UTF16 LE), on save, can happen that current text could not be saved without loss of information with currently selected encoding. In this case, HippoEDIT should pop-up a warning, informing the user about possible data loss and suggest to save the document as Unicode or using some another encoding. This behavior controlled by flag Check encoding accuracy in Tools->Options->Formatting+  * Because HippoEDIT internally works with Unicode representation of text (UTF16 LE), on saving, can happen that current text could not be saved without loss of information with currently selected encoding. In this case, HippoEDIT should pop-up a warning, informing the user about possible data loss and suggest to save the document as Unicode or using some another encoding. This behavior controlled by flag Check encoding accuracy in Tools->Options->Formatting
  
 ===== Search and Replace ===== ===== Search and Replace =====
-Search and Replace encoding uses same logic as for Open/Save file, just interactive selection of encoding, with Open/Save dialog, not available.+Search and Replace encoding uses same logic as for Open/Save file, but interactive selection of encoding, with Open/Save dialog, not available.
  
 ===== If there are problems ===== ===== If there are problems =====
Line 49: Line 49:
 </code> </code>
  
-Also from now on, extended encoding detection is enabled by default only for syntaxes inherited from deftext (as Plain Text, XML, and HTML).+Also from now on, extended encoding detection is enabled by default only for syntaxes inherited from //deftext// (as Plain Text, XML, and HTML).
  
 You can control encoding even in more granular way by disabling some encoding detection methods, which in most cases do not provide false positives. As: You can control encoding even in more granular way by disabling some encoding detection methods, which in most cases do not provide false positives. As:
Line 91: Line 91:
  
 Doing of changes to [[terms:syntax-schema|syntax schema]] files, please keep in mind that default syntax schemes can be overwritten on update (but modified [[terms:syntax-schema|syntax schema]] will be copied to *.old name).  Doing of changes to [[terms:syntax-schema|syntax schema]] files, please keep in mind that default syntax schemes can be overwritten on update (but modified [[terms:syntax-schema|syntax schema]] will be copied to *.old name). 
-Safest way here will be to create your own [[terms:syntax-schema|syntax schema]], inheriting from default, and overwriting of the settings you do not like.+The safest way here will be to create your own [[terms:syntax-schema|syntax schema]], inheriting from default, and overwriting of the settings you do not like.