change file encoding

12,468

Solution 1

You may need to change the content type header that your web server sends the client.

Edit: While this did work for this particular situation, using a tool to change the file encoding as suggested by other posters may be a better solution in other situations. YMMV.


Instructions for saving as UTF-8 in Eclipse (which I realize you already have):

You should probably change the Default Encoding in your workspace for the HTML document.

This is for Eclipse 3.4. If you have a different version, this may be slightly different.

Goto Window->Preferences
In the Preferences window goto General->Content Types
At this point, you can specify a 'Default Encoding' for files near the bottom of the preferences window. Expand 'Text' and select HTML. In the 'Default Encoding' entry, put UTF-8. Then click 'update' at the right.

After this, all HTML files should be saved in UTF-8 format.

Solution 2

The problem with UTF-8 is that there is no magic byte sequence at the beginning of these files - the browser's only chance to detect UTF-8 is either by the XML declaration, HTML meta tags, or some heuristics as fallback.

Make sure that there is either an XML encoding declaration or some HTML meta tags in the header of the HTML.

<?xml version="1.0" encoding="utf-8"?>

just below DOCTYPE if it's XHTML, or

<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

in the head section.

Solution 3

You can use iconv to convert files from one character encoding to another.

Solution 4

Try adding

<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

to the head section of your html files, or ensure that your server is serving the files with a Content-Type http header. Without either of these, the browser can only guess at the character encoding.

Solution 5

In Eclipse 3.7, go to:

Windows > Preferences > General > Workspace

Under "Text file encoding" set the file encoding you need.

Share:
12,468
Dónal
Author by

Dónal

I earn a living by editing text files. I can be contacted at: [email protected] You can find out about all the different kinds of text files I've edited at: My StackOverflow Careers profile

Updated on June 04, 2022

Comments

  • Dónal
    Dónal almost 2 years

    I have a problem with character encoding in some HTML pages. It seems that the cause of the problem is that some of the .html files are not saved as UTF-8 encoded files. Even though I have instructed Eclipse to save these files as UTF-8, when I open them in a browser, it indicates that the files are ISO-8859-1.

    How can I change the encoding of these files to UTF-8?

    UPDATE: I already have the following included in the section of each webpage

    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
    

    I am using the Apache web server.

    Thanks, Donal