How to Remove Accented Characters With the UTF-8 Utility

UTF-8, USC Transformation Format 8-bit, is a character encoding format for Unicode, a text based system. The format is used for displaying written words with html code for websites. Creating webpages with UTF-8 can be done in most text editors and browser supported editors. Links from webpages that are not UTF-8 supported can be misrepresented by accented characters. To remove accented characters you will need to add in replacement commands for each accent.

Instructions

    • 1

      Load your html document in a plain text editing program.

    • 2

      Locate a functional space, in the heading of the document, to insert four lines of code.

    • 3

      Type in this string of code:

      $title = "Trípode G5";
      $search = explode(",","ç,æ,œ,á,é,í,ó,ú,à,è,ì,ò,ù,ä,ë,ï,ö,ü,ÿ,â,ê,î,ô,û,å,e,i,ø,u");
      $replace = explode(",","c,ae,oe,a,e,i,o,u,a,e,i,o,u,a,e,i,o,u,y,a,e,i,o,u,a,e,i,o,u");
      $urlTitle = str_replace($search, $replace, $title);

      The code sets up each accent character as a variable to be replaced by an unaccented character throughout the html.

    • 4

      Press "File" and select "Save" to keep the changes.

    • 5

      Load your webpage and the accented characters should now be displayed in their unaccented form.

Related Searches:

References

Comments

Related Ads

Featured