PHP Issues With Decoding Strange Smart Quotes And Non-Standard ASCII Characters

When using strip_tags and html_entity_decode with PHP it often breaks and produces annoying diamonds with question marks.

It is probably because of characters like these:

… (looks like 3 dots but it is a single weird character).

’ (looks like a normal apostraphe but it is not)

” (looks like a normal double quote but it is not).

An easy way to sort this out is to copy the above and search in an ASCII table to extend the functionality below.

Basically the first array has a list of "all the bad stuff" and the second array is what to replace it with.

<?


function CleanupSmartQuotes($text)
    {
        $badwordchars=array(
                            chr(145), // weird apostraphe
                            chr(146), // weird apostraphe
                            chr(147), // weird quote
                            chr(148), // weird quote
                            chr(151), // weird dash
                            chr(133), // three dots
                            '&nbsp;',
                            );
        $fixedwordchars=array(
                            "'",
                            "'",
                            '&quot;',
                            '&quot;',
                            '&mdash;',
                            '...',
                            '',
                            );
        return str_replace($badwordchars,$fixedwordchars,$text);
    }

?>


Tags:

php, decoding, quotes, ascii, characterswhen, strip_tags, html_entity_decode, produces, characters, hellip, dots, rsquo, apostraphe, rdquo, extend, functionality, array, quot, cleanupsmartquotes, text, badwordchars, chr, dash, fixedwordchars, mdash, str_replace,

Latest Articles

  • FreePBX 17 How To Add a Trunk
  • Docker Container Onboot Policy - How to make sure a container is always running
  • FreePBX 17 How To Add Phones / Extensions and Register
  • Warning: The driver descriptor says the physical block size is 2048 bytes, but Linux says it is 512 bytes. solution
  • Cisco How To Use a Third Party SIP Phone (eg. Avaya, 3CX)
  • Cisco Unified Communication Manager (CUCM) - How To Add Phones
  • pptp / pptpd not working in DD-WRT iptables / router
  • systemd-journald high memory usage solution
  • How to Install FreePBX 17 in Linux Debian Ubuntu Mint Guide
  • How To Install Cisco's CUCM (Cisco Unified Communication Manager) 12 Guide
  • Linux Ubuntu Redhat How To Extract Images from PDF
  • Linux and Windows Dual Boot Issue NIC Won't work After Booting Windows
  • Cisco CME How To Enable ACD hunt groups
  • How to install gns3 on Linux Ubuntu Mint
  • How to convert audio for Asterisk .wav format
  • Using Cisco CME Router with Asterisk as a dial-peer
  • Cisco CME How To Configure SIP Trunk VOIP
  • Virtualbox host Only Network Error Failed to save host network interface parameter - Cannot change gateway IP of host only network
  • Cisco CME and C7200 Router Testing and Learning Environment on Ubuntu 20+ Setup Tutorial Guide
  • Abusive IP ranges blacklist