WordPress Strips Classnames, And How To Fix It

WordPress 2.0 introduced some sophisticated HTML inspecting and de-linting courtesy of kses.

kses is an HTML/XHTML filter written in PHP. It removes all unwanted HTML elements and attributes, and it also does several checks on attribute values. kses can be used to avoid Cross-Site Scripting (XSS), Buffer Overflows and Denial of Service attacks.

It’s a good addition, but it was also removing the class names from some of the elements of my posts. The result is that the following structured XHTML was coming through without any structure.

<ul class=“fullrecord”>
<li class=“title”><h3>Title</h3>
<ul>
<li>The Effects Of A Modified Ball In Developing The Volleyball Pass And Set For High School Students</li>
</ul>
</li>
<li class=“attribution”>...

Without the semantic value of the classnames, the XHTML loses all the microformatting, making it not only less re-usable/remixable but also harder to style.

<ul>
<li><h3>Title</h3>
<ul>
<li>The Effects Of A Modified Ball In Developing The Volleyball Pass And Set For High School Students</li>
</ul>
</li>
<li>...

A WordPress form post pointed me to the includes/kses.php file, where the $allowedposttags array set the standards for the acceptable tags and attributes. It begins like this:

$allowedposttags = array ('address' => array (), 'a' => array ('href' => array (), 'title' => array (), 'rel' => array ()...

It’s a hack, but changing the entries for some of the tags got me through.

'ul' => array ('class' => array())

WordPress, strip tags, kses, code, fix, hack, class names, semantic markup

6 Comments

  1. Comment by Steve Lawson on May 15, 2007 5:18 pm

    So this is why WP stripped out so many classes and IDs when I switched platforms and imported my old MT posts? Good to know. Man, that pissed me off, but by the time I’d realized what happened it seemed too late to go back and try again. Besides, I had no idea how to fix it.

  2. Comment by Casey Bisson on May 16, 2007 9:42 pm

    Yeah, it’s pretty frustrating when you don’t know what’s going on. And only a little less frustrating once you do.

    Still, it helps those who don’t know what they’re doing (and those who do, but have malicious intent) from breaking things.

  3. Comment by Alpha on August 13, 2007 4:43 pm

    Wordpress should replaces their kses code with the code used in this modified version of kses — http://sourceforge.net/tracker/index.php?func=detail&aid=1752954&group_id=81853&atid=564260

    The modified kses produces better HTML, fixes bugs, allows CSS usage, etc.

  4. Comment by dieselboi on March 6, 2008 12:30 am

    woo hooo. that totally worked!

  5. Pingback by Unfilter MU’s Filters | Page 2 on October 8, 2008 12:02 am

    [...] and Flash embeds. This would be fine if it were configurable, but sadly it is not. Various hacks exist to remedy the situation, but for such a pluggable piece of software as WordPress - hacks just seem [...]

  6. Pingback by Data Unbound » How WordPress Sanitizes Tags and Attributes on October 29, 2008 3:51 pm

    [...] in my span HTML elements. To fix the problem, I followed the trail of references leading from » WordPress Strips Classnames, And How To Fix It MaisonBisson.com. I learned that WordPress uses the kses PHP library to filter HTML of possibly invalid and [...]

Comments RSS TrackBack Identifier URI

Leave a comment

 

User contributed tags for this post:

wordpress kses (31) - fix.89.com (25) - php filter (17) - fix 89 (17) - wordpress strip html (14) - FİX 89.COM (13) - wordpress allowedposttags (10) - kses wordpress (8) - Wordpress strips html (8) - wordpress strips css (8) - wordpress (8) - class (7) - wordpress strips code (7) - wordpress strip (6) - wordpress strips classes (5) - Wordpress stripping HTML (5) - fıx.89 (5) - wordpress avoid strip tag (5) - wordpress strip style tag (4) - wordpress removes html (4) - www.fix 89.net (4) - allowedposttags style (4) - allowedposttags (4) - fix.89 net (4) - wordpress kses.php (3) - wordpress stripping CSS (3) - wordpress $allowedposttags (3) - wordpress kses style (3) - php strip html attributes (3) - wordpress strip tags (3) - wordpress strips a (3) - fix 89.net (3) - kses buffer wordpress (3) - wordpress strips (3) - php strip tags hack (3) - wordpress mu html filter (2) - wordpress kses plugin (2) - wordpress rss strips html (2) - wordpress strips style (2) - wordpress xmlrpc strip tags (2) - wpmu $allowedposttags (2) - wordpress strip word markup (2) - wordpress stripping tags fix (2) - wordpress striptags (2) - kses html wordpress (2) - wordpress classes (2) - www.fix.89 (2) - wordpress mu strip html (2) - wordpress archive strips out html (2) - wordpress strip html fix bug (2) - wordpress strips  (2) - php strip attributes (2) - replacing kses.php fixed the problem (2) - xml-rpc stripping < (2) - wordpress html filter (2) - wordpress code stripping form (2) - wordpress removes IDs and classes (2) - kses.php CSS (2) - wordpress class (2) - html striping wordpress (2) - wordpress strips css classes (2) - wordpress KSES -MU (2) - wordpress post by email a href (2) - Wordpress removes attribute tag xmlrpc (2) - wordpress strips style attribute (2) - wordpress stripping classes from tags (2) - wordpress strips HTML from comments (2) - wordpress kses fix (2) - wordpress strips my HTML (2) - wordpress feed strips html (2) - wordpress strips tags (2) - wordpress plugin strip tags (2) - fix 89 net (2) - kses plugin (2) - kses (2) - word press strips tags (2) - F (2) - wordpress strips class attributes (2) - fixing duplicate tag wordpress (1) - wordpress strip rss (1) - fix wordpress xhtml (1) - good class names php (1) - wordpress xml rpc html tags getting stripped (1) - wordpress stripping classes (1) - hacking wordpress kses.php file (1) - wordpress removes my html (1) - remove html codes from wordpress post (1) - wordpress tinymce removing classes (1) - wordpress allow style atribute (1) - kses php strips style tag (1) - wordpress kses strip (1) - wordpress mu strips php (1) - wordpress removing html name (1) - yutube 89.net (1) - wordpress removes name attribute (1) - wordpress disable kses comments (1) - wordpress remove html from post (1) - kses style tag (1) - wordpress strip style attributes (1) - wordpress filter strip html styles (1) - wordpress strips