Re: [xwiki-devs] [xwiki-notifications] r16999 - platform/core/trunk/xwiki-officeimporter/src/test/java/org/xwiki/officeimporter/internal/cleaner

23 Feb 2009

Hi Asiri,
On Feb 23, 2009, at 1:37 PM, asiri (SVN) wrote:
...
  Author: asiri
 Date: 2009-02-23 13:37:50 +0100 (Mon, 23 Feb 2009)
 New Revision: 16999
 Modified:
   platform/core/trunk/xwiki-officeimporter/src/test/java/org/xwiki/
 officeimporter/internal/cleaner/OpenOfficeHTMLCleanerTest.java
 Log:
 XWIKI-3259: Table headers are not handled properly
 * Added a unit test. 
[snip]
...
      /**
 +     * Test proper cleaning of {@code <th>} elements.
 +     */
 +    public void testTableHeaderItemCleaning()
 +    {
 +        // Isolated paragraph elements inside 'th' elements should
 be removed.
 +        String html =
 +            header +
"<table><thead><tr><th><p>Test</p></th></tr></
 thead><tbody><tr><td/></tr></tbody></table>"
 +                + footer;
 +        Document doc = cleaner.clean(new StringReader(html));
 +        NodeList nodes = doc.getElementsByTagName("th");
 +        Node hearderItemContent = nodes.item(0).getFirstChild();
 +        assertEquals(Node.TEXT_NODE,
 hearderItemContent.getNodeType());
 +        assertEquals("Test", hearderItemContent.getNodeValue()); 
Why is this only for th and not for td cells too?
Is this specific to the office importer? It looks very generic to me,
isn't it?
Why do paragraphs need to be removed?
What if there are 2 paragraphs elements? what happens? Do you have a
test for that too?
Thanks
-Vincent

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

Re: [xwiki-devs] [xwiki-notifications] r16999 - platform/core/trunk/xwiki-officeimporter/src/test/java/org/xwiki/officeimporter/internal/cleaner