Source for file WordDoc.php
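<?php
/**
 * $Id: WordDoc.php,v 1.4 2004/06/02 14:33:38 hfuecks Exp $
 * Shows HTMLSax parsing Word generated HTML
 */
require_once('XML/HTMLSax3.php');

// HTMLSax3 invokes handlers as methods of the object passed to set_object();
// the class name used here is illustrative.
class MyHandler {
    // Escape handler: echoes each escape sequence inside its own <pre> block
    function escape($parser, $data) {
        echo '<pre>' . $data . "\n\n\n</pre>";
    }
}

// Instantiate the parser
$parser = new XML_HTMLSax3();
$handler = new MyHandler();
$parser->set_object($handler);
$parser->set_escape_handler('escape');
if (isset($_GET['strip_escapes'])) {
    $parser->set_option('XML_OPTION_STRIP_ESCAPES');
}
?>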
<h1>Parsing Word Documents</h1>
<p>Shows HTMLSax parsing a simple Word-generated HTML document and the impact of the option 'XML_OPTION_STRIP_ESCAPES', which can be set like this:</p>
<pre>
$parser->set_option('XML_OPTION_STRIP_ESCAPES');
</pre>
<p>Word generates some strange XML / HTML escape sequences like &lt;![endif]&gt;, which HTMLSax handles correctly as of version 3.0.0.</p>
<p>
<a href="<?php echo htmlspecialchars($_SERVER['PHP_SELF']); ?>">XML_OPTION_STRIP_ESCAPES = 0</a> :
<a href="<?php echo htmlspecialchars($_SERVER['PHP_SELF']); ?>?strip_escapes=1">XML_OPTION_STRIP_ESCAPES = 1</a>
</p>
<p>Starting to parse...</p>
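The effect of 'XML_OPTION_STRIP_ESCAPES' can be pictured with a small standalone sketch. This is not the library's code. It is a rough approximation that assumes the escape handler receives the text between `<!` and `>`, and that the option removes the `--` markers of comments and the wrapper of CDATA sections while leaving other escapes, such as Word's `[endif]`, untouched. The function name `strip_escape_markers` is hypothetical.

```php
<?php
// Hypothetical helper, NOT part of XML_HTMLSax3: approximates the data an
// escape handler might receive once escape markers are stripped.
// $data is assumed to be the text between "<!" and ">" of an escape.
function strip_escape_markers($data) {
    if (substr($data, 0, 2) === '--') {
        // HTML comment: drop the leading and trailing "--"
        return preg_replace(array('/^--/', '/--$/'), '', $data);
    }
    if (substr($data, 0, 7) === '[CDATA[') {
        // CDATA section: drop the opening "[CDATA[" and the closing "]]"
        return preg_replace(array('/^\[CDATA\[/', '/\]\]$/'), '', $data);
    }
    // Anything else (for example Word's "[endif]") is passed through unchanged
    return $data;
}

echo strip_escape_markers('-- a comment --'), "\n";
echo strip_escape_markers('[CDATA[x < y]]'), "\n";
echo strip_escape_markers('[endif]'), "\n";
```

Comparing the output with and without such stripping mirrors what the two links in the example page toggle through the `strip_escapes` query parameter.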
Documentation generated on Mon, 11 Mar 2019 15:11:50 -0400 by phpDocumentor 1.4.4. Copyright © PHP Group 2004.