Content model

TopLeaf always applies the default content model when capturing scanned content. This means that:

  • If a child element within the scanned content declares an xml:space attribute, TopLeaf will use that declaration to determine the content model when scanning the content of the child element.

  • If the partition specifies a DTD, then TopLeaf will use the DTD to determine the content model when scanning the content of child elements found within the scan.

  • If the partition does not specify a DTD, then TopLeaf assumes a mixed content model when scanning the content of child elements found within the scan. Each run of consecutive whitespace characters is normalised to a single space character.
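The whitespace normalisation described for the mixed content model can be illustrated with a short Python sketch. This is not TopLeaf code: the helper name normalise_whitespace is hypothetical, and the sketch assumes that "whitespace" means the four XML whitespace characters (space, tab, carriage return and line feed).

```python
import re


# Hypothetical helper, not part of TopLeaf: collapse each run of XML
# whitespace (space, tab, CR, LF) into a single space character.
def normalise_whitespace(text: str) -> str:
    return re.sub(r"[ \t\r\n]+", " ", text)


scanned = "First  line\n    continues\there"
print(normalise_whitespace(scanned))  # prints: First line continues here
```

Note that the sketch only collapses runs of whitespace. It does not trim leading or trailing whitespace, since the text above does not say whether TopLeaf does so.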

Scanned content that must be processed in Preserve Space mode should set its xml:space attribute to preserve, that is, xml:space="preserve".
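As a rough illustration of the effect of xml:space="preserve", the following Python sketch models the case where no DTD is specified. It is an approximation under stated assumptions, not TopLeaf's implementation: the element names para and code and the function scan_text are hypothetical, and the sketch assumes that an xml:space declaration is inherited by descendant elements until overridden, as the XML Recommendation describes.

```python
import re
import xml.etree.ElementTree as ET

# ElementTree exposes xml:space under the reserved XML namespace URI.
XML_SPACE = "{http://www.w3.org/XML/1998/namespace}space"


def _apply(text: str, preserve: bool) -> str:
    # Keep text untouched in preserve mode; otherwise collapse whitespace runs.
    return text if preserve else re.sub(r"[ \t\r\n]+", " ", text)


# Hypothetical model of a scan: returns the text of elem and its descendants,
# honouring the xml:space value that is in effect for each piece of text.
def scan_text(elem: ET.Element, preserve: bool = False) -> str:
    mode = elem.get(XML_SPACE)
    if mode is not None:
        preserve = mode == "preserve"
    parts = [_apply(elem.text or "", preserve)]
    for child in elem:
        parts.append(scan_text(child, preserve))
        # Text following a child belongs to this element, so use its mode.
        parts.append(_apply(child.tail or "", preserve))
    return "".join(parts)


fragment = ET.fromstring(
    "<para>Run   of   spaces "
    '<code xml:space="preserve">x  =  1</code>'
    "  after</para>"
)
print(scan_text(fragment))  # prints: Run of spaces x  =  1 after
```

In this sketch the spacing inside the code element survives unchanged, while the surrounding text in para has each run of whitespace reduced to a single space.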