If the paragraph element or any of its child elements contains white-space characters, they are collapsed. Leading white-space characters at the paragraph start as well as trailing white-space characters at the paragraph end are ignored. In detail, the following conversions take place:
The following UNICODE characters are normalized to a SPACE character:
In addition, these characters are ignored if the preceding character is a white-space character. The preceding character can be contained in the same element, in the parent element, or in the preceding sibling element, as long as it is contained within the same paragraph element and the element in which it is contained processes white-space characters as described above. Whitespace characters at the start or end of the paragraph are ignored, regardless whether they are contained in the paragraph element itself, or in a child element in which white-space characters are collapsed as described above.
These white-space processing rules shall enable authors to use white-space characters to improve the readability of the XML source of an OpenDocument document in the same way as they can use them in HTML. In other words they are processed in the same way that HTML4 processes them.
In general, consecutive white-space characters in a paragraph are collapsed. For this reason, there is a special class Spaces used to represent the UNICODE character SPACE (0x0020).
This element is required to represent the second and all following SPACE characters in a sequence of SPACE characters.
The LineBreak class represents a line break in a heading or paragraph.
The SoftPageBreak class represents a soft page break within a heading or paragraph.
Soft hyphens, hyphens, and non-breaking blanks are represented by UNICODE characters.
The UNICODE character | Represents |
---|---|
SOFT HYPHEN (00AD) | soft hyphens |
NON-BREAKING HYPHEN (2011) | non-breaking hyphens |
NO-BREAK SPACE (00A0) | non-breaking blanks |