
In this session, we discuss some miscellaneous topics mainly concerned with what is and isn't legal content in XML documents, and also ways to circumvent these rules using special element constructs. The topics we cover are:
We will illustrate these concepts with an example, "The King's Speech", based on the multi-Oscar winning 2011 British Film of the year. This movie addressed King George VI's speech problem, and its correction by a speech therapist. Let us now see the King in modern times, with the BBC making a recording of his World War III speech to motivate the British Army troops stationed across the world. Here is how the BBC's state-of-the-art speech recording and filtering software would go about its task, while choosing XML to be the output format:
Stammering speech is stored as comments in the XML document. (This content is useful for creating statistics and studying them.) Long periods of silence (no noise) are treated as 1 white-space per second of silence. Special characters display mathematical statistics. Recording noise is also identified and isolated, but maintained in the document as special non-processed data, for the tool's own logging and improvement. Here is the document example:
<?xml version="1.0" encoding="UTF-8" ?>
<king-speech>
<!%u2014line> duh-duh-duh (This is Stammering) duh-duh </line -- >
<line>The time has come for the British to stand up for the rest of Humanity</line>
<line> </line>
<![CDATA[*****ALL RECORDING NOISE**********]]>
<!%u2014line>The rest of the speech</line -- >
<statistics>Stammer Percentage < 30 per cent</statistics>
<statistics>Noise Correction < 45 per cent</statistics>
</king-speech>
Here is how we map the syntax used in this document to the actual requirements, stated just preceding it:
The main point is that XML understands and stores only character sequence data, and has a few restricted keywords. But it also provides constructs to work around these limitations and syntactic rules, as we saw in the document example above.
Have a question but don't want to ask it publicly? If you are a prime member just ask right here and we will get back to you within 48 hours. Not prime yet? no worries you can ask publicly below.