I have an application that converts Word documents to DITA, and in the process it scans each paragraph for illegal characters. The original documents include left and right quotes, as well as regular double quotes, angle brackets, etc.
My process converts the standard five characters to their predefined XML entities: &amp;, &lt;, &gt;, &quot;, and &apos;.
For other characters, I convert them to their ASCII code, wrapped in the "&#" and ";" characters, so left double quote becomes "“" and right double quote becomes "”", etc., but all I get in my output is "#".
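To make the intended conversion concrete, here is a minimal Python sketch of it. This is an illustration only, not my actual code: the function name `to_xml_text` is hypothetical, and it assumes the number placed between "&#" and ";" is the character's decimal Unicode code point (8220 and 8221 for the left and right double quotes).

```python
from xml.sax.saxutils import escape


def to_xml_text(text: str) -> str:
    """Escape the five predefined XML characters, then replace any
    non-ASCII character with a decimal numeric character reference."""
    # escape() handles &, < and > by default; the extra mapping adds
    # the two quote characters.
    escaped = escape(text, {'"': "&quot;", "'": "&apos;"})
    # Assumption: the reference uses the Unicode code point, e.g.
    # U+201C (left double quote) becomes &#8220;.
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in escaped)


print(to_xml_text("\u201cHi\u201d <b> & \"x\""))
```

With this sketch, the curly quotes come out as "&#8220;" and "&#8221;", and the five standard characters come out as their named entities.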
Are only the previous five allowed, or have I misunderstood how to escape the other typable but illegal characters?