Hello,
I have the following docx file (content-not-allowed-in-prolog.docx), where the first char in the file is a Byte Order Mark (BOM) or Zero-width no-break space (U+FEFF).
That's why doxc4j using the SAXParser cannot parse it.
If a docx file starts with a special character, such as a BOM, the following code cannot parse it correctly and the execution lands in the last else (see "Assuming Flat OPC XML").
Usually, such docx files are encoded UTF-8-BOM, ...