Cards on the table first of all: I'm not a Java developer. I'm a baffled C developer pretending to know what I'm doing, so even if something seems obvious to you, it won't be to me.
I have successfully used POI to make a DOC to HTML converter, and now I need a DOCX to HTML converter with a decent degree of accuracy in the formatting. I think that docx4j might be ...