by jason » Tue Mar 12, 2013 6:23 pm
It depends on your notion of "equality".
Are they the same if they contain the same text, ignoring formatting? This is easy to compute.
Or are they the same if they have the same text and formatting? This is harder, especially if you want to say that <w:r><w:t>The quick brown fox</w:t></w:r> is the same as <w:r><w:t>The quick/w:t></w:r><w:r><w:t> brown fox</w:t></w:r> and <w:r><w:t>The quick/w:t><w:t> brown fox</w:t></w:r>
Context can help here. Are you comparing documents created in Word, and not subsequently saved using something else (including for example, docx4j)?