Jason,
Does the issue of character encoding need to be brought up here for when you are loading a WordprocessingMLpackage?
I have a bunch of strings that have been saved into my java program that were scavenged from a word document. I am trying to do some date parsing on some of thee strings, and characters such as "En Dash" are really giving me trouble. I can't seem to figure out how to do the conversion correctly in my java program to parse a string in my word document that looks like "08/2005 - present" and in my java program looks like "08/2005 (garbage) present"
Any tips on how to handle this?