I had to manually clean up HTML generated from an MS Word document. Honestly, it is a pain to search for and replace by hand all the junk that Word generates. So I wrote a text conversion function in PHP that automatically cleans up the MS Word junk and outputs HTML entities.