Parse Word (.doc / .docx) document

Solution 1

The following question covers general Word document parsing: How to parse text from MS Word document to string. This MSDN page explains how to handle pictures in the document: https://msdn.microsoft.com/en-us/library/office/hh965732(v=office.14).aspx

Post your code if you need help with something specific.

Solution 2

You can do it with the Open XML SDK.
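As a rough sketch of what that could look like for the chapter / picture / text layout in the question: the code below walks the top-level paragraphs of the body in order and, for each one, collects its text and any embedded pictures. It assumes the DocumentFormat.OpenXml NuGet package is referenced. `DocxReader` and `ReadBlocks` are made-up names for illustration, not part of the SDK. Note that the Open XML SDK only reads .docx; a legacy binary .doc would have to be converted first. Tables and linked (non-embedded) images are skipped here to keep it short.

```csharp
using System.Collections.Generic;
using System.IO;
using System.Linq;
using DocumentFormat.OpenXml.Packaging;
using DocumentFormat.OpenXml.Wordprocessing;
using A = DocumentFormat.OpenXml.Drawing;

// Hypothetical helper class, named for this example only.
public static class DocxReader
{
    // Returns text and picture entries in the order they appear in the body.
    // For "text" entries Image is null; for "picture" entries Text holds the
    // image content type (e.g. image/png) and Image holds the raw bytes.
    public static List<(string Kind, string Text, byte[] Image)> ReadBlocks(Stream docx)
    {
        var blocks = new List<(string Kind, string Text, byte[] Image)>();

        // Open read-only; the SDK understands .docx packages, not binary .doc.
        using (var doc = WordprocessingDocument.Open(docx, false))
        {
            MainDocumentPart main = doc.MainDocumentPart;

            // Top-level paragraphs only; paragraphs inside tables are not visited.
            foreach (Paragraph p in main.Document.Body.Elements<Paragraph>())
            {
                // Concatenate the text runs of this paragraph.
                string text = string.Concat(p.Descendants<Text>().Select(t => t.Text));
                if (!string.IsNullOrWhiteSpace(text))
                    blocks.Add(("text", text, null));

                // Each embedded picture is referenced by a Blip whose Embed
                // attribute is the relationship id of an image part.
                foreach (A.Blip blip in p.Descendants<A.Blip>())
                {
                    string relId = blip.Embed?.Value;
                    if (relId == null) continue; // linked image, not embedded

                    var imagePart = main.GetPartById(relId) as ImagePart;
                    if (imagePart == null) continue;

                    using (Stream source = imagePart.GetStream())
                    using (var buffer = new MemoryStream())
                    {
                        source.CopyTo(buffer);
                        blocks.Add(("picture", imagePart.ContentType, buffer.ToArray()));
                    }
                }
            }
        }
        return blocks;
    }
}
```

In an ASP.NET handler you could pass the uploaded file's stream to `DocxReader.ReadBlocks` and then loop over the result, writing text entries as they are and saving or rendering the `Image` bytes for picture entries.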

Author by

President Camacho

Updated on June 04, 2022

Comments

  • President Camacho, almost 2 years ago

    What is the best way to parse a .doc / .docx file in C# / ASP.NET?

    My document looks like this:

    (repeater)
    
    chapter(text)
    
    picture
    
    text
    
    (/repeater)
    

    It is very important that the parser reads the picture.