cancel
Showing results for 
Search instead for 
Did you mean: 

Extracting Data from MSWord - using Java

Former Member
0 Kudos

Hi All,

In my scenario, I have to write a java program, which <b>extracts the data from a MSWord file.</b>

Can you help me over this.

Thanks,

Prabhakar.

Accepted Solutions (1)

Accepted Solutions (1)

Former Member
0 Kudos

Hi,

This could help you to start

http://blogs.msdn.com/dotnetinterop/archive/2005/03/29/java-and-ms-word.aspx

Regards

Ayyapparaj

Former Member
0 Kudos

Thank you Armin and Ayyapparaj.

I will check your links and get back to you.

Thanks,

Prabhakar.

Answers (1)

Answers (1)

Former Member
0 Kudos

May be <a href="http://poi.apache.org/hwpf/index.html">this</a> could help you.

Armin

Former Member
0 Kudos

Hi,

apache POI is working, and I have extracted the <b>text</b> in a word document(MSWord <b>2003</b>).

But it is not working for MS Word <b>2007</b>. ( The client may use any MS Word format)

I have an other option,

<b>Word Document -


(Convert)-----> XML Document -


(XML Parser)--


> Data(Text, images,etc,...)</b>

I request suggestions and idea on this.

Thanks,

Prabhakar.