poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From acoli...@apache.org
Subject Re: Extraction of Images from a Word Document
Date Thu, 03 Nov 2005 13:09:10 GMT
There is presenlty nothing in HWPF which implements anything for image 
extraction really.


Shantanu Chakraborty wrote:
> Hi,
> 
> I have been using the POI API for text extraction from a Microsoft Word
> Document and have managed to fulfil my requirements there successfully.
> However, I now need to use this API for image extraction from a Word
> Document. I have been given a long list of requirements w.r.t image
> extraction, and I am listing below a few of them so that I can share with
> you all the complexity that I have to address. Below are some of the
> requirements:
> 
> 1. Extract the nth image from a Word Document, where n>0.
> 2. Extract the nth image from the mth page in a Word Document, where m, n >
> 0.
> 3. Extract the image in the n th row, and m th column of the i th table on
> page j, where m,n,i,j > 0.
> 
> The second and third requirements are indeed complex, and I would like to
> keep my focus on the first for the time being.
> 
> If anyone could provide me some pointers as to which classes in POI I should
> look at to be able to come up with a solution for my first requirement or
> provide some helpful code fragment w.r.t this problem, I would really
> appreciate it.
> 
> Thanks and Regards
> Shantanu
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
> Mailing List:    http://jakarta.apache.org/site/mail2.html#poi
> The Apache Jakarta POI Project: http://jakarta.apache.org/poi/
> 
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
Mailing List:    http://jakarta.apache.org/site/mail2.html#poi
The Apache Jakarta POI Project: http://jakarta.apache.org/poi/


Mime
View raw message