poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 47687] New: Is there any limitation at size of the MS Office document to extract using POI library?
Date Wed, 12 Aug 2009 13:18:54 GMT

           Summary: Is there any limitation at size of the MS Office
                    document to extract using POI library?
           Product: POI
           Version: 3.2-FINAL
          Platform: PC
        OS/Version: Windows XP
            Status: NEW
          Severity: normal
          Priority: P2
         Component: POI Overall
        AssignedTo: dev@poi.apache.org
        ReportedBy: ursbijju@gmail.com
                CC: ursbijju@gmail.com

--- Comment #0 from Bijju <ursbijju@gmail.com> 2009-08-12 06:18:52 PDT ---
We have been extracting many office documents successfully using POI 3.2. But
for a specific document of huge size >19MB file was not able to extract. 

But in practical scenarios we will ave more than 500MB documents also (in fact
no restriction at that). And technically, as POI is a Java library, size should
not be a concern while getting the handle of the document. I am using event
driven logic for document extraction.

But i have noticed, when document size is reduced POI extracts, if not fails.
Any reason for this? Am i missing any basic technical point here?

Also, POI treats HTML content of word document as another document than of
simple text. Need to check more on this. If this is yes, pls. let me know what
would be the reason for this?

Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org

View raw message