poi-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject [Bug 53556] New: Mispositioned Textboxes In Reading Doc Files Through HWPF
Date Tue, 17 Jul 2012 07:40:03 GMT
https://issues.apache.org/bugzilla/show_bug.cgi?id=53556

          Priority: P2
            Bug ID: 53556
          Assignee: dev@poi.apache.org
           Summary: Mispositioned Textboxes In Reading Doc Files Through
                    HWPF
          Severity: major
    Classification: Unclassified
                OS: Linux
          Reporter: vipulucky93@gmail.com
          Hardware: PC
            Status: NEW
           Version: 3.8
         Component: HWPF
           Product: POI

Created attachment 29070
  --> https://issues.apache.org/bugzilla/attachment.cgi?id=29070&action=edit
This is the document which i was unable to read properly.

I tried reading doc and docx files using Apache POI 3.8. It worked fine until i
encountered textboxes.

If the format of the document is like this: 
paragraph 1 
textbox 1 
paragraph 2 
textbox 2 
paragraph 3 

Then the output should be: 
paragraph 1 textbox 1 paragraph 2 textbox 2 paragraph 3 
But HWPF reads such .doc file as: 
paragraph 1 paragraph 2 paragraph 3 textbox 1 textbox 2 

It seems to be adding textboxes at the end and not at the place where it should
be, i.e. between the paragraphs.

In case of .docx files, XWPF didn't read textboxes at all.

I tried methods getText(), getTextFromPieces(), extractText(),
getParagraphText(), but none of these helped.

-- 
You are receiving this mail because:
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


Mime
View raw message