[poppler] poppler-dump

Marco ctxspi at gmail.com
Thu Mar 13 09:32:30 PDT 2014


Solved.

I used the -bbox option of pdftotext.

Thanks to all.


2014-03-13 13:07 GMT+01:00 Marco <ctxspi at gmail.com>:

> I have found the logic in TextOutputDev.cc, inside the function 'void
> TextPage::dump(void *outputStream, TextOutputFunc outputFunc, GBool
> physLayout) { ... }', that gives the layout I want. The code is:
>
> ...
> } else {
>     for (flow = flows; flow; flow = flow->next) {
>       for (blk = flow->blocks; blk; blk = blk->next) {
>         for (line = blk->lines; line; line = line->next) {
>           n = line->len;
>           if (line->hyphenated && (line->next || blk->next)) {
>             --n;
>           }
>           s = new GooString();
>           dumpFragment(line->text, n, uMap, s);
>           (*outputFunc)(outputStream, s->getCString(), s->getLength());
>           delete s;
>           // output a newline when a hyphen is not suppressed
>           if (n == line->len) {
>             (*outputFunc)(outputStream, eol, eolLen);
>           }
>         }
>       }
>       (*outputFunc)(outputStream, eol, eolLen);
>     }
>   }
>
> Do you know if the same method can be ported to poppler-dump.cc?
>
> Could you please explain to me how to do it?
>
> As you can see from my first mail, the problem of printing to a string
> is solved.
>
>
> 2014-03-13 12:30 GMT+01:00 Marco <ctxspi at gmail.com>:
>
> Hi Brad,
>>
>> I think the main problem is that the poppler-cpp library cannot output
>> the text of a PDF file the same way pdftotext does (the command without
>> any layout option).
>>
>>
>> 2014-03-13 10:20 GMT+01:00 Brad Hards <bradh at frogmouth.net>:
>>
>> On Thu, 13 Mar 2014 10:11:36 AM Marco wrote:
>>> > I have tried it several times, but I need the output as a string or
>>> > a pointer to chars, not as ustring data.
>>> >
>>> > I need UTF-8 output, but not in the ustring format.
>>> From cpp/poppler-global.h header:
>>>
>>> class POPPLER_CPP_EXPORT ustring : public std::basic_string<unsigned short>
>>> {
>>> public:
>>>     ustring();
>>>     ustring(size_type len, value_type ch);
>>>     ~ustring();
>>>
>>>     byte_array to_utf8() const;
>>>     std::string to_latin1() const;
>>>
>>>     static ustring from_utf8(const char *str, int len = -1);
>>>     static ustring from_latin1(const std::string &str);
>>> ...
>>> }
>>>
>>>

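For reference, the ustring question from the quoted thread (getting a plain string or char pointer instead of ustring data) can be sketched as follows. This is only a minimal sketch: it assumes that byte_array is a std::vector<char>, as declared in cpp/poppler-global.h, and it uses a local stand-in alias so that it compiles without poppler. The helper name bytes_to_string is hypothetical and not part of poppler.

```cpp
#include <string>
#include <vector>

// Stand-in for poppler::byte_array (assumed to be std::vector<char>).
// With the real library, include <poppler-global.h> and use
// poppler::byte_array instead.
using byte_array = std::vector<char>;

// Hypothetical helper: copy the UTF-8 bytes returned by
// ustring::to_utf8() into a std::string.
std::string bytes_to_string(const byte_array &bytes) {
    return std::string(bytes.begin(), bytes.end());
}
```

With poppler-cpp this would presumably be called as bytes_to_string(some_ustring.to_utf8()), and c_str() on the result gives the pointer to chars.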

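The hyphen handling in the quoted TextPage::dump() loop can also be illustrated on its own. The sketch below is a simplified assumption, not poppler code: the Line struct and join_lines function are hypothetical stand-ins, only a single block is modelled (the original also checks blk->next), and the text is treated as bytes with a one-byte trailing hyphen rather than as Unicode characters.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for poppler's TextLine: only the
// text and the hyphenation flag are modelled.
struct Line {
    std::string text;
    bool hyphenated;
};

// Join the lines of one block the way the quoted loop does: drop a
// trailing hyphen when another line follows, and emit a newline only
// when no hyphen was dropped.
std::string join_lines(const std::vector<Line> &lines,
                       const std::string &eol = "\n") {
    std::string out;
    for (std::size_t i = 0; i < lines.size(); ++i) {
        const Line &line = lines[i];
        std::size_t n = line.text.size();
        const bool has_next = i + 1 < lines.size();
        if (line.hyphenated && has_next && n > 0) {
            --n;  // suppress the trailing hyphen
        }
        out.append(line.text, 0, n);
        // output a newline when a hyphen is not suppressed
        if (n == line.text.size()) {
            out += eol;
        }
    }
    return out;
}
```

A hyphenated line followed by another line is joined to it without a line break, while a hyphenated last line keeps its hyphen and gets a newline.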

-- 
It is better to cultivate GNU/Linux... Windows crashes all by itself anyway!!