[poppler] poppler-dump
Marco
ctxspi at gmail.com
Thu Mar 13 05:07:09 PDT 2014
I have found rules in TextOutputDev.cc inside function 'void
TextPage::dump(void *outputStream, TextOutputFunc outputFunc, GBool
physLayout) { ... }' that give me right layout (for me) and code is:
...
} else {
for (flow = flows; flow; flow = flow->next) {
for (blk = flow->blocks; blk; blk = blk->next) {
for (line = blk->lines; line; line = line->next) {
n = line->len;
if (line->hyphenated && (line->next || blk->next)) {
--n;
}
s = new GooString();
dumpFragment(line->text, n, uMap, s);
(*outputFunc)(outputStream, s->getCString(), s->getLength());
delete s;
// output a newline when a hyphen is not suppressed
if (n == line->len) {
(*outputFunc)(outputStream, eol, eolLen);
}
}
}
(*outputFunc)(outputStream, eol, eolLen);
}
}
Do you know if same method can be imported for poppler-dump.cc?
Please can you explained me howto?
As you can see in my first mail problem to print in string was solved.
2014-03-13 12:30 GMT+01:00 Marco <ctxspi at gmail.com>:
> Hi Brad
>
> I think that the main problem is that poppler-cpp library cannot print pdf
> file as same mode of pdftotext (command without any layout option).
>
>
> 2014-03-13 10:20 GMT+01:00 Brad Hards <bradh at frogmouth.net>:
>
> On Thu, 13 Mar 2014 10:11:36 AM Marco wrote:
>> > I have tried it more times but I need to have in output not ustring data
>> > but string or pointer of chars.
>> >
>> > I need to have utf8 charset but not in the ustring format.
>> From cpp/poppler-global.h header:
>>
>> class POPPLER_CPP_EXPORT ustring : public std::basic_string<unsigned
>> short>
>> {
>> public:
>> ustring();
>> ustring(size_type len, value_type ch);
>> ~ustring();
>>
>> byte_array to_utf8() const;
>> std::string to_latin1() const;
>>
>> static ustring from_utf8(const char *str, int len = -1);
>> static ustring from_latin1(const std::string &str);
>> ...
>> }
>>
>>
>
>
> --
> E' meglio coltivare GNU/Linux... tanto Windows si pianta da solo!!
>
--
E' meglio coltivare GNU/Linux... tanto Windows si pianta da solo!!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler/attachments/20140313/861f4aa9/attachment-0001.html>
More information about the poppler
mailing list