[poppler] "Addition of stream::getChars" patch breaking lots of PDFs

Albert Astals Cid aacid at kde.org
Sun Aug 8 05:11:02 PDT 2010


A Divendres, 6 d'agost de 2010, Harry Roberts va escriure:
> The following PDF reproduces the bug:
> http://midnight-labs.org/testextract.pdf
> 
> I tried to get the frontpage from a few other PDFs, but when I extract with
> `pdftk` poppler decides to like the new PDF that's produced.

What are you exactly doing? 

pdftotext, test-poppler-qt4 and poppler-glib-demo of the master branch all 
work like a charm here.

Albert

> 
> On 6 August 2010 14:52, Albert Astals Cid <aacid at kde.org> wrote:
> > A Dijous, 5 d'agost de 2010, Harry Roberts va escriure:
> > > Just to give you all a heads up I've had to revert (in my local branch)
> > 
> > the
> > 
> > > code which used getRawChars because it's breaking a huge number of
> > > PDFs.
> > > 
> > > See Albert's e-mail from 14 Jun:
> > > http://www.mail-archive.com/poppler@lists.freedesktop.org/msg03784.html
> > > 
> > > Has anybody else experienced this? I normally pull changes in from the
> > 
> > main
> > 
> > > git repo every few weeks when I do test builds but only got round to
> > > debugging it today.
> > > 
> > > e.g. the error:
> > > 
> > > Internal: called getRawChars() on non-predictor stream
> > > 
> > > It looks quite trivial to implement `getRawChars` on regular `Streams`,
> > 
> > but
> > 
> > > I've not tried it locally yet.
> > 
> > Make sure you rebuild your programs using libpoppler as my change breaks
> > binary compatibility and if you don't rebuild your programs the incorrect
> > symbols will be used resulting in weird behaviour.
> > 
> > If that fails provide a pdf that gets broken.


More information about the poppler mailing list