[poppler] c++ ustring encoding still completely broken

Albert Astals Cid aacid at kde.org
Sat Dec 1 23:06:53 UTC 2018


El dissabte, 1 de desembre de 2018, a les 23:20:46 CET, Jeroen Ooms va escriure:
> I maintain the poppler bindings for the R programming language and get
> a lot of bug reports about corrupted text extracted with poppler.
> Below a minimal example that illustrates the problem:
> 
>   git clone https://github.com/jeroen/popplertest
>   cd popplertest
>   g++ -std=c++11 encoding.cpp -o encoding $(pkg-config --cflags --libs
> poppler-cpp)
>   ./encoding hello.pdf
> 
> The output shows a lot of Chinese characters which is incorrect (all
> text in the pdf is english).
> 
> Back in March 2018, Suzuki Toshiya had posted a patch with at least a
> partial solution:
> https://lists.freedesktop.org/archives/poppler/2018-March/012962.html
> . I hope we can revisit this.

Can someone please post a patch to the new gitlab merge requests? It's muuuuuch easier to keep track of what needs reviewing if we have it all there.

Cheers,
  Albert

> _______________________________________________
> poppler mailing list
> poppler at lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/poppler
> 






More information about the poppler mailing list