[poppler] c++ ustring encoding still completely broken
Albert Astals Cid
aacid at kde.org
Sat Dec 1 23:06:53 UTC 2018
On Saturday, 1 December 2018, at 23:20:46 CET, Jeroen Ooms wrote:
> I maintain the poppler bindings for the R programming language and get
> a lot of bug reports about corrupted text extracted with poppler.
> Below is a minimal example that illustrates the problem:
>
> git clone https://github.com/jeroen/popplertest
> cd popplertest
> g++ -std=c++11 encoding.cpp -o encoding $(pkg-config --cflags --libs poppler-cpp)
> ./encoding hello.pdf
>
> The output shows a lot of Chinese characters, which is incorrect (all
> text in the PDF is English).
>
> Back in March 2018, Suzuki Toshiya posted a patch with at least a
> partial solution:
> https://lists.freedesktop.org/archives/poppler/2018-March/012962.html
> I hope we can revisit this.
Can someone please post a patch as a merge request on the new GitLab? It's muuuuuch easier to keep track of what needs reviewing if we have it all there.
Cheers,
Albert
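
For readers of the archive: the symptom in the quoted report (English text coming out as Chinese characters) is what typically happens when single-byte text is decoded as UTF-16. The sketch below is not a diagnosis of poppler and does not use the poppler API; the helper names `misread_as_utf16be` and `is_cjk_ideograph` are hypothetical and exist only to illustrate that general effect.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Pair up consecutive bytes as big-endian UTF-16 code units, the way a
// decoder would if it wrongly assumed single-byte text was UTF-16BE.
// A trailing odd byte is dropped. (Hypothetical helper, not poppler code.)
std::vector<std::uint16_t> misread_as_utf16be(const std::string& bytes) {
    std::vector<std::uint16_t> units;
    for (std::size_t i = 0; i + 1 < bytes.size(); i += 2) {
        const auto hi = static_cast<unsigned char>(bytes[i]);
        const auto lo = static_cast<unsigned char>(bytes[i + 1]);
        units.push_back(static_cast<std::uint16_t>((hi << 8) | lo));
    }
    return units;
}

// True if the code unit lies in the CJK Unified Ideographs block
// (U+4E00..U+9FFF). (Hypothetical helper.)
bool is_cjk_ideograph(std::uint16_t unit) {
    return unit >= 0x4E00 && unit <= 0x9FFF;
}
```

Under this assumption, the six bytes of "hello!" become the three code units U+6865, U+6C6C and U+6F21, all of which are CJK ideographs. More generally, any byte pair whose first byte is a lowercase ASCII letter (0x61 to 0x7A) falls inside that block, which would explain why plain English text turns into mostly Chinese-looking output.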