[Poppler-bugs] [Bug 69608] poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should
bugzilla-daemon at freedesktop.org
Sun Sep 22 02:32:55 PDT 2013
https://bugs.freedesktop.org/show_bug.cgi?id=69608
--- Comment #5 from Peter Waller <p at pwaller.net> ---
Here is a session showing what I observe:
pwaller at fractal:~$ aptitude show libpoppler-glib8
Package: libpoppler-glib8
State: installed
Automatically installed: no
Multi-Arch: same
Version: 0.24.1-0ubuntu1
Priority: optional
Section: libs
Maintainer: Ubuntu Developers <ubuntu-devel-discuss at lists.ubuntu.com>
Architecture: amd64
Uncompressed Size: 355 k
Depends: libc6 (>= 2.14), libcairo2 (>= 1.12.0), libfreetype6 (>= 2.2.1), libglib2.0-0 (>= 2.37.3), libpoppler43 (>= 0.24.1), libstdc++6 (>= 4.1.1)
PreDepends: multiarch-support
Breaks: libpoppler-glib8 (!= 0.24.1-0ubuntu1)
Replaces: libpoppler-glib8 (< 0.24.1-0ubuntu1)
Description: PDF rendering library (GLib-based shared library)
Poppler is a PDF rendering library based on Xpdf PDF viewer.
This package provides the GLib-based shared library for applications using the
GLib interface to Poppler.
Homepage: http://poppler.freedesktop.org/
pwaller at fractal:~$ wget http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf
--2013-09-22 10:27:25--  http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf
Resolving ww1.microchip.com (ww1.microchip.com)... 77.67.21.35, 77.67.21.27
Connecting to ww1.microchip.com (ww1.microchip.com)|77.67.21.35|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 2337393 (2.2M) [application/pdf]
Saving to: ‘22197B.pdf’
100%[======================================>] 2,337,393 1.42MB/s in 1.6s
2013-09-22 10:27:27 (1.42 MB/s) - ‘22197B.pdf’ saved [2337393/2337393]
pwaller at fractal:~$ python
Python 2.7.5+ (default, Sep 19 2013, 13:48:49)
[GCC 4.8.1] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from gi.repository import Poppler
>>> doc = Poppler.Document.new_from_file("file:///home/pwaller/22197B.pdf", "")
>>> page = doc.get_page(0)
>>> ok, layout = page.get_text_layout()
>>> text = page.get_text()
>>> len(layout), len(text)
(1476, 1520)
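One way to narrow this down is to compare the rectangle count against the number of Unicode characters as well as the raw length. This is only a sketch and has not been run against this PDF. It assumes that under Python 2 get_text() may return a UTF-8 encoded byte string, in which case len(text) counts bytes, not characters; that assumption is not confirmed by the session above. The helper names count_code_points and compare_counts are hypothetical and are not part of Poppler.

```python
def count_code_points(text):
    """Return the number of Unicode code points in text.

    text may be a UTF-8 encoded byte string (assumed to be what a
    Python 2 binding hands back) or an already-decoded unicode string.
    """
    if isinstance(text, bytes):
        text = text.decode("utf-8")
    return len(text)


def compare_counts(n_rects, text):
    """Summarise rectangle count against byte and character counts.

    n_rects is the number of layout rectangles, e.g. len(layout).
    """
    if isinstance(text, bytes):
        n_bytes = len(text)
    else:
        n_bytes = len(text.encode("utf-8"))
    n_chars = count_code_points(text)
    return {
        "rectangles": n_rects,
        "bytes": n_bytes,
        "chars": n_chars,
        "match": n_rects == n_chars,
    }
```

In the session above this would be called as compare_counts(len(layout), text). If "chars" still differs from "rectangles", the mismatch is not just an encoding artefact.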