[Poppler-bugs] [Bug 69608] poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Sun Sep 22 02:32:55 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=69608

--- Comment #5 from Peter Waller <p at pwaller.net> ---
Here is a session showing what I observe:

pwaller at fractal:~$ aptitude show libpoppler-glib8
Package: libpoppler-glib8                
State: installed
Automatically installed: no
Multi-Arch: same
Version: 0.24.1-0ubuntu1
Priority: optional
Section: libs
Maintainer: Ubuntu Developers <ubuntu-devel-discuss at lists.ubuntu.com>
Architecture: amd64
Uncompressed Size: 355 k
Depends: libc6 (>= 2.14), libcairo2 (>= 1.12.0), libfreetype6 (>= 2.2.1),
         libglib2.0-0 (>= 2.37.3), libpoppler43 (>= 0.24.1), libstdc++6 (>=
         4.1.1)
PreDepends: multiarch-support
Breaks: libpoppler-glib8 (!= 0.24.1-0ubuntu1)
Replaces: libpoppler-glib8 (< 0.24.1-0ubuntu1)
Description: PDF rendering library (GLib-based shared library)
 Poppler is a PDF rendering library based on Xpdf PDF viewer. 

 This package provides the GLib-based shared library for applications using the
 GLib interface to Poppler.
Homepage: http://poppler.freedesktop.org/

pwaller at fractal:~$ wget http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf
--2013-09-22 10:27:25--  http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf
Resolving ww1.microchip.com (ww1.microchip.com)... 77.67.21.35, 77.67.21.27
Connecting to ww1.microchip.com (ww1.microchip.com)|77.67.21.35|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 2337393 (2.2M) [application/pdf]
Saving to: ‘22197B.pdf’

100%[======================================>] 2,337,393   1.42MB/s   in 1.6s   

2013-09-22 10:27:27 (1.42 MB/s) - ‘22197B.pdf’ saved [2337393/2337393]

pwaller at fractal:~$ python
Python 2.7.5+ (default, Sep 19 2013, 13:48:49) 
[GCC 4.8.1] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> from gi.repository import Poppler
>>> doc = Poppler.Document.new_from_file("file:///home/pwaller/22197B.pdf", "")
>>> page = doc.get_page(0)
>>> ok, layout = page.get_text_layout()
>>> text = page.get_text()
>>> len(layout), len(text)
(1476, 1520)
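
One thing that may be worth ruling out before reading too much into the two
numbers above: on Python 2 the string returned by get_text() is possibly a
UTF-8 byte string, in which case len(text) counts bytes rather than
characters, while the layout list has one rectangle per character. That is an
assumption on my part, not something verified against this PDF. A minimal
sketch of the check, using a hypothetical helper name (count_mismatch) that is
not part of Poppler:

```python
# -*- coding: utf-8 -*-
# Hypothetical helper, not part of the Poppler API: compares the byte
# length and the character length of the extracted text with the number
# of layout rectangles.

def count_mismatch(text, n_rects):
    """Return (n_bytes, n_chars, n_rects) for the given page text.

    `text` may be a UTF-8 byte string or a unicode string; both are
    normalised so the byte count and the character count can be compared
    with the number of layout rectangles.
    """
    if isinstance(text, bytes):
        raw = text
        chars = text.decode("utf-8")   # assumes the text is UTF-8 encoded
    else:
        chars = text
        raw = text.encode("utf-8")
    return len(raw), len(chars), n_rects
```

In the session above this would be called as
count_mismatch(page.get_text(), len(layout)). If the second and third values
agree, the difference is only bytes versus characters; if they still differ,
the text and the layout really do disagree.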
