<html>
<head>
<base href="https://bugs.freedesktop.org/" />
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW --- - poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should"
href="https://bugs.freedesktop.org/show_bug.cgi?id=69608#c5">Comment # 5</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW --- - poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should"
href="https://bugs.freedesktop.org/show_bug.cgi?id=69608">bug 69608</a>
from <span class="vcard"><a class="email" href="mailto:p@pwaller.net" title="Peter Waller <p@pwaller.net>"> <span class="fn">Peter Waller</span></a>
</span></b>
<pre>Here is a session showing what I observe:
pwaller@fractal:~$ aptitude show libpoppler-glib8
Package: libpoppler-glib8
State: installed
Automatically installed: no
Multi-Arch: same
Version: 0.24.1-0ubuntu1
Priority: optional
Section: libs
Maintainer: Ubuntu Developers <<a href="mailto:ubuntu-devel-discuss@lists.ubuntu.com">ubuntu-devel-discuss@lists.ubuntu.com</a>>
Architecture: amd64
Uncompressed Size: 355 k
Depends: libc6 (>= 2.14), libcairo2 (>= 1.12.0), libfreetype6 (>= 2.2.1),
libglib2.0-0 (>= 2.37.3), libpoppler43 (>= 0.24.1), libstdc++6 (>=
4.1.1)
PreDepends: multiarch-support
Breaks: libpoppler-glib8 (!= 0.24.1-0ubuntu1)
Replaces: libpoppler-glib8 (< 0.24.1-0ubuntu1)
Description: PDF rendering library (GLib-based shared library)
Poppler is a PDF rendering library based on Xpdf PDF viewer.
This package provides the GLib-based shared library for applications using the
GLib interface to Poppler.
Homepage: <a href="http://poppler.freedesktop.org/">http://poppler.freedesktop.org/</a>
pwaller@fractal:~$ wget
<a href="http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf">http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf</a>
--2013-09-22 10:27:25--
<a href="http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf">http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf</a>
Resolving ww1.microchip.com (ww1.microchip.com)... 77.67.21.35, 77.67.21.27
Connecting to ww1.microchip.com (ww1.microchip.com)|77.67.21.35|:80...
connected.
HTTP request sent, awaiting response... 200 OK
Length: 2337393 (2.2M) [application/pdf]
Saving to: ‘22197B.pdf’
100%[======================================>] 2,337,393 1.42MB/s in 1.6s
2013-09-22 10:27:27 (1.42 MB/s) - ‘22197B.pdf’ saved [2337393/2337393]
pwaller@fractal:~$ python
Python 2.7.5+ (default, Sep 19 2013, 13:48:49)
[GCC 4.8.1] on linux2
Type "help", "copyright", "credits" or "license" for more information.
<span class="quote">>>> from gi.repository import Poppler
>>> doc = Poppler.Document.new_from_file("file:///home/pwaller/22197B.pdf", "")>>> page = doc.get_page(0)
>>> ok, layout = page.get_text_layout()
>>> text = page.get_text()
>>> len(layout), len(text)</span >
(1476, 1520)</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>