<html>
    <head>
      <base href="https://bugs.freedesktop.org/" />
    </head>
    <body>
      <p>
        <div>
            <b><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW --- - poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=69608#c5">Comment # 5</a>
              on <a class="bz_bug_link 
          bz_status_NEW "
   title="NEW --- - poppler_page_get_text() ordering does not agree with poppler_page_get_text_layout() as docs say it should"
   href="https://bugs.freedesktop.org/show_bug.cgi?id=69608">bug 69608</a>
              from <span class="vcard"><a class="email" href="mailto:p@pwaller.net" title="Peter Waller <p@pwaller.net>"> <span class="fn">Peter Waller</span></a>
</span></b>
        <pre>Here is a session showing what I observe:

pwaller@fractal:~$ aptitude show libpoppler-glib8
Package: libpoppler-glib8                
State: installed
Automatically installed: no
Multi-Arch: same
Version: 0.24.1-0ubuntu1
Priority: optional
Section: libs
Maintainer: Ubuntu Developers <<a href="mailto:ubuntu-devel-discuss@lists.ubuntu.com">ubuntu-devel-discuss@lists.ubuntu.com</a>>
Architecture: amd64
Uncompressed Size: 355 k
Depends: libc6 (>= 2.14), libcairo2 (>= 1.12.0), libfreetype6 (>= 2.2.1),
         libglib2.0-0 (>= 2.37.3), libpoppler43 (>= 0.24.1), libstdc++6 (>=
         4.1.1)
PreDepends: multiarch-support
Breaks: libpoppler-glib8 (!= 0.24.1-0ubuntu1)
Replaces: libpoppler-glib8 (< 0.24.1-0ubuntu1)
Description: PDF rendering library (GLib-based shared library)
 Poppler is a PDF rendering library based on Xpdf PDF viewer. 

 This package provides the GLib-based shared library for applications using the
 GLib interface to Poppler.
Homepage: <a href="http://poppler.freedesktop.org/">http://poppler.freedesktop.org/</a>

pwaller@fractal:~$ wget
<a href="http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf">http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf</a>
--2013-09-22 10:27:25-- 
<a href="http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf">http://ww1.microchip.com/downloads/en/DeviceDoc/22197B.pdf</a>
Resolving ww1.microchip.com (ww1.microchip.com)... 77.67.21.35, 77.67.21.27
Connecting to ww1.microchip.com (ww1.microchip.com)|77.67.21.35|:80...
connected.
HTTP request sent, awaiting response... 200 OK
Length: 2337393 (2.2M) [application/pdf]
Saving to: ‘22197B.pdf’

100%[======================================>] 2,337,393   1.42MB/s   in 1.6s   

2013-09-22 10:27:27 (1.42 MB/s) - ‘22197B.pdf’ saved [2337393/2337393]

pwaller@fractal:~$ python
Python 2.7.5+ (default, Sep 19 2013, 13:48:49) 
[GCC 4.8.1] on linux2
Type "help", "copyright", "credits" or "license" for more information.
<span class="quote">>>> from gi.repository import Poppler
>>> doc = Poppler.Document.new_from_file("file:///home/pwaller/22197B.pdf", "")>>> page = doc.get_page(0)
>>> ok, layout = page.get_text_layout()
>>> text = page.get_text()
>>> len(layout), len(text)</span >
(1476, 1520)</pre>
        </div>
      </p>
      <hr>
      <span>You are receiving this mail because:</span>
      
      <ul>
          <li>You are the assignee for the bug.</li>
      </ul>
    </body>
</html>