[poppler] getting /Catalog /Lang in poppler-glib
Albert Astals Cid
aacid at kde.org
Thu Apr 14 22:25:20 UTC 2022
El dijous, 14 d’abril de 2022, a les 13:16:58 (CEST), Pablo Rodríguez va escriure:
> Dear list,
>
> I use the Python bindings for Poppler (through GObject introspection) to
> extract some metadata from PDF documents.
>
> Here is a minimal script:
>
>
> import sys
>
> import os
>
> import gi
>
> gi.require_version('Poppler', '0.18')
>
> from gi.repository import Poppler
>
> gi.require_version('Gst', '1.0')
>
> from gi.repository import Gst
>
> Gst.init(sys.argv)
>
> pdf = "a.pdf"
>
> uri = Gst.filename_to_uri(os.path.abspath(pdf))
>
> doc = Poppler.Document.new_from_file(uri, None)
>
> title = doc.get_title()
>
> print(title)
>
>
> Is there a way that I can extract the /Lang value from the /Catalog
> dictionary? (Attached PDF document with that entry.)
No, thought it should be relatively easy to add support for it. Will you contribute a patch?
Cheers,
Albert
>
> I’m afraid I searched https://lazka.github.io/pgi-docs/, but I wasn’t
> able to find anything that could give the language from the document.
>
> Many thanks for your help,
>
> Pablo
>
More information about the poppler
mailing list