[poppler] [patch] Add ability to extract embedded files.
Jonathan Blandford
jrb at redhat.com
Mon Aug 29 11:01:01 EST 2005
Brad Hards <bradh at frogmouth.net> writes:
> G'day team,
>
> As a by-product of my flight to Akademy (yeah, thanks BA - I would like my
> clothes, back )-:, I worked up this little patch. It provides the capability
> to extract an "attached" or embedded file. I haven't seen this used a lot,
> but it was an interesting exercise in understanding the API a bit more, and
> the feature is a lot more usable in Acrobat 7, so users might start attaching
> the source document to files.
>
> The change to the core poppler code is as shown below. The change is actually
> pretty small - I did have to expand the API for the NameTree class a little.
> I'm not sure I'm walking down the datastructures in a reliable way, because I
> forgot to generate a test file with Acrobat before leaving home, and resorted
> to creating one with pdftk. See the attached document to see the proposed Qt4
> API / test application. I've also checked in that pdftk-created test example
> (into test/unittestcases/, as WithAttachments.pdf)
>
> There is potentially other metadata that could be extracted. At this stage
> there is a description that shows up in Acrobat Reader that I can't find in
> the file. I guess that Acrobat 7 will put more that pdftk-1.12, based on the
> column headers in Acrobat 7 Reader.
>
> This is a work in progress (especially the Qt4 bindings), but feedback would
> be appreciated - including help with testing it against Acrobat-generated
> files.
I haven't seen a doc using this yet, but it's pretty cool. After we
finish getting GNOME 2.12 out and branch, I'll try to add support for
this.
Thanks,
-Jonathan
More information about the poppler
mailing list