[poppler] Help with pdfimages on USGS maps

Phil Endecott phil_srutw_endecott at chezphil.org
Sat Mar 21 12:09:14 PDT 2009


Albert Astals Cid wrote:
> A Dijous, 19 de mar? de 2009, Phil Endecott va escriure:
>> Dear All,
>>
>> USGS maps can be downloaded, free, from their web site at
>> http://store.usgs.gov/.  Here's an example of what you can get (17
>> MByte file): http://chezphil.org/tmp/Boston_South_K42071C1_geo.PDF.
>> It's a PDF from which pdfimages will happily extract a few hundred JPEGs.
>>
>> What I'd like to do is to assemble a single large raster image (TIFF,
>> JPEG, whatever) at the natural resolution of those embedded images.
>> That means assembling those few hundred JPEG images in the right
>> pattern.  And I'd like to be able to do that automatically for a large
>> number of these files.  So:
>>
>> - Does pdfimages write out the images in an order that has some
>> guaranteed relationship to the position of the images on the page?
>
> I'm almost sure it just outputs them as they are found on the pdf commands 
> that has nothing to do with their position on the page.
>
>> - Can pdfimages be hacked to output some hint of the positions of each
>> image?
>
> Should not be very difficult, but you need to be a coder to do that.

Hi Albert,

Thanks for that.  I may have found another way to solve my problem, but 
if I need to pursue this I'll look into it further.


Cheers,  Phil.





More information about the poppler mailing list