<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; color: rgb(0, 0, 0); font-size: 14px; font-family: Calibri, sans-serif; "><div>The current systems appears to try and use system-available fonts as approximations for whatever font is in the PDF. For pdftohtml, I am considering adding in a preferred behavior:</div><div><br></div><div>1. Extract the original font from the PDF</div><div>2. Create a font file for that font</div><div>3. Reference the font file, using "@font-face" in the generated HTML.</div><div><br></div><div>This should give us an exact representation of the original font in the PDF, though it will only work with modern browsers, since earlier browsers don't support "@font-face". For IE, I'll have to convert the font to EOT, and for the others I'll probably use regular OpenType (not TrueType) format.</div><div><br></div><div>If I only use the extracted font to display the original document in it's original form, and not to draw additional glyphs in any document, I believe I'll be in compliance with "fair use" and digital copyright rules for the font.</div><div><br></div><div>Does anyone see an issue with the approach, or have any advice? For instance, I'm not sure how much luck I'll have with converting especially Type 3 fonts to OpenType/EOT.</div><div><br></div><div>Thanks, --josh</div></body></html>