[Poppler-bugs] [Bug 77087] High CPU usage
bugzilla-daemon at freedesktop.org
bugzilla-daemon at freedesktop.org
Sun Apr 13 09:25:59 PDT 2014
https://bugs.freedesktop.org/show_bug.cgi?id=77087
Jason Crain <jason at aquaticape.us> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |jason at aquaticape.us
--- Comment #4 from Jason Crain <jason at aquaticape.us> ---
Tracked the regression to this commit:
9c5612f6e013a8698eff6531ec388a7e6c1fb89a is the first bad commit
commit 9c5612f6e013a8698eff6531ec388a7e6c1fb89a
Author: Marek Kasik <mkasik at redhat.com>
Date: Fri Feb 12 14:31:01 2010 +0100
Distinguish between columns and tables when selecting text
This commit add ability to detect tables in text by checking borders
of 4 neighbouring text blocks for arrangement (to the left, to the right,
center, ...). Detected border of whole table is then stored in ExMin,
ExMax,
EyMin and EyMax of each block together with id of detected table. Sorting
of blocks is then performed on the these borders to be able to distinguish
tables from columns.
Pasting of selected text was modified so that tables are pasted correctly
(even with multi line cells).
:040000 040000 e58e22d3707422029f1ca753868164eb22cf8bb4
46b3ae4c8d7cee01fc6e69a4038140ecab7ce361 M poppler
It's slowed in a recursive function, TextBlock::visitDepthFirst. The algorithm
in there runs at O(n^3) in the worst case, and the tables patch makes the worst
case more likely. I think the pdftoppm/pdftocairo issue mentioned above is a
separate problem, possibly caused by the large size of the document (819 inches
x 30 inches).
--
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.freedesktop.org/archives/poppler-bugs/attachments/20140413/165e4deb/attachment.html>
More information about the Poppler-bugs
mailing list