[Libreoffice-bugs] [Bug 124982] Thread the Calc CSV parser

bugzilla-daemon at bugs.documentfoundation.org bugzilla-daemon at bugs.documentfoundation.org
Tue May 14 16:37:42 UTC 2019


https://bugs.documentfoundation.org/show_bug.cgi?id=124982

Michael Meeks <michael.meeks at collabora.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|normal                      |enhancement
     Ever confirmed|0                           |1
            Summary|Calc seems only capable of  |Thread the Calc CSV parser
                   |using one CPU core          |
             Status|UNCONFIRMED                 |NEW
                 CC|                            |libreoffice at kohei.us

--- Comment #5 from Michael Meeks <michael.meeks at collabora.com> ---
Calc has a single threaded CSV parser - that is correct. Given the general
simplicity of the CSV file format, it seems likely that profiling that with
kcachegrind and optimizing it further would probably yield more of a win than
trying to thread it - but of course, its possible that a parse/insert
separation would be helpful.

It is also possible that some column iterators for data insertion might help
optimize the mdds side.

Can you paste a few rows of your CSV file ? how homogeneous is it ? are there
large gaps in lots of the columns - or is it uniform ?

Anyhow - an enhancement here - someone might like to hack on that:

sc/source/filter/orcus/orcusfiltersimpl.cxx
bool ScOrcusFiltersImpl::importCSV(ScDocument& rDoc, SfxMedium& rMedium) const

Looks like a good place to poke. I believe much of the CSV parsing is inside
the standalone orcus library which should be easy to hack on - but whether
that's where the slowness is its hard to say without profiling =)

HTH.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.freedesktop.org/archives/libreoffice-bugs/attachments/20190514/2b002353/attachment.html>


More information about the Libreoffice-bugs mailing list