[Libreoffice] Symbolic regression for curvefitting.

Fabian Deutsch fabian.deutsch at gmx.de
Sun Jan 2 12:12:01 PST 2011


Hey,

as a part of my diploma thesis I developed a symbolic regression tool,
finding _compact formulas_ for datasets.
So what does this mean? E.g. it can find the formula for the surface of
a circle providing samples to the software.

Maybe it makes sense or there is interest in integrating this method as
an alternative to classic curve-fitting methods into libreoffice calc. 
AFAIK neither Microsoft Office nor gnumeric includes this method (that's
why I've also posted a similar mail to gnumeri's ml).

I've written a prototype in vala which works fairly well. It can be
found at
http://gitorious.org/pigp/libmlgp

It includes a library doing all the work and a simple commandline
interface to run a symbolic regression. Please handle this software with
care, sometimes it does not know what it is doing :)

The docs/ folder provides more informations on how to build and how to
run a regression.

More on the topic of symbolic regression (Schmidt and Lipson published a
nice implementation in 2009 which raised my interest): 
http://ccsl.mae.cornell.edu/eureqa
http://www.hakank.org/eureqa/
My implementation differs from different ones, as it uses
multi-objective optimization (via NSGA-2) to find (several differing)
compact formulas, classical symbolic regression is also fitting curves
to data, but it creates very long formulas to fit the data optimal.

Thoughts?

- fabian



More information about the LibreOffice mailing list