Hi Michael,<br><br><br>nice to ear from someone so "up the ranks" like you.. makes me feel much more important :-)<br><br><div class="gmail_quote">2012/7/6 Michael Meeks <span dir="ltr"><<a href="mailto:michael.meeks@suse.com" target="_blank">michael.meeks@suse.com</a>></span><br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Flavio,<br>
<div class="im"><br>
On Tue, 2012-07-03 at 11:45 +0100, Flavio Moringa wrote:<br>
> my name is Flávio Moringa, I'm from Portugal and I'm starting my<br>
> Masters Dissertation next September (Master in Open Source software -<br>
> <a href="http://moss.dcti.iscte.pt" target="_blank">http://moss.dcti.iscte.pt</a> ).<br>
<br>
</div> Welcome :-)<br></blockquote><div><br>Thanks <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div class="im"><br>
> I'm not a programmer, so what I'm interested in doing is something in<br>
> the lines of investigating the main conversion problems, identifying<br>
> the possible conversion flows, analysing the way the conversion flow<br>
> is implemented in LibreOffice, and eventually trying to improve this<br>
> flow somehow.<br>
<br>
</div> So - it will be hard to improve the flow without being a programmer I'm<br>
afraid :-)<br></blockquote><div><br>well, although not a programmer right now I've had my fair share of perl, python, c, bash, java, php... maybe I'm not so "fluent" in programming right now, but I'm certainly no strange to it, and definitely not afraid to do it if the need arises... what I meant was that I'll probably wont't be able to do a conversion engine by myself... but I can definitely mess around with code...<br>
</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div class="im"><br>
> From your reply I assume that testing the filters, and doing<br>
> regression tests is something I could do, maybe identifying the main<br>
> conversion issues in groups of documents and kind of creating a "major<br>
> conversion issues" table, and prioritizing those issues. Is there<br>
> already something like that?<br>
<br>
</div> There is a useful QA role in prioritising bug reports and<br>
interoperability issues; we have a real problem with masses of bug<br>
reports many of which could be duplicates. Having said that -<br>
interoperability has many, many known feature / impedance mis-matches<br>
that are non-trivial development problems to fix.<br>
<br>
One thing that -would- be really useful, and that Microsoft have<br>
internally, is an analysis tool for Microsoft's XML document formats -<br>
such that we can get a good idea of which attributes are actually used<br>
much. ie. by analysing and comparing a large corpus of documents out<br>
there, we can answer questions such as:<br>
<br>
"should we implement surface charts, or 3D doughnut charts ?"<br>
<br>
given whatever amount of feature-development time we have - simply by<br>
referring to the database of crunched XML files to work out which one is<br>
used most.<br>
<br>
It'd be nice to have that for ODF as well too of course for when we<br>
have to make zero-sum back-compatibility decisions; but for<br>
interoperability crunching those MS documents would be really good.<br>
<br>
Is that something you could do ? a bit of perl, zip extraction, XML<br>
parsing, etc. ?<br></blockquote><div><br>Yes, it's definitely something I can do... I do believe that the harder part is getting that " large corpus of documents out<br>
there...". At least as my experience goes, I've found that it's hard to get users to send us documents they use... either due to privacy questions or enterprise policies... But a tool like that makes a lot of sense<br>
</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
Developers are -much- more likely to let themselves be lead by<br>
objective statistics on real documents out there, rather than subjective<br>
feelings of priority - which can prove rather controversial :-)<br></blockquote><div><br>I can certainly relate to that... <br> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
Thanks !<br></blockquote><div><br>For now then I'll start doing as you suggest and look in bugzilla for documents with conversion problems to try and compile as much examples as I can. Then maybe using the latest beta to do the conversion and see which problems are still there. Then maybe starting a perl script that can scrap the OOXML files to find the most used tags... and start from there...<br>
</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<span class="HOEnZb"><font color="#888888"><br>
Michael.<br>
<br>
--<br>
<a href="mailto:michael.meeks@suse.com">michael.meeks@suse.com</a> <><, Pseudo Engineer, itinerant idiot<br>
<br>
</font></span></blockquote></div><br><br clear="all">Thanks a lot for helping out.<br>Cheers<br><br>-- <br><font size="-1"><b>Flávio Moringa</b></font><font size="1"><br>
Project Leader<br><br><img src="http://people.caixamagica.pt/flaviomoringa/images/caixamagica.png"><br>
<br>
Caixa Mágica Software<br>
Energia Open Source<br>
Rua Soeiro Pereira Gomes, Lote 1 - 4.º B,<br>
Edifício Espanha, 1600-196 Lisboa - Portugal<br>
Tel.: +351 217 921 260 Fax: +351 217 921 261<br><a href="http://www.caixamagica.pt" target="_blank">http://www.caixamagica.pt</a><br>
<a href="https://twitter.com/flaviomoringa" target="_blank">https://twitter.com/flaviomoringa</a><br>
<a href="https://www.facebook.com/flavio.moringa" target="_blank">https://www.facebook.com/flaviomoringa</a><br><a href="http://pt.linkedin.com/in/flaviomoringa" target="_blank">http://pt.linkedin.com/in/flaviomoringa</a><br>
</font><font size="1"><a href="http://people.caixamagica.pt/flaviomoringa" target="_blank">http://people.caixamagica.pt/flaviomoringa</a><br><br></font><br>