<div dir="ltr">Hi Kohei,<div><br></div><div>There are two models. One model is responsible for predicting the probability of each unit test to fail, and the other is predicting the overall probability of the patch to fail any unit test. The first has a failure recall of 95% (meaning 95% failures are captured) and a success recall of 85% (meaning reducing the number of unit tests by 85%), but this model is not directly used in deciding whether to run the fast or normal track. The second model is far less accurate, so <a href="https://baolef.github.io/libreoffice-ci/2023/08/03/week10.html#smart-inference">smart inference</a> is performed in jenkins, which has 91% failure recall and 57% success recall.</div><div><br></div><div>The confusion matrix for the first model is posted <a href="https://baolef.github.io/libreoffice-ci/2023/07/27/week9.html">here</a>.</div><div><br></div><div>Best,</div><div>Baole Fang</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Aug 18, 2023 at 9:45 AM Kohei Yoshida <<a href="mailto:kohei@libreoffice.org" target="_blank">kohei@libreoffice.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hello,<br>
<br>
On 09.08.2023 10:57, Baole Fang wrote:<br>
<br>
> Feel free to contact me if you have any questions!<br>
<br>
Just out of curiosity, what is the overall accuracy of your model? Do <br>
you have a confusion matrix or something similar that shows the <br>
performance of your model?<br>
<br>
Thanks,<br>
Kohei<br>
</blockquote></div>