Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...
Use Windows Sandbox to safely install and test unknown apps in an isolated environment. Protect your PC from malware and risky software without affecting your system.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results