This repo has been tested with Python 3 on Ubuntu. If you need a different setup, don't hesitate to contact us on Discord. The Discord server can be found at https://docs ...
Holistic Evaluation of Language Models (HELM) is an open-source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible, and transparent ...