# Benchmarking Mental State Representations in Language Models

**[Matteo Bortoletto][1],   [Constantin Ruhdorfer][5],   [Lei Shi][2],   [Andreas Bulling][3]**

**ICML 2024 Workshop on Mechanistic Interpretability, Vienna, Austria**
**[[Paper][4]]**

# Citation

```bibtex
@inproceedings{bortoletto2024benchmarking,
  title={Benchmarking Mental State Representations in Language Models},
  author={Matteo Bortoletto and Constantin Ruhdorfer and Lei Shi and Andreas Bulling},
  booktitle={ICML 2024 Workshop on Mechanistic Interpretability},
  year={2024},
  url={https://openreview.net/forum?id=yEwEVoH9Be}
}
```

Under construction

[1]: https://mattbortoletto.github.io/
[2]: https://perceptualui.org/people/shi/
[3]: https://perceptualui.org/people/bulling/
[4]: https://openreview.net/forum?id=yEwEVoH9Be
[5]: https://perceptualui.org/people/ruhdorfer/