Benchmarking Mental State Representations in Language Models

<div align="center">
<h1> Benchmarking Mental State Representations in Language Models </h1>

**[Matteo Bortoletto][1], &nbsp; [Constantin Ruhdorfer][5], &nbsp; [Lei Shi][2], &nbsp; [Andreas Bulling][3]** <br> <br>
**ICML 2024 Workshop on Mechanistic Interpretability, Vienna, Austria** <br>
**[[Paper][4]]**

</div>

# Citation

```bibtex
@inproceedings{
bortoletto2024benchmarking,
title={Benchmarking Mental State Representations in Language Models},
author={Matteo Bortoletto and Constantin Ruhdorfer and Lei Shi and Andreas Bulling},
booktitle={ICML 2024 Workshop on Mechanistic Interpretability},
year={2024},
url={https://openreview.net/forum?id=yEwEVoH9Be}
}
```


Under construction


[1]: https://mattbortoletto.github.io/
[2]: https://perceptualui.org/people/shi/
[3]: https://perceptualui.org/people/bulling/
[4]: https://openreview.net/forum?id=yEwEVoH9Be
[5]: https://perceptualui.org/people/ruhdorfer/