public-projects/mental-states-in-LMs

Code for the paper "Benchmarking Mental State Representations in Language Models", ICML 2024 Workshop on Mechanistic Interpretability

Matteo Bortoletto 17196550f5 Update README.md		2024-06-25 15:06:04 +02:00

Benchmarking Mental State Representations in Language Models

Matteo Bortoletto, Constantin Ruhdorfer, Lei Shi, Andreas Bulling

ICML 2024 Workshop on Mechanistic Interpretability, Vienna, Austria
[Paper]

Citation

@inproceedings{
    bortoletto2024benchmarking,
    title={Benchmarking Mental State Representations in Language Models},
    author={Matteo Bortoletto and Constantin Ruhdorfer and Lei Shi and Andreas Bulling},
    booktitle={ICML 2024 Workshop on Mechanistic Interpretability},
    year={2024},
    url={https://openreview.net/forum?id=yEwEVoH9Be}
}

Under construction