Analysing the residual stream of language models under knowledge conflicts

Yu Zhao, Xiaotang Du, Giwon Hong, Aryo Pradipta Gema, Alessio Devoto, Hongru Wang, Xuanli He, Kam-Fai Wong, Pasquale Minervini

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
