Yoav Gur-Arieh

Hello! I’m Yoav, a graduate student in Mor Geva’s lab, where I specialize in interpretability research on LLMs. My work focuses on understanding how large language models operate internally, particularly how they store and retrieve knowledge. I also explore methods to refine these systems to make them safer and more effective for diverse applications.

Beyond my research, I’m part of the Adi Lautman Interdisciplinary Program for Outstanding Students at Tel Aviv University. This unique program fosters creativity and independent thinking, allowing students to explore any academic field. I’ve taken advantage of this freedom to study biology, neuroscience, philosophy, and history, gaining insights that complement my technical work.

Outside of my work, I enjoy slacklining, climbing, playing Spikeball, and playing music 🙃.

selected publications

  1. Precise In-Parameter Concept Erasure in Large Language Models
    Yoav Gur-Arieh, Clara Suslik, Yihuai Hong, Fazl Barez, and Mor Geva
    2025
  2. Enhancing Automated Interpretability with Output-Centric Feature Descriptions
    Yoav Gur-Arieh, Roy Mayan, Chen Agassy, Atticus Geiger, and Mor Geva
    In The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025