The Multimodal Lab

I lead the Multimodal Lab at Bar-Ilan University. We study multimodal generative models, attention mechanisms, and inference-time control, with students working across vision, language, audio, and video.

1 PhD · 8 MSc · 2 Alumni · 4 Joint Advisors

PhD Students

1 active

Ben Fishman

Joint with Gal Chechik

MSc Students

8 active

Gilad Carmel

MSc Candidate

Mark Vexler

MSc Candidate

Uriel Dolev

Joint with Yoav Goldberg

Amit Ronen

MSc Candidate

Aviv Weidenfeld

MSc Candidate

Omri Keren

MSc Candidate

Binyamin Ramati

MSc Candidate

Yona Orunov

MSc Candidate

Alumni

2 graduated

Shira Schiber

Joint with Ofir Lindenbaum

TempoControl · CVPR'26

Yair Shpitzer

Joint with Gal Chechik

SISO · CVPR'26 Workshop

News

  • Apr 2026 LaMI: Augmenting Large Language Models via Late Multi-Image Fusion accepted to ACL 2026 (Main Conference).
  • Apr 2026 Single Image Iterative Subject-driven Generation and Editing accepted to the P13N Workshop, CVPR 2026.
  • Apr 2026 TempoControl: Temporal Attention Guidance for Text-to-Video Models accepted to CVPR 2026.
  • Mar 2026 Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models presented at WACV 2026.
  • 2024 Started as Assistant Professor at Bar-Ilan University and founded the Multimodal Lab.

Publications

Contact

  • idanschwartz at gmail dot com
  • Computer Science Department, Room 213
    Building 503
    Bar-Ilan University
    Ramat Gan, Israel