Sanghwan Kim

Munich, Germany

I am an ELLIS PhD Student co-advised by Zeynep Akata (TUM & Helmholtz Munich) and Yongqin Xian (Google Zurich). I hold a Master’s degree in Data Science from ETH Zurich and a Bachelor’s degree in Electrical Engineering from KAIST.

My research focuses on multimodal learning and vision-language alignment. Currently, I am working on improving MLLM reasoning and long video question answering.

I’m always open to collaborations and to supervising projects! Feel free to reach out :)

Selected publications

2025

  1. COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
    Sanghwan Kim, Rui Xiao, Mariana-Iuliana Georgescu, and 2 more authors
    CVPR, 2025
  2. FLAIR: VLM with Fine-grained Language-informed Image Representations
    Rui Xiao, Sanghwan Kim, Mariana-Iuliana Georgescu, and 2 more authors
    CVPR, 2025

2024

  1. PALM: Predicting Actions through Language Models
    Sanghwan Kim, Daoji Huang, Yongqin Xian, and 3 more authors
    ECCV, 2024
  2. Distilling ODE Solvers of Diffusion Models into Smaller Steps
    Sanghwan Kim, Hao Tang, and Fisher Yu
    CVPR, 2024

2023

  1. Boosting Radiology Report Generation by Infusing Comparison Prior
    Sanghwan Kim, Farhad Nooralahzadeh, Morteza Rohanian, and 5 more authors
    ACL Workshop, 2023
  2. Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
    Sanghwan Kim, Lorenzo Noci, Antonio Orvieto, and 1 more author
    CVPR, 2023