Sanghwan Kim

prof_pic.jpg

Munich, Germany

I am an ELLIS PhD Student co-advised by Zeynep Akata (TUM & Helmholtz Munich) and Yongqin Xian (Google Zurich). I hold a Master’s degree in Data Science from ETH Zurich and a Bachelor’s degree in Electrical Engineering from KAIST.

My research focuses on machine perception, particularly at the intersection of computer vision and natural language processing. Currently, I am working on vision-language pre-training and video question answering.

I’m always open to collaborations or project supervisions! Feel free to reach out :).

Selected publications

2024

  1. kim2024cosmos.png
    COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
    Sanghwan Kim, Rui Xiao, Mariana-Iuliana Georgescu, and 2 more authors
    arXiv preprint arXiv:2412.01814, 2024
  2. xiao2024flair.png
    FLAIR: VLM with Fine-grained Language-informed Image Representations
    Rui Xiao, Sanghwan Kim, Mariana-Iuliana Georgescu, and 2 more authors
    arXiv preprint arXiv:2412.03561, 2024
  3. kim2023lalm.png
    PALM: Predicting Actions through Language Models
    Sanghwan Kim, Daoji Huang, Yongqin Xian, and 3 more authors
    ECCV, 2024
  4. kim2023distilling.jpg
    Distilling ODE Solvers of Diffusion Models into Smaller Steps
    Sanghwan Kim, Hao Tang, and Fisher Yu
    CVPR, 2024

2023

  1. kim2023boosting.jpg
    Boosting Radiology Report Generation by Infusing Comparison Prior
    Sanghwan Kim, Farhad Nooralahzadeh, Morteza Rohanian, and 5 more authors
    ACL Workshop, 2023
  2. kim2023achieving.jpg
    Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
    Sanghwan Kim, Lorenzo Noci, Antonio Orvieto, and 1 more author
    CVPR, 2023