π Actively seeking PhD positions in the U.S. β Fall 2027
π¬ Post-Master Researcher in Computer Science
π Kyung Hee University
π¬ Vision and Learning Lab, Advised by Prof. Jinwoo Choi
π§ jong980812@khu.ac.kr
Hello! I recently completed my Masterβs degree in Computer Science at Kyung Hee University, and I am now continuing my research at the same lab as a Post-Master Researcher. I am currently seeking PhD positions in the United States (Fall 2027) to further pursue my research in video understanding and multimodal AI.
Prior to my graduate studies, I earned dual bachelor's degrees in Biomedical Engineering and Electronics Engineering. This interdisciplinary background has given me a strong foundation in signal processing, embedded systems, and AI, which I now integrate into my research.
My research is conducted at the Vision and Learning Lab under Professor Jinwoo Choi, where I specialize in Video Understanding. I am particularly focused on:
Recently, I have been deeply passionate about Video eXplainable AI and Video-Text multimodal learning. I am always open to collaboration and discussions on cutting-edge research in Computer Vision, Multimodal AI, and Explainability.
Feel free to connect with me! π
I completed my M.S. in Computer Science at Kyung Hee University and am now continuing my research as a Post-Master Researcher at the Vision and Learning Lab.
I am actively looking for PhD positions in the United States for Fall 2027, focusing on video understanding, multimodal AI, and explainability. If you think I could be a good fit for your group, I would love to hear from you β feel free to reach out at jong980812@khu.ac.kr!
We released a new preprint, βWhich Way Did It Move? Diagnosing and Overcoming Directional Motion Blindness in Video-LLMsβ.
We identify a fundamental failure mode of Video-LLMs β directional motion blindness β and introduce DeltaDirect, a parameter-efficient motion-change head that raises LLaVA-Video-7B from 27.6% to 85.4% on real-world direction QA.
π Check it out on arXiv (2605.22823) and the project page.
Iβm honored to share that our paper has been accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), one of the most prestigious journals in computer vision and AI!
Title: βCA2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognitionβ
This work extends our NeurIPS 2023 paper CAST by incorporating audio-visual reasoning into a unified cross-attention framework, enabling models to understand how sound, spatial cues, and temporal dynamics interact in complex video scenes.
We are proud that this work has been recognized by TPAMI β a meaningful milestone in our ongoing exploration of multimodal video understanding.
π Check it out on arXiv (2503.23447).
π Huge thanks to my amazing collaborators β Joohyun Chang and Jinwoo Choi† β for their constant guidance and dedication throughout this project.
Iβm beyond excited to announce that our paper has been accepted to NeurIPS 2025 as a Spotlight (3.5% acceptance rate), one of the most prestigious venues in machine learning and AI!
Title: βDisentangled Concepts Speak Louder Than Words: Explainable Video Action Recognitionβ
This work introduces a novel framework for structured and interpretable video action recognition, disentangling motion dynamics, objects, and scenes into human-understandable concepts. By doing so, it provides not only strong performance but also clear explanations of model decisions.
The Spotlight recognition makes this achievement even more meaningful, as only a small fraction of submissions receive this honor. Itβs an encouraging step forward in my research journey on explainable AI for video understanding.
π Iβm truly grateful to my amazing collaborators β Wooil Lee, Gyeong-Moon Park†, Seong Tae Kim†, and Jinwoo Choi† β for their invaluable contributions. Excited to present this work at NeurIPS and share our vision for more transparent and interpretable AI systems!
Iβm thrilled to share that my paper has been accepted to ICCV 2025 as a highlight paper, one of the top-tier conferences in computer vision!
Title: βESSENTIAL: Episodic and Semantic Memory Integration for Video Class-Incremental Learningβ
What makes this even more meaningful is that it was accepted after three submission attempts. The journey was challenging β filled with revisions, rejections, and countless hours of iteration β but it proved to be an incredible learning experience.
Iβm deeply grateful for the growth that came through the process and couldnβt be more excited to finally see it accepted.
π Thank you to everyone who supported me along the way. I look forward to presenting this work at ICCV and continuing to push forward in research!
* indicates equally contributed first authors
† indicates corresponding author
Authors: Jongseo Lee, Hyuntak Lee, Sunghun Kim, Sooa Kim, Jihoon Chung, Jinwoo Choi†
Preprint: arXiv 2026 (under review)
Authors: Jongseo Lee, Wooil Lee, Gyeong-Moon Park†, Seong Tae Kim†, Jinwoo Choi†
Conference: Proceedings of Neural Information Processing Systems (NeurIPS), Spotlight (3.5% acceptance rate), 2025
Authors: Jongseo Lee*, Kyungho Bae*, Kyle Min, Gyeong-Moon Park†, Jinwoo Choi†
Conference: IEEE/CVF International Conference on Computer Vision (ICCV), Highlight, 2025
Authors: Jongseo Lee, Wooil Lee, Gyeong-Moon Park†, Seong Tae Kim†, Jinwoo Choi†
Conference: IEEE/CVF Computer Vision and Pattern Recognition (CVPR) 4th XAI4CV Workshop, Spotlight (16.7% acceptance rate), 2025
Authors: Jongseo Lee*, Joohyun Chang*, Dongho Lee, Jinwoo Choi†
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Authors: Jongseo Lee*, Geo Ahn*, Seong Tae Kim†, Jinwoo Choi†
Preprint: arXiv 2024 (under review)
Authors: Dongho Lee*, Jongseo Lee*, Jinwoo Choi†
Conference: Proceedings of Neural Information Processing Systems (NeurIPS), 2023
Authors: Jongseo Lee, Su Hyeon Kim, Sun Woong Jang, Jun Yeong Moon, Doug Young Suh†
Journal: Journal of Appropriate Technology, Volume 8(2), 2022
Authors: Jongseo Lee, Soohyun Park, Jinwoo Choi†
Conference: Korean Institute of Information Scientists and Engineers (KIISE), 2024
Authors: Jongseo Lee, Soohyun Park, Jinwoo Choi†
Conference: Korean Institute of Information Scientists and Engineers (KIISE), 2024
Authors: Jongseo Lee, Joohyun Chang, Jinwoo Choi†
Conference: Korean Institute of Information Scientists and Engineers (KIISE), 2024