Hi, my name is Xinyu Lyu (吕新昱). I’m an Associate Professor in the School of Computing and Artificial Intelligence at Southwestern University of Finance and Economics. Before that, I obtained my Ph.D. degree from the College of Computer Science, University of Electronic Science and Technology of China, advised by Prof. Lianli Gao, Prof. Jingkuan Song, and Prof. Jie Shao.

My research interests mainly focus on Multimedia Learning and AI Safety, including Hallucination Mitigation, Jailbreak Defense, Reasoning Vulnerabilities, and Spatial Intelligence of Multimodal Large Language Models.

🔥 News

  • 2025.06:   I was recognized as an outstanding reviewer (Top 5%) for CVPR 2025.
  • 2025.05:   One paper was accepted by IEEE Transactions on Image Processing (TIP 2025).
  • 2025.02:   One paper was accepted by IEEE Transactions on Image Processing (TIP 2025).
  • 2025.01:   One paper was accepted by International Journal of Computer Vision (IJCV 2025).
  • 2024.09:   One paper was accepted by Conference on Neural Information Processing Systems (NeurIPS 2024).

📝 Selected Publications

TPAMI 2023

Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
Xinyu Lyu, Lianli Gao, Pengpeng Zeng, Heng Tao Shen, Jingkuan Song Code

Ensuring a balanced and efficient learning process for fine-grained SGG.

IJCV 2025

Informative Scene Graph Generation via Debiasing
Lianli Gao, Xinyu Lyu (Corresponding Author), Yuyu Guo, Yuxuan Hu, Yuan-Fang Li, Lu Xu, Heng Tao Shen, Jingkuan Song Code

Making balanced and informative predicate predictions for unbiased SGG.

TIP 2025

Multi-Concept Learning for Scene Graph Generation
Xinyu Lyu, Lianli Gao, Junlin Xie, Pengpeng Zeng, Yulu Tian, Jie Shao, Heng Tao Shen Code

Addressing both predicate-level and concept-level imbalance issues for generalized unbiased SGG.

NeurIPS 2024

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
Xinyu Lyu†, Beitao Chen† (Equal Contribution), Lianli Gao, Jingkuan Song, Heng Tao Shen Code

Alleviating Hallucinations in LVLMs via Weak-to-Strong Generalization.

CVPR 2022

Fine-Grained Predicates Learning for Scene Graph Generation
Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song Video | Code

Differentiating hard-to-distinguish predicates for fine-grained SGG.

(†: equal contribution, *: corresponding author, ✧: Project Leader)

🎖 Honors and Awards

  • 2025.06 Outstanding Reviewer for CVPR 2025. (Webpage)
  • 2024.05 Academic Newcomer Honor of UESTC. (Webpage)
  • 2024.01 SCF Annual Outstanding Student Paper. (Webpage)

📚 Course Teaching

  • Graduate: Multimodal Large Models (Semester 1, 2025–2026)
  • Graduate: Principles and Practice of Multimodal Large Models (Semester 1, 2025–2026)
  • Graduate: Frontiers in Multimodal Large Models (Semester 2, 2025–2026)
  • Undergraduate: Artificial Intelligence and Modern Technology (Semesters 1–2, 2024–2026)

🙋 Service

  • Journal Reviewer: IEEE TPAMI, IJCV, IEEE TIP, IEEE TCSVT, IEEE TMM, IEEE TGRS, ACM TOMM, etc.
  • Conference Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICLR, ACM MM, AAAI, EMNLP, ICME, etc.
  • Competition Reviewer: Guangdong-Hong Kong-Macao Greater Bay Area (Whampoa) International Algorithm Calculation Competition 2022/2023.

💻 Main Projects

  • 2025.07 - 2027.07, Teaching Practice Research on an Inclusive Finance Risk-Control Dialogue System Driven by Multimodal Large-Scale Models, Sichuan Provincial Department of Education Innovative Experimental Project of Undergraduate, Project Leader.
  • 2025.01 - 2026.12, Research on Visual Relationship Detection Algorithms and Applications in Open World Settings, Sichuan Provincial Natural Science Foundation Project (No. 2025ZNSFSC1463), Project Leader.
  • 2023.02 - 2024.06, 2030 Science and Technology Innovation Major Project for Next-Generation Artificial Intelligence by the Ministry of Science and Technology, National Key Research and Development Program of China (No. 2018AAA0102200), Main Researcher.
  • 2021.01 - 2025.12, Research on Key Technologies for Visual Natural Cognition through Collaborative Vision and Language Processing, National Natural Science Foundation of China (No. 62020106008), Participant.
  • 2020.01 - 2022.07, Research on Scene Graph Generation for E-commerce Live Streaming/Video, Kuaishou (No. 202109FKY00302), Main Researcher.
  • 2019.01 - 2022.12, Research on Key Technologies for Collaborative Deep Video Understanding, Description, and Visual Question Answering, National Natural Science Foundation of China (No. 61872064), Participant.
  • 2018.01 - 2021.12, Research on Key Technology for Deep Visual Understanding Integrated with Natural Language Processing, National Natural Science Foundation of China (No. 61772116), Participant.
