🧍‍♂️ About Me

I am a second-year graduate student at Sun Yat-sen University (advisor: Shen Zhao). My on-campus research focuses on computer vision. I also have hands-on experience in AIGC and recommender systems, gained through industry internships and research on retrieval.

🧱 Work Experience

  • 2024.06 - 2024.11: Tencent AI Lab – internship, primarily working on AIGC projects for audio-driven human gestures.
  • 2024.11 - 2025.03: Tencent Data Platform Department – internship, mainly involved in building data pipelines for multimodal large models.
  • 2025.04 - Now: JD.com – internship, specializing in model optimization for vector-based retrieval and research on generative retrieval.

🔥 News

  • 2025.05: 🎉 One paper was accepted by MICCAI 2025.
  • 2024.01: 🎉 One paper was accepted by ESWA 2024.
  • 2023.06: 🎉 Received the Excellent Undergraduate Graduation Thesis award of Sun Yat-sen University.

📖 Education

  • 2023.09 - Now: M.S., Sun Yat-sen University, Shenzhen, China
  • 2019.09 - 2023.06: B.S., Sun Yat-sen University, Shenzhen, China

📝 Main Publications

  • (CCF B) Zixuan Tang, Bai Sun, Shidan He, Bin Chen, Shen Zhao. MIBF-Net: Multi-modal information balanced fusion network for Clinical Diagnosis via Patient Narratives and Lesion Image. Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
  • (JCR Q1) Zixuan Tang, Bin Chen, An Zeng, Mengyuan Liu, Shen Zhao. Progressive deep snake for instance boundary extraction in medical images. Expert Systems with Applications, 2024.
  • (CCF B) Zixuan Tang, Yuhang Wen, Youjun Zhao, Mengyuan Liu. A Survey on Backbones for Deep Video Action Recognition. IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2024.
  • (CCF A, in review) Hao Jiang*, Zixuan Tang*, Xiaoyu He, Shen Zhao, Congcong Liu, Xve Jiang. LLM-EBR: Empowering Embedding-Based Retrieval with Large Language Models. Neural Information Processing Systems (NeurIPS), 2025.
  • (CCF B) Yuhang Wen, Zixuan Tang, Yunsheng Pang, Beichen Ding, Mengyuan Liu. Interactive spatiotemporal token attention network for skeleton-based general interactive action recognition. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
  • (CCF B) Shidan He, Enyuan Hu, Zixuan Tang, Bin Chen, Shen Zhao, et al. MedSoft-Diffusion: Medical Semantic-Guided Diffusion Model with Soft Mask Conditioning for Vertebral Disease Diagnosis. Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
  • (CCF A) Yi Zhang, Youjun Zhao, Yuhang Wen, Zixuan Tang, Mengyuan Liu. Facial prior guided micro-expression generation. IEEE Transactions on Image Processing, 33, 525-540.
  • (JCR Q3) Xinhua Xu, Yuhang Wen, Lu Zhao, Yi Zhang, Youjun Zhao, Zixuan Tang, Ziduo Yang, Calvin Yu-Chian Chen. CARes-UNet: Content-aware residual UNet for lesion segmentation of COVID-19 from chest CT images. Medical Physics, 2021.

🎖 Honors and Awards

  • First Prize of Outstanding Graduate Scholarship of Sun Yat-sen University
  • First Prize of Outstanding Undergraduate Scholarship of Sun Yat-sen University
  • Excellent Undergraduate Graduation Thesis of Sun Yat-sen University
  • First place in MEGC 2021, Track 1
  • Third place in CVPR 2022 UG2+, Track 2

🔧 Skills

  • Programming Languages: Proficient in Python; familiar with C++ and C#
  • Model Training Frameworks: Proficient in PyTorch Lightning, LLaMA Factory, and Distributed Data Parallel; familiar with DeepSpeed
  • Professional Software: Familiar with Maya and Blender