🧍‍♂️ About Me
I am a second-year graduate student at Sun Yat-sen University (advisor: Shen Zhao), and my on-campus research focuses on computer vision. I also have hands-on experience in AIGC and recommender systems through industry internships and publications.
🧱 Work Experience
- 2024.06 - 2024.11: Tencent AI Lab – internship, primarily working on AIGC projects for audio-driven human gestures.
- 2024.11 - 2025.03: Tencent Data Platform Department – internship, mainly building data pipelines for multimodal large models.
- 2025.04 - Now: JD.com – internship, specializing in model optimization for vector-based retrieval and research on generative retrieval.
🔥 News
- 2025.05: 🎉 One paper accepted by MICCAI 2025.
- 2024.01: 🎉 One paper accepted by ESWA 2024.
- 2023.06: 🎉 Received the Excellent Undergraduate Graduation Thesis award from Sun Yat-sen University.
📖 Education
- 2023.09 - Now: M.S., Sun Yat-sen University, Shenzhen, China
- 2019.09 - 2023.06: B.S., Sun Yat-sen University, Shenzhen, China
📝 Main Publications
- (CCF B) Zixuan Tang, Bai Sun, Shidan He, Bin Chen, Shen Zhao. MIBF-Net: Multi-modal information balanced fusion network for Clinical Diagnosis via Patient Narratives and Lesion Image. Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
- (JCR Q1) Zixuan Tang, Bin Chen, An Zeng, Mengyuan Liu, Shen Zhao. Progressive deep snake for instance boundary extraction in medical images. Expert Systems with Applications, 2024.
- (CCF B) Zixuan Tang, Yuhang Wen, Youjun Zhao, Mengyuan Liu. A Survey on Backbones for Deep Video Action Recognition. IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2024.
- (CCF A, in review) Hao Jiang*, Zixuan Tang*, Xiaoyu He, Shen Zhao, Congcong Liu, Xve Jiang. LLM-EBR: Empowering Embedding-Based Retrieval with Large Language Models. Neural Information Processing Systems (NeurIPS), 2025.
- (CCF B) Yuhang Wen, Zixuan Tang, Yunsheng Pang, Beichen Ding, Mengyuan Liu. Interactive spatiotemporal token attention network for skeleton-based general interactive action recognition. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
- (CCF B) Shidan He, Enyuan Hu, Zixuan Tang, Bin Chen, Shen Zhao, et al. MedSoft-Diffusion: Medical Semantic-Guided Diffusion Model with Soft Mask Conditioning for Vertebral Disease Diagnosis. Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
- (CCF A) Yi Zhang, Youjun Zhao, Yuhang Wen, Zixuan Tang, Mengyuan Liu. Facial prior guided micro-expression generation. IEEE Transactions on Image Processing, 33, 525-540.
- (JCR Q3) Xinhua Xu, Yuhang Wen, Lu Zhao, Yi Zhang, Youjun Zhao, Zixuan Tang, Ziduo Yang, Calvin Yu-Chian Chen. CARes-UNet: Content-aware residual UNet for lesion segmentation of COVID-19 from chest CT images. Medical Physics, 2021.
🎖 Honors and Awards
- First Prize of Outstanding Graduate Scholarship of Sun Yat-sen University
- First Prize of Outstanding Undergraduate Scholarship of Sun Yat-sen University
- Excellent Undergraduate Graduation Thesis of Sun Yat-sen University
- First place in MEGC 2021 Track 1
- Third place in CVPR 2022 UG2+ Track 2
🔧 Skills
- Programming languages: proficient in Python; familiar with C++ and C#
- Model training frameworks: proficient in PyTorch Lightning, LLaMA Factory, and Distributed Data Parallel (DDP); familiar with DeepSpeed
- Professional software: familiar with Maya and Blender