I am a Ph.D. candidate in the VCIP Lab, College of Computer Science, Nankai University, under the supervision of Professors Jian Yang, Xiang Li, and Ming-Ming Cheng. I hold a first-class honors master degree (4 years) in Computer Science from University College London (UCL).
My research interests focus on computer vision, remote sensing object detection, large models, and multi-modal learning. I am actively seeking research internship and postdoctoral opportunities worldwide. For inquiries about my research or collaboration opportunities, please feel free to reach out via email at yuxuan.li.17 [at] ucl.ac.uk.
🎓 Educations
- 2023 – Present, Ph.D. Student in College of Computer Science, Tianjin, Nankai University. Research on computer vision, remote sensing object detection, large models, and multi-modal learning.
- 2021 – 2023, Research Assistant in College of Computer Science, Nankai University, Tianjin, China.
- 2017 – 2021, M.Eng. in Computer Science (4 years course), First Class Honors Degree, University College London (UCL), London, UK.
📚 Publications
# Corresponding author
Journal

LSKNet: A foundation lightweight backbone for remote sensing (IJCV)
Yuxuan Li, Xiang Li#, Yimian Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang#
[Paper]
[BibTex]
[Demo Video]
[Code]
LSKNet can dynamically adjust its large spatial receptive field to better model the ranging context of various categories of objects in remote sensing scenarios.

Yuxuan Li, Lingfeng Yang, Xiang Li#
The APF-GAN model improves GAN-based image segmentation, surpassing GauGAN and winning the Second Jittor AI Challenge.
Conference

Large Selective Kernel Network for Remote Sensing Object Detection (ICCV 2023)
Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang#, Xiang Li#
[Paper]
[BibTex]
[Demo Video]
[Report/Forum]
[Code]
[知乎专栏]
LSKNet can dynamically adjust its large spatial receptive field to better model the ranging context of various categories of objects in remote sensing scenarios.

Yuxuan Li, Xiang Li#, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang#
[Paper]
[BibTex]
[Code]
[知乎专栏]
SARDet-100K, the first large-scale multi-class SAR object detection dataset, along with the proposed MSFA pretraining framework, addresses domain and model disparities, significantly improving SAR object detection performance and advancing the field.

Spatial group-wise enhance: Enhancing semantic feature learning in CNN (ACCV 2022)
Yuxuan Li, Xiang Li#, Jian Yang
[Paper]
[BibTex]
[[Code]]
[知乎专栏]
The Spatial Group-wise Enhance (SGE) module improves CNN performance by generating accurate spatial attention masks through local-global similarity within semantic groups, achieving notable accuracy gains on ImageNet and COCO tasks with minimal overhead.
📃 Other Publications
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
Preprint, 2024
Yuxuan Li, Xiang Li, Yunheng Li, Yicheng Zhang, Yimian Dai, Qibin Hou, Ming-Ming Cheng, Jian Yang
[Paper]
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
CVPR, 2024
Xin Zhang, Xue Yang, Yuxuan Li, Jian Yang, Ming-Ming Cheng, Xiang Li
[Paper]
DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction
Preprint, 2024
Yunheng Li, Yuxuan Li, Quansheng Zeng, Wenhai Wang, Qibin Hou, Ming-Ming Cheng
[Paper]
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
Preprint, 2024
Xinbin Yuan, Zhaohui Zheng, Yuxuan Li, Xialei Liu, Li Liu, Xiang Li, Qibin Hou, Ming-Ming Cheng
[Paper]
GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling
Preprint, 2024
Qun Dai, Chunyang Yuan, Yimian Dai, Yuxuan Li, Xiang Li, Kang Ni, Jianhui Xu, Xiangbo Shu, Jian Yang
[Paper]
Pick of the Bunch: Detecting Infrared Small Targets Beyond Hit-Miss Trade-Offs via Selective Rank-Aware Attention
IEEE TGRS, 2024
Yimian Dai, Peiwen Pan, Yuxuan Li, et al.
[Paper]
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Preprint, 2023
Zheng Li*, Yuxuan Li*, Penghai Zhao, et al.
[Paper]
DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images
Transactions on Aerospace and Electronic Systems, 2024
Yimian Dai, Minrui Zou, Yuxuan Li, et al.
[Paper]
🏆 Competitions and Awards

- 2022, Champion of Jittor AI competition, 50,000 RMB bonus (1st from 154 teams).

- 2022, First Prize of the 5th Creative Open-Source competition, 50,000 RMB bonus.

- 2022, Second place of IACC International Algorithm Case Competition, 100,000 RMB bonus (2nd from 116 teams).

- 2023, Third prize of Jittor AI competition, 10,000 RMB bonus (4th from 109 teams).


- 2023, Outstanding Presentation Award The 3rd Academic Forum for Graduate Students of Journal of Image and Graphics.

- 2019, Second prize of Facebook Hack-a-Project Spring (2 out of 12 teams).
💻 Experiments

Master Project: Exploring the relation between cancers and mRNA with computer science methods
- Aim to find out which combination of mRNA bases have information of cancer metastasis.
- Built a complex network based on correlations among mRNAs using Python and Gephi.
- Did network analysis to filter out informative mRNAs.
- Currently researching on more effective network analysis and machine learning techniques,
- And how to predict cancers based on the filtered mRNAs with ML models.

Paid summer internship at UCL computer science AI research group
- A simulation study on Twitter breaking news spreading based on ML Models.
- Analysed, reproduced and evaluated previous ML models and methods.
- Will do data mining, feature selection and adding textural features(NLP) to the existing model.

Forecasting influenza-like illness rates using online search data
- Use Google query logs data to train a model to nowcast and forecast influenza-like illness.
- Examined state-of-the-art approaches, implemented and compared different ML models.
- Built a super-ensemble model which has a high performance.

Summer Research on Deep Learning & Neural Network Data Augment
- Use pytorch to implement CNN and research on current data augmentation approaches.
- Merge 4 data augment strategies (Cutout, Mixup, Cutmix and Autoaugment) into one project code.
- Train the RNN with different data augmentation strategies on different Dataset x RNN models.
- Analyse and contrast on the experiment results.
- Come up with a new strategy called Random-mix and run experiments to test the capability.

How to Change the World–Cities
- Designed a season responsive solution: Transportable Respiratory Environmental Equipment. (T.R.E.E.)
- Aims to reduce air pollution during Medellin’s smog seasons alongside the government air quality improvement plan.
- It is a modular system consisting of air filtering, mist spreading and an interactive information display.

Microsoft AI/Deep learning with Cortana Project
- Using Ionic, Microsoft Speech API, Google map API, Cognitive Service, Azure and Bot framework.
- Invoke Cortana and use voice commands to interact with Soundscape App.
- Incorporate AI to enable Cortana to learn from people’s behaviour and make recommendations. Integrate a chatbot to enable people to interact with Soundscape and learn to use it effectively.

Great Ormond Street Hospital 3D VR tour Project
- Using Unity and google VR SDK to develop an Android 3D VR tour Application to demo a new modern hospital concept. People can wear google VR glasses to see a further modern hospital.
- Users can interact with the app by looking at a specific item in the VR.

UCL Computer Science News Portal Project
- Team leader, front-end developer, back-end developer and researcher. Using Azure, PHP, HTML, CSS, JavaScript, MySQL, jQuery, Bootstrap and Github.
- Created an online portal for students to upload media on. Admin can login to manage, delete, download or tweet an article.
- pH level, temperature and stirring speed, all of which are integrated into one user interface.

Global Health Care-bioreactor controlling Project
- Team leader of the heating sub-system.
- A bioreactor is built to produce the vaccines. Worked on the controls of pH level, temperature and stirring speed, all of which integrated into one user
👥 Services
- Conference: ICCV; NeurIPS; CVPR; etc.
- Journal: IEEE TPAMI; TCSVT; TGRS; IJCV.