Profile Alignment

I am a Ph.D. candidate in the VCIP Lab, College of Computer Science, Nankai University, under the supervision of Professors Jian Yang, Xiang Li, and Ming-Ming Cheng. I hold a first-class honors master's degree (4-year course) in Computer Science from University College London (UCL).

My research interests focus on computer vision, remote sensing object detection, large models, and multi-modal learning. I am actively seeking research internship and postdoctoral opportunities worldwide. For inquiries about my research or collaboration opportunities, please feel free to reach out via email at yuxuan.li.17 [at] ucl.ac.uk.

🎓 Education

  • 2023 – Present, Ph.D. Student in College of Computer Science, Nankai University, Tianjin, China. Research on computer vision, remote sensing object detection, large models, and multi-modal learning.
  • 2021 – 2023, Research Assistant in College of Computer Science, Nankai University, Tianjin, China.
  • 2017 – 2021, M.Eng. in Computer Science (4-year course), First Class Honors Degree, University College London (UCL), London, UK.

📚 Publications

# denotes corresponding author

Journal

IJCV 2024

LSKNet: A foundation lightweight backbone for remote sensing (IJCV)

Yuxuan Li, Xiang Li#, Yimian Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang#

[Paper] [BibTex] [Demo Video] [Code]

LSKNet can dynamically adjust its large spatial receptive field to better model the ranging context of various categories of objects in remote sensing scenarios.

CVMJ 2024

APF-GAN: Exploring asymmetric pre-training and fine-tuning strategy for conditional generative adversarial network (CVMJ)

Yuxuan Li, Lingfeng Yang, Xiang Li#

[Paper] [Code]

The APF-GAN model improves GAN-based image segmentation, surpassing GauGAN and winning the Second Jittor AI Challenge.

Conference

ICCV 2023

Large Selective Kernel Network for Remote Sensing Object Detection (ICCV 2023)

Yuxuan Li, Qibin Hou, Zhaohui Zheng, Ming-Ming Cheng, Jian Yang#, Xiang Li#

[Paper] [BibTex] [Demo Video] [Report/Forum] [Code] [知乎专栏]

LSKNet can dynamically adjust its large spatial receptive field to better model the ranging context of various categories of objects in remote sensing scenarios.

NeurIPS 2024

SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection (NeurIPS 2024 Spotlight)

Yuxuan Li, Xiang Li#, Weijie Li, Qibin Hou, Li Liu, Ming-Ming Cheng, Jian Yang#

[Paper] [BibTex] [Code] [知乎专栏]

SARDet-100K, the first large-scale multi-class SAR object detection dataset, along with the proposed MSFA pretraining framework, addresses domain and model disparities, significantly improving SAR object detection performance and advancing the field.

ACCV 2022

Spatial group-wise enhance: Enhancing semantic feature learning in CNN (ACCV 2022)

Yuxuan Li, Xiang Li#, Jian Yang

[Paper] [BibTex] [Code] [知乎专栏]

The Spatial Group-wise Enhance (SGE) module improves CNN performance by generating accurate spatial attention masks through local-global similarity within semantic groups, achieving notable accuracy gains on ImageNet and COCO tasks with minimal overhead.

📃 Other Publications

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection

Preprint, 2024
Yuxuan Li, Xiang Li, Yunheng Li, Yicheng Zhang, Yimian Dai, Qibin Hou, Ming-Ming Cheng, Jian Yang
[Paper]


RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

CVPR, 2024
Xin Zhang, Xue Yang, Yuxuan Li, Jian Yang, Ming-Ming Cheng, Xiang Li
[Paper]


DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction

Preprint, 2024
Yunheng Li, Yuxuan Li, Quansheng Zeng, Wenhai Wang, Qibin Hou, Ming-Ming Cheng
[Paper]


Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection

Preprint, 2024
Xinbin Yuan, Zhaohui Zheng, Yuxuan Li, Xialei Liu, Li Liu, Xiang Li, Qibin Hou, Ming-Ming Cheng
[Paper]


GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling

Preprint, 2024
Qun Dai, Chunyang Yuan, Yimian Dai, Yuxuan Li, Xiang Li, Kang Ni, Jianhui Xu, Xiangbo Shu, Jian Yang
[Paper]


Pick of the Bunch: Detecting Infrared Small Targets Beyond Hit-Miss Trade-Offs via Selective Rank-Aware Attention

IEEE TGRS, 2024
Yimian Dai, Peiwen Pan, Yuxuan Li, et al.
[Paper]


Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?

Preprint, 2023
Zheng Li*, Yuxuan Li*, Penghai Zhao, et al.
[Paper]


DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images

Transactions on Aerospace and Electronic Systems, 2024
Yimian Dai, Minrui Zou, Yuxuan Li, et al.
[Paper]

🏆 Competitions and Awards

  • 2023, Third prize in the “Jilin-1” Satellite Remote Sensing Application Youth Innovation and Entrepreneurship Competition (5th of 110 teams).
  • 2023, Outstanding Presentation Award at the 3rd Academic Forum for Graduate Students of the Journal of Image and Graphics.
  • 2019, Second prize in the Facebook Hack-a-Project Spring (2nd of 12 teams).

💻 Experience

Sept 2020 – June 2021

Master Project: Exploring the relation between cancers and mRNA with computer science methods

  • Aimed to identify which combinations of mRNA bases carry information about cancer metastasis.
  • Built a complex network based on correlations among mRNAs using Python and Gephi.
  • Performed network analysis to select informative mRNAs.
  • Investigated more effective network analysis and machine learning techniques, and how to predict cancers from the selected mRNAs with ML models.
Jun 2020 – Sept 2020

Paid summer internship at UCL computer science AI research group

  • Conducted a simulation study of how breaking news spreads on Twitter, based on ML models.
  • Analysed, reproduced, and evaluated previous ML models and methods.
  • Planned data mining, feature selection, and the addition of textual (NLP) features to the existing model.
Oct 2019 – May 2020

Forecasting influenza-like illness rates using online search data

  • Used Google query log data to train a model to nowcast and forecast influenza-like illness rates.
  • Examined state-of-the-art approaches, then implemented and compared different ML models.
  • Built a high-performing super-ensemble model.
Jun 2019 – Sep 2019

Summer Research on Deep Learning & Neural Network Data Augmentation

  • Used PyTorch to implement CNNs and surveyed current data augmentation approaches.
  • Merged four data augmentation strategies (Cutout, Mixup, CutMix, and AutoAugment) into one codebase.
  • Trained CNNs with different data augmentation strategies across different dataset and model combinations.
  • Analysed and compared the experimental results.
  • Proposed a new strategy called Random-mix and ran experiments to test its capability.
May 2019 – Jun 2019

How to Change the World–Cities

  • Designed a season-responsive solution: Transportable Respiratory Environmental Equipment (T.R.E.E.).
  • It aims to reduce air pollution during Medellin’s smog seasons alongside the government's air quality improvement plan.
  • It is a modular system consisting of air filtering, mist spreading and an interactive information display.
Sept 2018 – Apr 2019

Microsoft AI/Deep learning with Cortana Project

  • Built with Ionic, Microsoft Speech API, Google Maps API, Cognitive Services, Azure, and Bot Framework.
  • Invoked Cortana and used voice commands to interact with the Soundscape app.
  • Incorporated AI to enable Cortana to learn from people’s behaviour and make recommendations. Integrated a chatbot to help people interact with Soundscape and learn to use it effectively.
Oct 2018 – Nov 2018

Great Ormond Street Hospital 3D VR tour Project

  • Used Unity and the Google VR SDK to develop an Android 3D VR tour application demonstrating a new modern hospital concept. Users can wear Google VR glasses to view the future modern hospital.
  • Users can interact with the app by looking at a specific item in the VR scene.
Feb 2018 – Apr 2018

UCL Computer Science News Portal Project

  • Team leader, front-end developer, back-end developer, and researcher. Built with Azure, PHP, HTML, CSS, JavaScript, MySQL, jQuery, Bootstrap, and GitHub.
  • Created an online portal where students can upload media. Admins can log in to manage, delete, download, or tweet an article.
Oct 2017 – Dec 2017

Global Health Care-bioreactor controlling Project

  • Team leader of the heating sub-system.
  • A bioreactor was built to produce vaccines. Worked on the controls of pH level, temperature, and stirring speed, all of which were integrated into one user interface.

👥 Services

  • Conference: ICCV; NeurIPS; CVPR; etc.
  • Journal: IEEE TPAMI; TCSVT; TGRS; IJCV.