👋 About Me

My Experiences

2021-Present: X3000 Inspection
Technology Partner at X3000 Inspection (pre-A round completed in 2023, A round completed in 2024), focusing on the design and deployment of machine vision algorithms.

2019-Present: Dalian University of Technology (DUT)
Pursuing a Ph.D. in Signal and Information Processing, supervised by Prof. Lihe Zhang and Prof. Huchuan Lu (IEEE Fellow) at the School of Information and Communication Engineering.

2015-2019: Dalian University of Technology (DUT)
Received the B.E. degree in Electronic and Information Engineering from the School of Information and Communication Engineering.

Research Interests

My current research interests include Deep Learning, Computer Vision, and Neural Network Design.

In particular, I focus on:
  • Multi-Modal Learning: RGB+Depth/Thermal/Temporal, RGB+Text (Open-Vocabulary)
  • Effective/Efficient Architecture Design: CNN/Transformer/High-resolution/Task-generic
  • Context-dependent Concept Perception: Salient/Camouflaged/Shadow/Transparent
  • Complex Scene Perception: Camouflaged Scenes, Remote Sensing Change Detection, Multi-Modal Crowd Counting, Multi-Modal Semantic Segmentation
  • Industrial Machine Vision: X-ray/CT, Lithium Battery
  • Medical Image Analysis: Colon Polyp, COVID-19, Breast, Skin
🤝 I work very closely with my best friend, Xiaoqi Zhao, on most of my projects.

📢 News

2024 ECCV (1 Paper).
2024 TPAMI (1 Paper).
2024 ICML (1 Poster).
2024 CVPR (1 Poster, 1 Highlight).
2024 IJCV (2 Papers).
2023 ICCV (1 Poster).
2023 IEEE TIP (1 Paper).
2022 IEEE TIP (1 Paper).
2022 PRCV (1 Poster).
2022 CVPR (1 Poster).
2022 AAAI (1 Poster).
2021 ACM MM (1 Oral).
2020 ECCV (2 Posters, 1 Oral).
2020 CVPR (1 Poster).

💻 Project

  • Hands-on-Docker (in Chinese): A detailed guide to using Docker.
  • Awesome-Class-Activation-Map: An awesome list of papers and tools about the class activation map (CAM) technology.
  • PyTorchTricks: Some tricks of PyTorch…
  • MethodsCmp: A simple toolkit for counting the FLOPs/MACs, parameters, and FPS of PyTorch-based methods.
  • PySODEvalToolkit: A Python-based evaluation toolbox for salient object detection and video object segmentation.
  • PySODMetrics: A simple and efficient implementation of SOD metrics.
  • PyLoss: Some loss functions for deep learning.
  • OpticalFlowBasedVOS: A simple and efficient codebase for optical flow based video object segmentation.
  • CoSaliencyProj: A project for co-saliency detection. Some code is borrowed from ICNet (NIPS2020).
  • RunIt: A simple program scheduler for your code on different devices.
  • RegisterIt: A more flexible register for deep learning projects.
  • mssim.pytorch: A better PyTorch-based implementation of the mean structural similarity. Differentiable, simpler SSIM and MS-SSIM.
  • tta.pytorch: A test-time augmentation library for PyTorch.
  • YuQueTools: A simple tool to download your own articles from Yuque.
  • ManageMyAttachments: Manage the attachments of your own Obsidian vault.

📖 Paper

Under Peer Review

ComPtr: Towards Diverse Bi-source Dense Prediction Tasks via A Simple yet General Complementary Transformer

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

ArXiv 2023

[Paper (ArXiv)] [Code (GitHub)]

M2SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation

Xiaoqi Zhao, Hongpeng Jia, Youwei Pang, Long Lv, Feng Tian, Lihe Zhang, Weibing Sun, Huchuan Lu

ArXiv 2023

[Paper (ArXiv)]

Publication

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang*, Xiaoqi Zhao*, Jiaming Zuo, Lihe Zhang, Huchuan Lu

European Conference on Computer Vision (ECCV) 2024, Springer

[Paper (ArXiv)] [Code (GitHub)]

ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection

Youwei Pang*, Xiaoqi Zhao*, Tian-Zhu Xiang*, Lihe Zhang, Huchuan Lu

Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024, IEEE

[EI:20242616353583] [DOI:10.1109/TPAMI.2024.3417329] [Paper (ArXiv)] [Code (GitHub)]

Spider: A Unified Framework for Context-dependent Concept Understanding

Xiaoqi Zhao*, Youwei Pang*, Wei Ji*, Baicheng Sheng, Jiaming Zuo, Lihe Zhang, Huchuan Lu

International Conference on Machine Learning (ICML) 2024, PMLR

[EI:20243817053162] [Paper (ArXiv)] [Code]

Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline

Xiaoqi Zhao*, Youwei Pang*, Zhenyu Chen, Qian Yu, Lihe Zhang, Hanqi Liu, Jiaming Zuo

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024, IEEE

[EI:20244317260998] [DOI:10.1109/CVPR52733.2024.02079] [Paper (ArXiv)] [Project] [工源三仟 WeChat Official Account] [Code]

Multi-view Aggregation Network for Dichotomous Image Segmentation

Qian Yu*, Xiaoqi Zhao*, Youwei Pang*, Lihe Zhang, Huchuan Lu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024, IEEE (Highlight)

[EI:20244317260270] [DOI:10.1109/CVPR52733.2024.00376] [Paper (ArXiv)] [Code]

Towards Diverse Binary Segmentation via A Simple yet General Gated Network

Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang

International Journal of Computer Vision (IJCV) 2024, Springer

[EI:20241916055804] [WOS:001215379300003] [DOI:10.1007/s11263-024-02058-y] [Paper (ArXiv)] [Code]

Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation

Xiaoqi Zhao, Shijie Chang, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu

International Journal of Computer Vision (IJCV) 2024, Springer

[EI:20241015709995] [WOS:001176531200002] [DOI:10.1007/s11263-024-02024-8] [Paper (ArXiv)]

Adaptive Illumination Mapping for Shadow Detection in Raw Images

Jiayu Sun, Ke Xu, Youwei Pang, Lihe Zhang, Huchuan Lu, Gerhard Hancke, Rynson Lau

IEEE/CVF International Conference on Computer Vision (ICCV) 2023, IEEE

[EI:20240915634899] [WOS:001169499005013] [DOI:10.1109/ICCV51070.2023.01167] [Paper (CVF)] [Code]

CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Transactions on Image Processing (TIP) 2023, IEEE

[EI:20230613542686] [WOS:000922870200004] [DOI:10.1109/TIP.2023.3234702] [Paper (ArXiv)] [Paper (IEEE)] [Code] [Project]

Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction

Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu

Transactions on Image Processing (TIP) 2022, IEEE

[EI:20225113272379] [WOS:000892917400002] [DOI:10.1109/TIP.2022.3222641] [Paper] [Code]

Zoom In and Out: A Mixed-scale Triplet Network for Camouflaged Object Detection

Youwei Pang*, Xiaoqi Zhao*, Tian-Zhu Xiang, Lihe Zhang, Huchuan Lu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, IEEE

[EI:20224613119658] [WOS:000867754202041] [DOI:10.1109/CVPR52688.2022.00220] [Paper] [Code] [Project]

Self-Supervised Pretraining for RGB-D Salient Object Detection

Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Xiang Ruan

AAAI Conference on Artificial Intelligence (AAAI) 2022

[EI:20230713571733] [WOS:000893636203061] [Paper] [Slide & 极市平台 Post] [Code]

Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation

Xiaoqi Zhao, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu

ACM International Conference on Multimedia (ACM MM) 2021 (Oral)

[EI:20214711200241] [WOS:001147786902077] [Paper] [Slide & 极市平台 Post] [Code]

Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection

Youwei Pang, Lihe Zhang, Xiaoqi Zhao, Huchuan Lu

European Conference on Computer Vision (ECCV) 2020, Springer

[EI:20205009617977] [DOI:10.1007/978-3-030-58595-2_15] [Paper] [Slide] [Code]

Suppress and Balance: A Simple Gated Network for Salient Object Detection

Xiaoqi Zhao*, Youwei Pang*, Lihe Zhang, Huchuan Lu, Lei Zhang

European Conference on Computer Vision (ECCV) 2020, Springer (Oral)

[EI:20205009597084] [DOI:10.1007/978-3-030-58536-5_3] [Paper] [Slide] [Code]

A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection

Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, Lei Zhang

European Conference on Computer Vision (ECCV) 2020, Springer

[EI:20205009601094] [DOI:10.1007/978-3-030-58542-6_39] [Paper] [Code]

Multi-scale Interactive Network for Salient Object Detection

Youwei Pang*, Xiaoqi Zhao*, Lihe Zhang, Huchuan Lu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020, IEEE

[EI:20204409431574] [WOS:001309199902028] [DOI:10.1109/CVPR42600.2020.00943] [Paper] [Code]