Zhuoxiao (Ivan) Chen

[陈卓潇]

I am a PhD candidate at The University of Queensland (UQ), Australia. My research focuses on 3D scene understanding and machine learning under the supervision of Dr Yadan Luo, Prof Helen Huang, and Dr Mahsa Baktashmotlagh. I received my Bachelor of Computer Science degree with First Class Honours from UQ in 2021. I was selected as the Valedictorian in the graduation ceremony in Dec 2021.

I will start Applied Scientist Intern at Oracle in June 2025.

Email: zhuoxiao [dot] chen [at] uq [dot] edu [dot] au

Google Scholar / GitHub / DBLP / IG

Publication

Open-CRB: Active Learning for 3D Object Detection from Open World
Zhuoxiao Chen, Yadan Luo, Zixin Wang, Zijian Wang, Zi Huang
TPAMI 2025
[arXiv] [code]
This paper investigates a more practical and challenging research task: Open World Active Learning for 3D Object Detection (OWAL-3D), aimed at acquiring informative point clouds with new concepts. We propose a simple yet effective strategy called Open Label Conciseness (OLC), which mines novel 3D objects with minimal annotation costs. Our empirical results show that OLC successfully adapts the 3D detection model to the open world scenario with just a single round of selection. Any generic AL policy can then be integrated with the proposed OLC to efficiently address the OWAL-3D problem.

MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection
Zhuoxiao Chen, Junjie Meng, Mahsa Baktashmotlagh, Yonggang Zhang, Zi Huang, Yadan Luo
ICLR 2025 (Oral)
[OpenReview] [arXiv] [code]
LiDAR-based 3D object detection is crucial for various applications but often experiences performance degradation in real-world deployments due to domain shifts. While most studies focus on cross-dataset shifts, such as changes in environments and object geometries, practical corruptions from sensor variations and weather conditions remain underexplored. In this work, we propose a novel online test-time adaptation framework for 3D detectors that effectively tackles these shifts, including a challenging cross-corruption scenario where cross-dataset shifts and corruptions co-occur. By leveraging long-term knowledge from previous test batches, our approach mitigates catastrophic forgetting and adapts effectively to diverse shifts.

DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim*, Zhuoxiao Chen*, Mahsa Baktashmotlagh, Zhi Chen, Xin Yu, Zi Huang, Yadan Luo
NeurIPS 2024
[OpenReview] [arXiv] [code]
Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many downstream vision tasks. Despite considerable advancements in bottom-up and multi-object discovery methods that leverage basic visual cues to identify salient objects, consistently achieving a high recall rate remains difficult due to the diversity of object types and their contextual complexity. In this work, we investigate using vision-language models (VLMs) to enhance object detection via a self-supervised prompt learning strategy.

DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection
Zhuoxiao Chen*, Zixin Wang*, Sen Wang, Zi Huang, Yadan Luo
MM 2024
[paper] [OpenReview] [arXiv] [code]
We propose dual-perturbation optimization (DPO) for Test-Time Adaptation for 3D Object Detection (TTA-3OD). We minimize the sharpness to cultivate a flat loss landscape to ensure model resiliency to minor data variations, thereby enhancing the generalization of the adaptation process. To fully capture the inherent variability of the test point clouds, we further introduce adversarial perturbation to the input BEV features to better simulate the noisy test environment. As the dual perturbation strategy relies on trustworthy supervision signals, we utilize a reliable Hungarian matcher to filter out pseudo-labels sensitive to perturbations. Additionally, we introduce early Hungarian cutoff to avoid error accumulation from incorrect pseudo-labels by halting the adaptation process.

In Search of Lost Online Test-Time Adaptation: A Survey
Zixin Wang, Yadan Luo, Liang Zheng, Zhuoxiao Chen, Sen Wang, Zi Huang
IJCV 2024
[paper] [arXiv] [code]
This article presents a comprehensive survey of online test-time adaptation (OTTA), focusing on effectively adapting machine learning models to distributionally different target data upon batch arrival. Despite the recent proliferation of OTTA methods, conclusions from previous studies are inconsistent due to ambiguous settings, outdated backbones, and inconsistent hyperparameter tuning, which obscure core challenges and hinder reproducibility.

Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling
Zhuoxiao Chen, Yadan Luo, Zi Huang, Zheng Wang, Mahsa Baktashmotlagh
ICCV 2023
[paper] [code] [video]
Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection. While effective, existing DA methods suffer from a substantial drop in performance when applied to a multi-class training setting, due to the co-existence of low-quality pseudo labels and class imbalance issues. In this paper, we address this challenge by proposing a novel ReDB framework tailored for learning to detect all classes at once. Our approach produces Reliable, Diverse, and class-Balanced pseudo 3D boxes to iteratively guide the self-training on a distributionally different target domain.

KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection
Yadan Luo, Zhuoxiao Chen, Zhen Fang, Zheng Zhang, Zi Huang, Mahsa Baktashmotlagh
ICCV 2023
[paper] [code]
Active learning (AL) seeks to mitigate the annotation burden through algorithms that use fewer labels and can attain performance comparable to fully supervised learning. Although AL has shown promise, current approaches prioritize the selection of unlabeled point clouds with high aleatoric and/or epistemic uncertainty, leading to the selection of more instances for labeling and reduced computational efficiency. In this paper, we resort to a novel kernel coding rate maximization (KECOR) strategy which aims to identify the most informative point clouds to acquire labels through the lens of information theory.

Exploring Active 3D Object Detection from a Generalization Perspective
Yadan Luo*, Zhuoxiao Chen*, Zijian Wang, Xin Yu, Zi Huang, Mahsa Baktashmotlagh
ICLR 2023 (Spotlight)
[OpenReview] [code]
To alleviate the high annotation cost in LiDAR-based 3D object detection, active learning is a promising solution that learns to select only a small portion of unlabeled data to annotate, without compromising model performance. We jointly investigate three novel criteria in our framework CRB for point cloud acquisition - label conciseness, feature representativeness and geometric balance, which hierarchically filters out the point clouds of redundant 3D bounding box labels, latent features and geometric characteristics (e.g., point cloud density) from the unlabeled sample pool and greedily selects informative ones with fewer objects to annotate.

Source-Free Progressive Graph Learning for Open-Set Domain Adaptation
Yadan Luo, Zijian Wang, Zhuoxiao Chen, Zi Huang, Mahsa Baktashmotlagh
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
[paper] [code]
We propose a Progressive Graph Learning (PGL) framework that decomposes the target hypothesis space into the shared and unknown subspaces, and then progressively pseudo-labels the most confident known samples from the target domain for hypothesis adaptation.

Research Services

I serve as a reviewer for the following top conferences:

ICML in 2025

ICLR in 2025

ICCV in 2025

WWW in 2025

NeurIPS in 2024, 2025

CVPR in 2024, 2025

ECCV in 2024

ACM MM in 2023, 2024, 2025

and top journals:

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) in 2024

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) in 2024

IEEE Transactions on Multimedia (TMM) in 2024

Teaching Experience

2022, 2023, 2024, 2025, Head of Tutor for postgraduate level course INFS7205 Advanced Techniques for High Dimensional Data

2022, Teaching Assistant (TA) Tutor for master thesis Data Science Capstone Project (DATA7901) and Data Science Capstone Project (DATA7902)

2022, 2024, Learning Designer for 2nd year university computer science courses

2021, Teaching Assistant (TA) Tutor for postgraduate level course INFS7202 Web Information Systems

Education

01/2022 - 12/2025, Doctor of Philosophy (PhD) , The University of Queensland

02/2018 - 12/2021, Bachelor of Computer Science (First Class Honours) , The University of Queensland, GPA Rank 1st

Awards

12/2021, Valedictorian, I was selected as the Valedictorian to present a speech in the graduation ceremony of the Faculty of Engineering, Architecture and Information Technology

12/2021, Graduate School Scholarships (UQGSS), these scholarships cover a living allowance stipend and tuition fee expenses

12/2020, UQ Excellence Scholarship for International Students, this scholarship provided by UQ covers 20% of your tuition fees per year, for GPA top 5% students only

Semester 1 & 2 in 2020, Semester 1 & 2 in 2021, Dean’s Commendation for Academic Excellence, The Faculty of Engineering, Architecture and Information Technology, has determined that students who demonstrate excellence in academic performance should receive acknowledgement of the achievement

Other Experience

07/2021 - now, Research Assistant (RA) at ITEE, UQ

08/2021 - 09/2021, Software Engineer for collaborative project (ITEE, UQ and Logon City Council, Australia) on intelligent platform for automated road defect detection and asset management [ project ]

03/2021, Web Developer for ACM Multimedia Asia 2021 Official Website

Welcome to use this website's source code, just add a link back to here. ✩