👋 Hi there!

I am currently a joint PhD student at the Computer Graphics Lab of ETH Zürich and DisneyResearch|Studios, supervised by Prof. Markus Gross and Dr. Christopher Schroers. I also spent 4 months working on low-level vision at the Computer Vision Lab, ETH Zürich, supervised by Prof. Yulun Zhang. Before that, I received my M.E. and B.E. degrees from the Electronic Information School of Wuhan University in 2023 and 2020, respectively, where I worked closely with Prof. Lei Yu on event-based vision. I am a lifelong learner with broad interests, including 3D vision, low-level vision, signal processing, and neuromorphic computing. I am particularly interested in achieving robust perception in complex environments for real-world applications.

🔥 News

  • 2026.02:  🎉 HairGuard is accepted by CVPR 2026!
  • 2025.11:  🎉 SE-SRB is accepted by AAAI 2026! Congrats to Chi Zhang!
  • 2025.03:  🎉 SplatDiff is accepted by SIGGRAPH 2025!
  • 2025.01:  🎉 SelfUnroll is accepted by Springer IJCV 2025! Congrats to Mingyuan Lin and Yangguang Wang!
  • 2024.09:  🎉 BetterDepth is accepted by NeurIPS 2024!
  • 2024.07:  🎉 HiT-SR is accepted by ECCV 2024 (Oral)! Code, models, and results are released!
  • 2024.05:  🎉 CrossZoom is accepted by IEEE TPAMI 2024! Congrats to Chi Zhang!
  • 2024.02:  🎉 EBR is accepted by IEEE TIP 2024! Congrats to Shijie Lin!
  • 2023.08:  🛠️ The code of our GEM is released.
  • 2023.07:  🎉 GEM is accepted by ICCV 2023!
  • 2023.01:  🎉 eSL-Net++ is accepted by IEEE TPAMI 2023! Congrats to Bishan Wang!
  • 2022.12:  🎉 A-SSR is accepted by IEEE TSP 2022!
  • 2022.12:  🎉 Extended E-SAI is accepted by IEEE TPAMI 2022!
  • 2022.04:  🛠️ The code of our EVDI is released.
  • 2022.03:  🛠️ The code of our EF-SAI is released.
  • 2022.03:  🎉 EVDI and EF-SAI are accepted by CVPR 2022! Congrats to Wei Liao!
  • 2021.07:  🛠️ The code of our E-SAI is released.
  • 2021.06:  🍾 E-SAI is selected as one of the best paper candidates by CVPR 2021!
  • 2021.03:  🎉 E-SAI is accepted by CVPR 2021 (Oral)!
📝 Publications

    CVPR 2026

    Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views

    Xiang Zhang, Yang Zhang, Lukas Mehl, Markus Gross†, Christopher Schroers†

    [arXiv]

    • We present HairGuard to capture, model, and reconstruct fine-grained soft boundary details in 3D vision tasks, achieving state-of-the-art performance on monocular depth estimation, stereo conversion, and novel view synthesis.
    • We leverage image matting datasets for training, enabling HairGuard to automatically identify and fix soft boundaries without relying on manually crafted cues like trimaps. A plug-and-play depth fixer is proposed for precise refinement, alongside a color fuser for high-quality view synthesis.
    SIGGRAPH 2025

    High-Fidelity Novel View Synthesis via Splatting-Guided Diffusion

    Xiang Zhang, Yang Zhang, Lukas Mehl, Markus Gross†, Christopher Schroers†

    [Website] [Paper] [arXiv] [Supp] [Video]

    • We introduce SplatDiff, a pixel-splatting-guided video diffusion model for synthesizing novel views with consistent geometry and high-fidelity texture from a single image.
    • SplatDiff excels in single-view novel view synthesis, sparse-view novel view synthesis, and stereo video conversion, demonstrating remarkable cross-domain and cross-task performance.
    NeurIPS 2024

    BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation

    Xiang Zhang, Bingxin Ke, Hayko Riemenschneider, Nando Metzger, Anton Obukhov, Markus Gross†, Konrad Schindler, Christopher Schroers†

    [Website] [arXiv] [Poster]

    • We propose BetterDepth to boost zero-shot MDE methods with plug-and-play diffusion refiners, achieving robust affine-invariant MDE performance with fine-grained details.
    • We design global pre-alignment and local patch masking strategies to enable learning detail refinement from small-scale synthetic datasets while preserving rich prior knowledge from pre-trained MDE models for zero-shot transfer.
    ECCV 2024

    HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

    Xiang Zhang, Yulun Zhang, Fisher Yu

    ECCV 2024 Oral Presentation

    [Code] [Supp] [Video]

    • We propose a simple yet effective strategy (HiT-SR) to convert popular transformer-based SR methods to our hierarchical transformers, boosting SR performance by exploiting multi-scale features and long-range dependencies.
    • We design a spatial-channel correlation method that efficiently leverages spatial and channel features with computational complexity linear in the window size, enabling the use of large hierarchical windows, e.g., $64\times64$ windows.
    ICCV 2023

    Generalizing Event-Based Motion Deblurring in Real-World Scenarios

    Xiang Zhang, Lei Yu†, Wen Yang, Jianzhuang Liu, Gui-Song Xia

    [Code] [Dataset] [Youtube]

    • A scale-aware network is designed to allow flexible setups of input spatial resolutions and enable learning from different temporal scales of motion blur.
    • A self-supervised learning framework is proposed for model training with real-world data and performance generalization in spatial and temporal domains.
    • A multi-scale real-world blurry dataset (MS-RBD) is constructed to facilitate the evaluation of deblurring performance in real-world scenarios.
    TPAMI 2022

    Learning to See Through with Events

    Lei Yu†, Xiang Zhang, Wei Liao, Wen Yang, Gui-Song Xia

    [Code] [Dataset] [Bilibili]

    • We provide more analysis of the E-SAI framework, including more details on the components of triggered events and the corresponding epipolar geometry.
    • We design a spatial transformer network to automatically refocus the events collected by a moving event camera with fronto-parallel uniform motion, relaxing the dependence on prior information such as camera velocity and target depth.
    CVPR 2022

    Unifying Motion Deblurring and Frame Interpolation with Events

    Xiang Zhang, Lei Yu†

    [Code] [Youtube]

    • We present a unified framework for event-based video deblurring and interpolation (EVDI) that generates arbitrarily high frame-rate sharp videos from blurry inputs.
    • By utilizing the constraints between cross-modal frames and events, a fully self-supervised learning method is proposed to enable network training with real-world data without requiring ground-truth images.
    TSP 2022

    Spiking Sparse Recovery with Non-convex Penalties

    Xiang Zhang, Lei Yu†, Gang Zheng, Yonina C. Eldar

    [Paper]

    • We present an adaptive sparse spiking recovery (A-SSR) algorithm to solve a class of non-convex regularized SR problems with spiking neural networks.
    • When implemented on the neuromorphic Loihi chip, our A-SSR can solve sparse recovery problems with approximately 1% of the power consumption of the fast iterative shrinkage-thresholding algorithm.
    CVPR 2021

    Event-based Synthetic Aperture Imaging with a Hybrid Network

    Xiang Zhang*, Wei Liao*, Lei Yu†, Wen Yang, Gui-Song Xia

    CVPR 2021 Best Paper Candidate and Oral Presentation

    [Code] [Dataset] [Youtube]

    • An event-based synthetic aperture imaging (E-SAI) algorithm is proposed to see through dense occlusions even under extreme lighting conditions.
    • A hybrid network composed of a spiking encoder and a convolutional decoder is designed to mitigate the disturbances from occlusions and guarantee the overall reconstruction performance.

    * means equal contribution and † indicates my supervisor.

    💻 Services

    Conference Reviewer

    • Computer Vision and Pattern Recognition (CVPR)
    • International Conference on Computer Vision (ICCV)
    • European Conference on Computer Vision (ECCV)
    • Advances in Neural Information Processing Systems (NeurIPS)
    • International Conference on Learning Representations (ICLR)
    • International Conference on Machine Learning (ICML)

    Journal Reviewer

    Teaching

    🍹 Misc

    • 🎸 I love rock and hip-hop music. Mayday, Jay Chou, and Pharaoh are my favorites.
    • 📖 I enjoy reading all kinds of books. Echo's story inspired my thinking about love and traveling. My recent favorite is the sci-fi novel The Three-Body Problem written by Cixin Liu.
    • 🎮 I often relax by playing games, including roguelike games like Slay the Spire and card games like Hearthstone. I recently played Split Fiction and It Takes Two with my girlfriend, and they were super fun!
    • 🛫 Experiencing new cultures in new places always excites me. Here are some photos I took during my trips.