Computer Vision · Multimodal Learning

Chenyuan Qu

Head of Technologies · PhD StudentAllsee · Vieunite · University of Birmingham

I work between doctoral research and industry technology leadership, combining computer-vision research with production experience in backend architecture, enterprise software, AI products, business operations, and commercialisation-facing systems.

Research

Research Interests

My recent work spans interpretable image representations, multimodal scene understanding, diffusion-based methods, and dataset-building for visual learning.

Computer Vision

Learning interpretable representations and robust visual understanding from images, scenes, and multimodal observations.

Multimodal Learning

Studying how visual, spatial, audio, textual, and viewpoint-specific signals can be fused for richer scene understanding.

Generative AI

Exploring controllable generative systems that connect representation learning with editing, restoration, and creative workflows.

AI for Science

Applying machine learning methods in scientifically grounded settings where interpretability and structure matter.

Publications

Selected Publications

Publications

Publications & Datasets

2025
  • Official DIFF pipeline overview showing diffusion feature extraction and fusion for cross-domain semantic segmentation.
    ICASSPSpotlight

    ICASSP

    2025

    Diffusion Features to Bridge Domain Gap for Semantic Segmentation

    Yuxiang Ji, Boyong He, Chenyuan Qu, Zhuoyue Tan, Chuan Qin, Liaoni Wu

    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

    DIFF leverages diffusion-model features to improve cross-domain semantic segmentation by extracting and fusing semantically rich representations across the diffusion process.

    arXivPaperCodeDOI
2024
2023
  • MeD
    ICCVSpotlight

    ICCV

    2023

    MeD

    Multi-view Self-supervised Disentanglement for General Image Denoising

    Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao

    IEEE/CVF International Conference on Computer Vision (ICCV)

    A self-supervised denoising framework that disentangles clean image structure from corruption by comparing multiple noisy views of the same latent scene.

Datasets

BinEgo-360

2025

BinEgo-360

A binocular egocentric and 360° panoramic multimodal dataset and challenge surface for scene understanding, aligned with spatial audio, text, and geo-metadata.

2026

2026

text-to-art-database

A privacy-safe text-to-image dataset released on Hugging Face, repacked into Parquet shards with embedded image bytes and organised into samples and iterations splits.

News

Recent News

5 May 2026

Started Help To Grow: Management at BCU

I started the 12-week Help To Grow: Management Course at Birmingham City University Business School, with sessions spanning strategy, digital transformation, marketing, operations, finance, and growth planning.

2025

VisualSplit accepted to BMVC 2025

Project page, paper, supplementary material, code, and model weights are publicly available.

2025

DIFF presented at ICASSP 2025

Collaborative work on diffusion features for cross-domain semantic segmentation.

2024

360+x published at CVPR 2024

Dataset paper with accompanying benchmark resources, code, and public dataset access.

2023

Started my PhD in the MI X group

I began doctoral research in 2023 on computer vision and multimodal learning in the MI X Group.

2023

MeD published at ICCV 2023

An early project on self-supervised image denoising using multi-view disentanglement.

Experience

Professional Experience

Feb 2023 — Present

University of Birmingham · MI X Group

PhD Student

2023 — Present

Doctoral research in computer vision, multimodal learning, and generative AI within the School of Computer Science.

Research Assistant

Dec 2023 — Present

Research on compositionality for foundation models, with a focus on representation, reasoning, and generative computer-vision systems.

Research Assistant

Feb 2023 — Dec 2023

Research on interpretable hydrological modelling and machine-learning methods for scientific analysis.

Sep 2022 — Present

Allsee · Vieunite

Head of Technologies

Dec 2024 — Present

Own the company-wide technology roadmap for digital transformation and AI acceleration, spanning backend architecture, enterprise software, business operations, sales enablement, and commercialisation-facing systems.

  • Set technical direction for cross-company systems, balancing maintainable engineering, operational control, commercial needs, and practical delivery across hardware, software, and business workflows.
  • Built a structured delivery operating system around Linear, connecting sales, product, operations, and engineering requests into prioritised roadmaps with accountable owners and clearer cross-team visibility.
  • Lead AI adoption from prototype to production workflow, using machine learning, generative AI, and automation to accelerate internal processes and product-facing capabilities.
  • Bridge leadership, engineering, operations, and sales by turning loosely defined business problems into concrete architectures, implementation plans, and shipped tools.

Full-stack Engineer

Dec 2023 — Dec 2024

Led major full-stack and traditional software-engineering work for the CMS and ERP platforms, including the CMS rebuild that moved the company away from fragmented board-specific codebases toward one shared software architecture across hardware boards and operating systems.

  • Re-architected the CMS software so one maintainable codebase could support multiple boards, deployment targets, and system environments.
  • Built the ERP system from the ground up, designing core workflows, data structures, and interfaces for internal business operations and management visibility.
  • Worked across frontend, backend, deployment, and device constraints, connecting product requirements with the engineering detail needed to ship reliable systems.

Algorithm Engineer

Sep 2022 — Dec 2023

Built early backend and AI capabilities for the Allsee and Vieunite product stack, developing recommendation infrastructure and Vieutopia AI art features that connected machine-learning experiments with production product needs.

  • Implemented backend services that supported AI-driven product features and internal workflows.
  • Developed recommendation-system components for content and product discovery.
  • Built Vieutopia AI art functionality, helping translate generative-model capability into usable product features.
Jul 2020 — Sep 2020

AsiaInfo Software Co. Ltd

Algorithm Engineer Intern

Jul 2020 — Sep 2020

Developed and optimised machine-learning components for a visual customer-service anchor in a mobile deployment setting.

Education

Academic Training

2021 — 2022

Master's Study

University of Birmingham

Postgraduate study completed at Birmingham before beginning doctoral research.

2018 — 2021

BSc in Physics

University of Southampton

Undergraduate training in physics, which continues to shape how I think about machine learning and computer vision.

Contact

Contact

The easiest way to reach me is by email.

University of Birmingham

cxq134@student.bham.ac.uk