DHR-CLIP

Official implementation of "DHR-CLIP: Dynamic High-Resolution Object-agnostic Prompt Learning for Zero-shot Anomaly Segmentation"
by Jiyul Ham, Jun-Geal Baek.
Accepted to ICAIIC 2025 (Fukuoka, Japan).

Introduction

Zero-shot anomaly segmentation (ZSAS) detects and localizes defects in a target dataset without the need for training samples. This is particularly valuable in industrial quality control, where distributional shifts arise between training and operational environments or where data access is restricted. Recent vision-language models have demonstrated strong zero-shot performance across various visual tasks, but two issues make them difficult to apply directly to ZSAS: the granularity of local anomaly regions varies with resolution, and the models focus on class semantics rather than local defects. To address these issues, we propose DHR-CLIP, a novel approach that incorporates dynamic high-resolution processing to enhance ZSAS in industrial inspection tasks. We further adopt an object-agnostic prompt design to detect normal and anomalous patterns without relying on specific object semantics, apply deep-text prompt tuning in the text encoder for refined textual representations, and employ V-V attention layers in the vision encoder to capture detailed local features. The integrated framework identifies fine-grained anomalies by refining both image and text prompt design, providing precise localization of defects. The effectiveness of DHR-CLIP is demonstrated through comprehensive experiments on the real-world industrial datasets MVTec AD and VisA, achieving strong performance and generalization across diverse industrial scenarios.
[Figure] Overview of DHR-CLIP

[Figure] Motivation of DHR-CLIP

[Figure] Quantitative results (Table 1, Table 2)

Reproducibility

Implementation environment

  • Ubuntu==22.04.1 LTS
  • cuda==12.1.0
  • cudnn==8
  • python==3.10
  • pytorch==2.0.0
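The version pins above can be captured in a requirements file for easier reproduction. A minimal sketch, assuming a pip-based install (the torchvision pin is an assumption chosen to pair with torch 2.0.0; CUDA and cuDNN come from the system or conda environment, not from pip):

```
# hypothetical requirements.txt matching the environment listed above
torch==2.0.0
torchvision==0.15.1
```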

First, download the MVTec AD and VisA datasets, then generate the JSON files:

cd generate_dataset_json
python mvtec.py
python visa.py
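The JSON files index the dataset images for evaluation; the exact schema is defined by the repository's mvtec.py and visa.py. As a hedged illustration only, a minimal indexer for an MVTec-style directory layout (root/&lt;class&gt;/test/&lt;defect&gt;/*.png) might look like the sketch below — the field names `img_path` and `anomaly` are assumptions, not taken from the repository:

```python
import json
import os
import tempfile

def build_index(root):
    """Walk an MVTec-style layout (root/<class>/test/<defect>/*.png)
    and collect image paths with normal/anomaly labels.
    Hypothetical schema -- the repository's mvtec.py defines the real one."""
    index = {}
    for cls in sorted(os.listdir(root)):
        test_dir = os.path.join(root, cls, "test")
        if not os.path.isdir(test_dir):
            continue
        samples = []
        for defect in sorted(os.listdir(test_dir)):
            for name in sorted(os.listdir(os.path.join(test_dir, defect))):
                samples.append({
                    "img_path": os.path.join(cls, "test", defect, name),
                    # convention: the "good" folder holds normal images
                    "anomaly": 0 if defect == "good" else 1,
                })
        index[cls] = samples
    return index

# Demo on a throwaway directory tree
with tempfile.TemporaryDirectory() as root:
    for defect in ("good", "crack"):
        os.makedirs(os.path.join(root, "bottle", "test", defect))
        open(os.path.join(root, "bottle", "test", defect, "000.png"), "w").close()
    index = build_index(root)
    print(json.dumps(index, indent=2))
```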

Second, run the DHR-CLIP script:

bash run_DHRCLIP.sh
