[2026.02] GAPL has been accepted to CVPR 2026!
[2025.12] Paper released on arXiv
Figure 1: Overview of our proposed Generator-Aware Prototype Learning (GAPL) framework.
As AI-generated image (AIGI) detection scales up, generated images become highly heterogeneous, causing previous AIGI detectors to fail to scale. We learn a small set of forgery concepts as Generator-Aware Prototypes, mapping the outputs of diverse generators onto this compact set of prototypes.
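The prototype-matching idea can be sketched as nearest-prototype assignment: an image feature is mapped to the forgery concept it is most similar to. The snippet below is an illustration of that idea only, not the repository's implementation.

```python
# Illustrative sketch (NOT the repo code): each prototype is a feature vector
# capturing one forgery concept; an image feature is assigned to the prototype
# with the highest cosine similarity.
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def assign_prototype(feature, prototypes):
    """Return the index of the prototype most similar to `feature`."""
    sims = [cosine(feature, p) for p in prototypes]
    return max(range(len(sims)), key=sims.__getitem__)

# Toy example: the feature is closer to the second prototype.
prototypes = [[1.0, 0.0], [0.0, 1.0]]
print(assign_prototype([0.1, 0.9], prototypes))  # -> 1
```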
We provide the minimum required packages in requirements.txt. You can check whether your environment satisfies them, or set up an environment with the following command:

```bash
pip install -r requirements.txt
```

To evaluate the performance of the proposed GAPL, you need to download the checkpoint from:
- Pretrained: Hugging Face
To reproduce the results reported in our paper across various benchmarks:
- Modify the dataset paths in `benchmarks.py` to point to your local data.
- Run the evaluation script:

```bash
bash scripts/val_bench.sh
```

You can also run inference on a single image to detect whether it is Real or Fake:
```bash
python inference.py \
    --model_path pretrained/checkpoint.pt \
    --image_path assets/test_image.jpg \
    --device cuda
```

Output Example:
```
[INFO] Loading model from pretrained/checkpoint.pt...
[RESULT] Image: assets/test_image.jpg
  -> Prediction: Fake (AI-Generated)
  -> Confidence: 99.8%
```
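For reference, a Real/Fake label with a confidence like the one printed above can be derived from a single raw detector logit. The sketch below assumes a sigmoid-probability-of-Fake convention; the actual `inference.py` may compute it differently.

```python
# Illustrative post-processing of a binary detector logit (assumed convention:
# sigmoid gives the probability of "Fake"; not necessarily what inference.py does).
import math

def to_prediction(logit, threshold=0.5):
    prob_fake = 1.0 / (1.0 + math.exp(-logit))
    label = "Fake (AI-Generated)" if prob_fake >= threshold else "Real"
    # Report confidence in the predicted class, whichever it is.
    confidence = prob_fake if prob_fake >= threshold else 1.0 - prob_fake
    return label, confidence

label, conf = to_prediction(6.4)
print(f"Prediction: {label}, Confidence: {conf:.1%}")
# -> Prediction: Fake (AI-Generated), Confidence: 99.8%
```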
Before starting, please ensure you have prepared the required datasets:
- Stage 1 Data:
- Stage 2 Data:
- Community Forensics (Small Training Set): Please download it from Hugging Face.
- Download Link: OwensLab/CommunityForensics-Small
In this stage, we train the backbone and learn the initial generator-aware prototypes.
Step 1: Configure Paths
Please open prototype_dataset.py and modify the dataset paths to match your local environment.
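The edit typically amounts to pointing a few root-path variables at your local data. The variable names below are hypothetical, for illustration only; check `prototype_dataset.py` for the actual names.

```python
# Hypothetical example of the kind of paths to update in prototype_dataset.py
# (variable names are assumptions, not the repo's actual identifiers).
STAGE1_DATA_ROOT = "/path/to/your/stage1_data"    # real/fake training images
PROTOTYPE_SAVE_DIR = "/path/to/save/prototypes"   # where extracted vectors go
```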
Step 2: Train Backbone

Run the following script to start training:

```bash
bash scripts/stage1.sh
```

Step 3: Extract Prototypes

After the backbone training converges, run the extraction script to generate the prototype vectors:

```bash
python prototype/dream_prototype.py
```
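Conceptually, a generator-aware prototype can be as simple as an aggregate of one generator's backbone features. The sketch below averages features per generator purely to illustrate the idea; see `prototype/dream_prototype.py` for the actual extraction procedure.

```python
# Minimal illustration (NOT the repo's method): build one prototype per
# generator by averaging that generator's feature vectors dimension-wise.
from collections import defaultdict

def extract_prototypes(features, generator_ids):
    """features: list of vectors; generator_ids: parallel list of labels."""
    buckets = defaultdict(list)
    for feat, gid in zip(features, generator_ids):
        buckets[gid].append(feat)
    return {
        gid: [sum(col) / len(feats) for col in zip(*feats)]  # per-dim mean
        for gid, feats in buckets.items()
    }

feats = [[1.0, 2.0], [3.0, 4.0], [0.0, 0.0]]
gids = ["sdxl", "sdxl", "gan"]
print(extract_prototypes(feats, gids))
# -> {'sdxl': [2.0, 3.0], 'gan': [0.0, 0.0]}
```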
Fast Track: We provide the pre-trained Stage 1 checkpoint and pre-extracted prototype vectors. You can skip this stage by downloading them from pretrained.
In the second stage, we fine-tune the model using the Community Forensics dataset to enhance robustness against diverse generators.
Run Training:

```bash
bash scripts/stage2.sh
```
If you find our work useful in your research, please consider citing:
```bibtex
@article{qin2025Scaling,
  title={Scaling Up AI-Generated Image Detection with Generator-Aware Prototypes},
  author={Qin, Ziheng and Ji, Yuheng and Tao, Renshuai and Tian, Yuxuan and Liu, Yuyang and Wang, Yipu and Zheng, Xiaolong},
  journal={arXiv preprint arXiv:2512.12982},
  year={2025}
}
```

Our code builds on the following excellent open-source repositories. We appreciate their work and contributions to the community:
Community Forensics: We use the dataset and borrow some code from this codebase.
