DFBench: The Image Deepfake Detection Benchmark 2025

DFBench provides a standardized evaluation for computer vision deepfake detection systems. This leaderboard focuses on image deepfake detection, e.g. the output of text-to-image and image-to-image models.

Objectives:

Allow fair comparison between deepfake detection models on unseen test data (no fine tuning on the test data possible)
Advance the state-of-the-art in synthetic media identification

Leaderboard Image Deepfake Detection

Rank	Model	Accuracy	Accuracy on Real	Accuracy on Fake	Accuracy on JPEG	Accuracy on PNG	Accuracy on WEBP	Accuracy on TIFF
1	Resemble.ai	85.1	94.7	75.4	81.2	90.4	82.2	90.8
2	RECCE	67.3	99.4	35.1	64.2	69.5	68.8	69.8
3	Xception	66.1	99.3	33.0	63.8	67.4	69.0	66.7
4	ResNet101	65.5	97.7	33.4	63.1	67.2	66.7	67.5
5	Xception SLADD	65.0	99.9	30.1	62.5	65.6	67.2	67.4
6	STIL	64.7	98.3	31.2	61.5	67.4	67.7	65.8
7	ResNet34	64.0	98.4	29.6	61.8	65.8	65.0	65.6
8	VGG19	60.7	99.5	21.9	57.5	61.8	64.0	62.8
9	EfficientNetB4	58.2	99.7	16.8	55.5	60.6	61.1	58.4
10	CLIP	55.4	94.0	16.9	54.6	57.4	56.6	54.0
11	Xception FFD	54.8	97.3	12.3	53.7	56.4	56.3	54.1

The Leaderboard is updated upon validation of new submissions. All results are evaluated on the official test dataset.