Face Forensics++ Benchmark Dataset

https://github.com/ondyari/faceforensics

Overview

FaceForensics++ is a forensics dataset consisting of 1000 original video sequences that have been manipulated with four automated face manipulation methods: Deepfakes, Face2Face, FaceSwap and NeuralTextures. The data has been sourced from 977 youtube videos and all videos contain a trackable mostly frontal face without occlusions which enables automated tampering methods to generate realistic forgeries. As we provide binary masks the data can be used for image and video classification as well as segmentation. In addition, we provide 1000 Deepfakes models to generate and augment new data.

Associated Paper or Article

More information can be found by reading FaceForensics++: Learning to Detect Manipulated Facial Images.

Download

If you would like to download the FaceForensics++ dataset, please fill out this google form and, once accepted, we will send you the link to our download script. If you have not received a response within a week, it is likely that your email is bouncing - please check this before sending repeat requests.

Once, you obtain the download link, please head to the download section. You can also find details about the generation of the dataset there.

Model

XceptionNet from the associated paper trained on FaceForensics++ dataset.

Besides the full image models, all models were trained on slightly enlarged face crops with a scale factor of 1.3. The models were trained using the Face2Face face tracker, though the detect_from_models.py file uses the freely available dlib face detector.

More information on the classification model can be found here. Please check requirements.txt that can be found by accessing the link and make sure you have Python 3.6 installed on your machine.

Benchmarks

An automated benchmark for this dataset can be found here. This benchmark is for facial manipulation detection on the presence of compression based on our manipulation methods that contains 1000 images.

Size

The original downladed source videos from youtube: 38.5GB.

All h264 compressed videos with compression rate factor.

All raw extracted images as pngs: ~2TB

Associated Challenges

No associated challenges have been found in regard to this dataset.

Licence

The dataset is licenced under Face Forsenics Terms of Use and the code is licenced under the MIT licence.