Geo-DefakeHop: High-Performance Geographic Fake Image Detection

[3]

CartoDB

, https://carto.com,

2021

[4]

H.-S.

Chen

Rouhsedaghat

Ghani

You

, and

C.-C. J.

Kuo

, “

DefakeHop: A Light-Weight High-Performance Deepfake Detector

”, in

2021 IEEE International Conference on Multimedia and Expo (ICME)

, IEEE,

2021

–

[5]

Chen

and

Guestrin

, “

Xgboost: A Scalable Tree Boosting System

”, in

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

2016

785

–

[6]

Chen

and

C.-C. J.

Kuo

, “

Pixelhop: A Successive Subspace Learning (SSL) Method for Object Recognition

”,

Journal of Visual Communication and Image Representation

2020

102749

[7]

Choi

Kim

J.-W.

Kim

, and

Choo

, “

Star-GAN: Unified Generative Adversarial Networks for Multi-domain ImagetoImage Translation

”, in

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

2018

8789

–

[8]

Dolhansky

Bitton

Pflaum

Howes

Wang

, and

C. C.

Ferrer

, “

The Deepfake Detection Challenge (DFDC) Dataset

”,

arXiv preprint

arXiv:

2006.07397

2020

[9]

Frank

Eisenhofer

Schönherr

Fischer

Kolossa

, and

Holz

, “

Leveraging Frequency Analysis for Deep Fake Image Recognition

”, in

International Conference on Machine Learning

, PMLR,

2020

3247

–

[10]

Guarnera

Giudice

, and

Battiato

, “

Deepfake Detection by Analyzing Convolutional Traces

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

2020

666

–

[11]

C. F.

Hall

and

E. L.

Hall

, “

A Nonlinear Model for the Spatial Characteristics of the Human Visual System

”,

IEEE Transactions on Systems, Man, and Cybernetics

(

1977

161

–

[12]

Heusel

Ramsauer

Unterthiner

Nessler

, and

Hochreiter

, “

Gans Trained by a Two Time-scale Update Rule Converge to a Local Nash Equilibrium

”,

Advances in Neural Information Processing Systems

2017

[13]

Isola

J.-Y.

Zhu

Zhou

, and

A. A.

Efros

, “

Image-to-Image Translation with Conditional Adversarial Networks

”, in

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

2017

[14]

Janowicz

Gao

McKenzie

, and

Bhaduri

, “

GeoAI: Spatially Explicit Artificial Intelligence Techniques for Geographic Knowledge Discovery and Beyond

”, in,

Vol. 34, No. 4

Taylor & Francis

2020

625

–

[15]

Jiang

Qian

, and

C. C.

Loy

, “

Deeperforensics-1.0: A Large-scale Dataset for Real-world Face Forgery Detection

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

2889

–

[16]

Karras

Aila

Laine

, and

Lehtinen

, “

Progressive Growing of GANs for Improved Quality, Stability, and Variation

”,

arXiv preprint

arXiv:

1710.10196

2017

[17]

Karras

Aittala

Laine

Härkönen

Hellsten

Lehtinen

, and

Aila

, “

Alias-free Generative Adversarial Networks

”,

Advances in Neural Information Processing Systems

2021

[18]

Karras

Laine

, and

Aila

, “

A Style-based Generator Architecture for Generative Adversarial Networks

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2019

4401

–

[19]

Karras

Laine

Aittala

Hellsten

Lehtinen

, and

Aila

, “

Analyzing and Improving the Image Quality of Stylegan

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

8110

–

[20]

C.-C. J.

Kuo

and

A. M.

Madni

, “

Green Learning: Introduction, Examples and Outlook

”,

arXiv preprint

arXiv:

2210.00965

2022

[21]

C.-C. J.

Kuo

Zhang

Duan

, and

Chen

, “

Interpretable Convolutional Neural Networks via Feedforward Design

”,

Journal of Visual Communication and Image Representation

2019

346

–

[22]

Johansen

, and

M. F.

McCabe

, “

A Machine Learning Approach for Identifying and Delineating Agricultural Fields and Their Multitemporal Dynamics using Three Decades of Landsat Data

”,

ISPRS Journal of Photogrammetry and Remote Sensing

186

2022

–

101

[23]

Martinis

, and

Wieland

, “

Urban Flood Mapping with An Active Self-learning Convolutional Neural Network based on TerraSAR-X Intensity and Interferometric Coherence

”,

ISPRS Journal of Photogram-metry and Remote Sensing

152

2019

178

–

[24]

Yang

Sun

, and

Lyu

, “

Celeb-DF: A Largescale Challenging Dataset for Deepfake Forensics

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

3207

–

[25]

Liu

Zhu

Song

, and

Elgammal

, “

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

”, in

International Conference on Learning Representations

2020

[26]

Liu

Xing

Yang

C.-C. J.

Kuo

Babu

G. E.

Fakhri

Jenkins

, and

Woo

, “

VoxelHop: Successive Subspace Learning for ALS Disease Classification Using Structural MRI

”,

arXiv preprint

arXiv:

2101.05131

2021

[27]

Liu

Zhang

Yin

, and

B. A.

Johnson

, “

Deep Learning in Remote Sensing Applications: A Meta-analysis and Review

”,

ISPRS Journal of Photogrammetry and Remote Sensing

152

2019

166

–

[28]

Nataraj

T. M.

Mohammed

Manjunath

Chandrasekaran

Flenner

J. H.

Bappy

, and

A. K.

Roy-Chowdhury

, “

Detecting GAN Generated Fake Images using Co-occurrence Matrices

”,

Electronic Imaging

2019

(

2019

532

–

[29]

Park

M.-Y.

Liu

T.-C.

Wang

, and

J.-Y.

Zhu

, “

Semantic Image Synthesis with Spatially-adaptive Normalization

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2019

2337

–

[30]

Radford

Child

Luan

Amodei

Sutskever

, et al., “

Language Models are Unsupervised Multitask Learners

”,

OpenAI Blog

(

2019

[31]

Rössler

Cozzolino

Verdoliva

Riess

Thies

, and

Nießner

, “

Faceforensics: A Large-scale Video Dataset for Forgery Detection in Human Faces

”,

arXiv preprint

arXiv:

1803.09179

2018

[32]

S.-Y.

Wang

Zhang

Owens

, and

A. A.

Efros

, “

CNNgenerated Images are Surprisingly Easy to Spot… for Now

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

8695

–

704

[33]

Zhang

Wang

Sohrab

Gabbouj

, and

C.-C. J.

Kuo

, “

AnomalyHop: An SSL-based Image Anomaly Localization Method

”,

arXiv preprint

arXiv:

2105.03797

2021

[34]

Zhang

You

Kadam

Liu

, and

C.-C. J.

Kuo

, “

PointHop: An Explainable Machine Learning Method for Point Cloud Classification

”,

IEEE Transactions on Multimedia

(

2020

1744

–

[35]

Zhang

Karaman

, and

S.-F.

Chang

, “

Detecting and Simulating Artifacts in GAN Fake Images

”, in

2019 IEEE International Workshop on Information Forensics and Security (WIFS)

, IEEE,

2019

–

[36]

Zhao

Zhang

Sun

, and

Deng

, “

Deep Fake Geography? When Geospatial Data Encounter Artificial Intelligence

”,

Cartography and Geographic Information Science

(

2021

338

–

[37]

Zheng

Zhang

, and

Zhong

, “

Deep multisensor learning for missing-modality all-weather mapping

”,

ISPRS Journal of Photogrammetry and Remote Sensing

174

2021

254

–

[38]

Zhou

Wang

Liang

, and

Shen

, “

Face Forensics in the Wild

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2021

5778

–

[39]

J.-Y.

Zhu

Park

Isola

, and

A. A.

Efros

, “

Unpaired Imageto-Image Translation using Cycle-consistent Adversarial Networks

”, in

Proceedings of the IEEE International Conference on Computer Vision

2017

2223

–

[40]

Chang

Chen

, and

Y.-G.

Jiang

, “

Wilddeepfake: A Challenging Real-world Dataset for Deepfake Detection

”, in

Proceedings of the 28th ACM International Conference on Multimedia

2020

2382

–

2024

H.-S. Chen, K. Zhang, S. Hu, S. You and C.-C. J. Kuo

Figure 1

An overview of the Geo-DefakeHop method, where the input is an image tile and the output is a binary decision on whether the input is an authentic or a fake one. First, each input title is partitioned into non-overlapping blocks of dimension 16 × 16 × 3. Second, each block goes through one PixelHop or multiple PixelHops, each of which yields 3D tensor responses of dimension H × W × C. Third, for each PixelHop, an XGBoost classifier is applied to spatial samples of each channel to generate channel-wise (c/w) soft decision scores and a set of discriminant channels are selected accordingly. Last, all block decision scores are ensembled to generate the final decision of the image tile.

Figure 2

An illustration showing four plots labeled a through d, each comparing F one score and Energy percentage against channel index for train, val, and test data under different conditions.

The channel-wise performance of four settings: (a) without perturbation, (b) resizing, (c) adding Gaussian noise, and (d) JPEG compression. The channel 0 is DC (Direct Current) and from the first channel to the 26th channel are corresponding to AC1 to AC26 (Alternating Current). The blue line is the energy percentage of each channel and the red, magenta and green lines are the F1-score of the training, validation and testing dataset. We observe that high-frequency channels without perturbation in 2(a) has a higher performance. After applying resizing, adding Gaussian noise and compression, the performance of high-frequency channels degrades as shown in 2(b), 2(c), and 2(d). The test score and validation score are closely related, indicating that the validation score can be used to select the discriminant channels.

Table 1

A collage of various building types, showcasing architectural diversity and styles from different regions.

Visualization of original real images (the first column), partial real/partial fake (PRPF) images (the second column), the ground truth (the third column, where dark blue and yellow denote real and fake regions, respectively) and heat maps (the four column, where cold and warm colors indicate a higher probability of being real and fake in the corresponding location, respectively.)

Table 2

A table displaying various types of soil in different colors, showcasing their unique textures and characteristics.

Visualization of absolute values of Saab filter responses and the detection heat maps for DC, AC1, AC11 and AC26 four channels, where DC and AC1 are low-frequency channels, AC11 is a mid-frequency channel, and AC26 is a high-frequency channel. Cold and warm colors in heat maps indicate a higher probability of being real and fake in the corresponding location, respectively. The ground truth is that the whole image is a fake one.

Table 3

The statistics of three fake satellite image datasets, where C-GAN, S-GAN and L-GAN denote CycleGAN, StyleGAN2 and Lightweight GAN, respectively.

	UW/C-GAN	USC/S-GAN	USC/L-GAN
No. of Real	8,046	32,184	32,184
No. of Fake	8,046	32,184	32,184
Image sizes	256 × 256	128 × 128	128 × 128

Table 4

Comparison of FID scores of three fake satellite image datasets, where C-GAN, S-GAN and L-GAN denote CycleGAN, StyleGAN2 and Lightweight GAN, respectively. Lower FID scores indicate better generated images of higher fidelity and variability.

	UW/C-GAN	USC/S-GAN	USC/L-GAN
Beijing	134.88	49.31	55.72
Seattle	174.78	47.11	41.87
Tacoma	-	60.18	28.76

Table 5

Detection performance comparison with raw images from the UW dataset for three benchmarking methods. The boldface and the underbar indicate the best and the second-best results, respectively.

Method	Features or Designs	F1 score	Precision	Recall
Zhao et al. [36]	Spatial	75.81%	78.15%	73.61%
	Histogram	78.99%	72.93%	86.16%
	Frequency	65.84%	49.07%	100%
	Spatial + Histogram	86.77%	82.78%	91.17%
	Spatial + Frequency	77.02%	78.75%	75.36%
	Histogram + Frequency	83.90%	78.36%	90.29%
	Spatial + Histogram + Frequency	87.08%	82.73%	91.92%
DefakeHop [4]		96.89%	97.26%	96.53%
Geo-DefakeHop (Ours)	PixelHop A	99.88%	100%	99.75%
	PixelHop B	100%	100%	100%
	PixelHop C	99.88%	100%	99.75%
	PixelHops A&B&C	100%	100%	100%

Method	Features or Designs	F1 score	Precision	Recall
Zhao et al. [36]	Spatial	75.81%	78.15%	73.61%
	Histogram	78.99%	72.93%	86.16%
	Frequency	65.84%	49.07%	100%
	Spatial + Histogram	86.77%	82.78%	91.17%
	Spatial + Frequency	77.02%	78.75%	75.36%
	Histogram + Frequency	83.90%	78.36%	90.29%
	Spatial + Histogram + Frequency	87.08%	82.73%	91.92%
DefakeHop [4]		96.89%	97.26%	96.53%
Geo-DefakeHop (Ours)	PixelHop A	99.88%	100%	99.75%
	PixelHop B	100%	100%	100%
	PixelHop C	99.88%	100%	99.75%
	PixelHops A&B&C	100%	100%	100%

Table 6

Detection performance comparison for images resized from 256 × 256 to 128 × 128 and 64 × 64 The boldface and the underbar indicate the best and the second-best results, respectively.

Tile size	Method	Features or Designs	Fl score	Precision	Recall
128 × 128	Zhao et al. [36]	Spatial	77.35%	76.61%	78.10%
		Histogram	80.09%	75.93%	84.72%
		Frequency	64.14%	47.21%	100%
		Spatial + Histogram	88.28%	85.81%	90.89%
		Spatial + Frequency	79.79%	81.38%	78.26%
		Histogram + Frequency	81.92%	76.99%	87.53%
		Spatial + Histogram + Frequency	88.09%	86.52%	89.71%
	DefakeHop [4]		92.63%	97.78%	88.00%
	Geo-DefakeHop (Ours)	PixelHop A	100%	100%	100%
		PixelHop Β	99.88%	100%	99.75%
		PixelHop C	99.75%	99.75%	99.75%
		PixelHops A&B&C	100%	100%	100%
64 × 64	Zhao et al. [36]	Spatial	76.46%	78.85%	74.21%
		Histogram	81.59%	76.60%	87.26%
		Frequency	49.75%	79.89%	36.12%
		Spatial + Histogram	88.22%	86.15%	90.39%
		Spatial + Frequency	77.46%	77.83%	77.09%
		Histogram + Frequency	83.16%	77.80%	89.32%
		Spatial + Histogram + Frequency	87.91%	83.94%	92.29%
	DefakeHop [4]		86.60%	89.36%	84.00%
	Geo-DefakeHop (Ours)	PixelHop A	98.27%	98.27%	98.27%
		PixelHop Β	97.39%	97.76%	97.03%
		PixelHop C	96.36%	97.71%	95.05%
		PixelHops A&B&C	99.01%	99.01%	99.01%

Tile size	Method	Features or Designs	Fl score	Precision	Recall
128 × 128	Zhao et al. [36]	Spatial	77.35%	76.61%	78.10%
		Histogram	80.09%	75.93%	84.72%
		Frequency	64.14%	47.21%	100%
		Spatial + Histogram	88.28%	85.81%	90.89%
		Spatial + Frequency	79.79%	81.38%	78.26%
		Histogram + Frequency	81.92%	76.99%	87.53%
		Spatial + Histogram + Frequency	88.09%	86.52%	89.71%
	DefakeHop [4]		92.63%	97.78%	88.00%
	Geo-DefakeHop (Ours)	PixelHop A	100%	100%	100%
		PixelHop Β	99.88%	100%	99.75%
		PixelHop C	99.75%	99.75%	99.75%
		PixelHops A&B&C	100%	100%	100%
64 × 64	Zhao et al. [36]	Spatial	76.46%	78.85%	74.21%
		Histogram	81.59%	76.60%	87.26%
		Frequency	49.75%	79.89%	36.12%
		Spatial + Histogram	88.22%	86.15%	90.39%
		Spatial + Frequency	77.46%	77.83%	77.09%
		Histogram + Frequency	83.16%	77.80%	89.32%
		Spatial + Histogram + Frequency	87.91%	83.94%	92.29%
	DefakeHop [4]		86.60%	89.36%	84.00%
	Geo-DefakeHop (Ours)	PixelHop A	98.27%	98.27%	98.27%
		PixelHop Β	97.39%	97.76%	97.03%
		PixelHop C	96.36%	97.71%	95.05%
		PixelHops A&B&C	99.01%	99.01%	99.01%

Table 7

Detection performance comparison for images corrupted by additive white Gaussian noise with standard deviation σ = 0.02, 0.06, 0.1. The boldface and the underbar indicate the best and the second-best results, respectively.

Noise σ	Method	Features or Designs	Fl score	Precision	Recall
0.02	Zhao et al. [36]	Spatial + Histogram	83.04%	82.41%	83.67%
		Spatial + Frequency	75.63%	78.42%	73.04%
		Histogram + Frequency	81.47%	76.62%	86.98%
		Spatial + Histogram + Frequency	83.25%	81.47%	85.11%
	DefakeHop [4]		91.84%	93.75%	90.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.56%	96.38%	98.77%
		PixelHop Β	98.90%	98.05%	99.75%
		PixelHop C	99.01%	98.53%	99.50%
		PixelHop A&B&C	98.65%	97.58%	99.75%
0.06	Zhao et al. [36]	Spatial + Histogram	80.74%	80.94%	80.54%
		Spatial + Frequency	76.39%	78.47%	74.42%
		Histogram + Frequency	80.28%	75.49%	85.71%
		Spatial + Histogram + Frequency	81.42%	79.40%	83.55%
	DefakeHop [4]		92.78%	95.75%	90.00%
	Geo-DefakeHop (Ours)	PixelHop A	95.24%	93.98%	96.53%
		PixelHop Β	96.59%	95.19%	98.02%
		PixelHop C	95.07%	94.70%	97.28%
		PixelHop A&B&C	96.59%	95.19%	98.02%
0.1	Zhao et al. [36]	Spatial + Histogram	81.74%	78.42%	85.35%
		Spatial + Frequency	69.05%	70.35%	67.79%
		Histogram + Frequency	79.44%	74.67%	84.86%
		Spatial + Histogram + Frequency	80.05%	77.78%	82.46%
	DefakeHop [4]		92.63%	97.78%	88.00%
	Geo-DefakeHop (Ours)	PixelHop A	94.43%	92.42%	96.53%
		PixelHop Β	94.88%	93.51%	96.29%
		PixelHop C	95.37%	93.99%	96.78%
		PixelHop A&B&C	96.10%	94.71%	97.52%

Noise σ	Method	Features or Designs	Fl score	Precision	Recall
0.02	Zhao et al. [36]	Spatial + Histogram	83.04%	82.41%	83.67%
		Spatial + Frequency	75.63%	78.42%	73.04%
		Histogram + Frequency	81.47%	76.62%	86.98%
		Spatial + Histogram + Frequency	83.25%	81.47%	85.11%
	DefakeHop [4]		91.84%	93.75%	90.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.56%	96.38%	98.77%
		PixelHop Β	98.90%	98.05%	99.75%
		PixelHop C	99.01%	98.53%	99.50%
		PixelHop A&B&C	98.65%	97.58%	99.75%
0.06	Zhao et al. [36]	Spatial + Histogram	80.74%	80.94%	80.54%
		Spatial + Frequency	76.39%	78.47%	74.42%
		Histogram + Frequency	80.28%	75.49%	85.71%
		Spatial + Histogram + Frequency	81.42%	79.40%	83.55%
	DefakeHop [4]		92.78%	95.75%	90.00%
	Geo-DefakeHop (Ours)	PixelHop A	95.24%	93.98%	96.53%
		PixelHop Β	96.59%	95.19%	98.02%
		PixelHop C	95.07%	94.70%	97.28%
		PixelHop A&B&C	96.59%	95.19%	98.02%
0.1	Zhao et al. [36]	Spatial + Histogram	81.74%	78.42%	85.35%
		Spatial + Frequency	69.05%	70.35%	67.79%
		Histogram + Frequency	79.44%	74.67%	84.86%
		Spatial + Histogram + Frequency	80.05%	77.78%	82.46%
	DefakeHop [4]		92.63%	97.78%	88.00%
	Geo-DefakeHop (Ours)	PixelHop A	94.43%	92.42%	96.53%
		PixelHop Β	94.88%	93.51%	96.29%
		PixelHop C	95.37%	93.99%	96.78%
		PixelHop A&B&C	96.10%	94.71%	97.52%

Table 8

Detection performance comparison for images coded by the JPEG compression standard of three quality factors (QF), i.e., QF = 95, 85, 75. The boldface and the underbar indicate the best and the second-best results, respectively.

JPEG quality	Method	Features or Designs	Fl score	Precision	Recall
95	Zhao et al. [36]	Spatial+Histogram	85.95%	82.49%	89.72%
		Spatial+Frequency	78.00%	78.38%	77.62%
		Histogram+Frequency	82.43%	74.95%	91.58%
		Spatial+Histogram+Frequency	86.96%	85.06%	88.94%
	DefakeHop [4]		98.00%	98.00%	98.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.91%	97.31%	98.51%
		PixelHop Β	97.90%	97.54%	98.27%
		PixelHop C	98.28%	97.56%	99.01%
		PixelHop A&B&C	98.15%	97.55%	98.76%
85	Zhao et al. [36]	Spatial + Histogram	85.91%	82.67%	89.42%
		Spatial + Frequency	82.53%	81.48%	83.61%
		Histogram + Frequency	85.28%	81.66%	89.24%
		Spatial + Histogram + Frequency	89.54%	85.82%	93.6%
	DefakeHop [4]		94.85%	97.87%	92.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.54%	96.83%	98.27%
		PixelHop Β	97.91%	97.08%	98.76%
		PixelHop C	97.91%	97.08%	98.76%
		PixelHop A&B&C	97.54%	97.06%	98.02%
75	Zhao et al. [36]	Spatial+Histogram	85.61%	81.70%	89.93%
		Spatial+Frequency	87.09%	83.94%	90.49%
		Histogram+Frequency	88.94%	87.41%	90.52%
		Spatial+Histogram+Frequency	90.20%	88.46%	92.00%
	DefakeHop [4]		92.93%	93.88%	92.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.92%	96.63%	99.26%
		PixelHop Β	97.66%	97.07%	98.27%
		PixelHop C	97.79%	96.84%	98.76%
		PixelHop A&B&C	97.92%	96.63%	99.26%

JPEG quality	Method	Features or Designs	Fl score	Precision	Recall
95	Zhao et al. [36]	Spatial+Histogram	85.95%	82.49%	89.72%
		Spatial+Frequency	78.00%	78.38%	77.62%
		Histogram+Frequency	82.43%	74.95%	91.58%
		Spatial+Histogram+Frequency	86.96%	85.06%	88.94%
	DefakeHop [4]		98.00%	98.00%	98.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.91%	97.31%	98.51%
		PixelHop Β	97.90%	97.54%	98.27%
		PixelHop C	98.28%	97.56%	99.01%
		PixelHop A&B&C	98.15%	97.55%	98.76%
85	Zhao et al. [36]	Spatial + Histogram	85.91%	82.67%	89.42%
		Spatial + Frequency	82.53%	81.48%	83.61%
		Histogram + Frequency	85.28%	81.66%	89.24%
		Spatial + Histogram + Frequency	89.54%	85.82%	93.6%
	DefakeHop [4]		94.85%	97.87%	92.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.54%	96.83%	98.27%
		PixelHop Β	97.91%	97.08%	98.76%
		PixelHop C	97.91%	97.08%	98.76%
		PixelHop A&B&C	97.54%	97.06%	98.02%
75	Zhao et al. [36]	Spatial+Histogram	85.61%	81.70%	89.93%
		Spatial+Frequency	87.09%	83.94%	90.49%
		Histogram+Frequency	88.94%	87.41%	90.52%
		Spatial+Histogram+Frequency	90.20%	88.46%	92.00%
	DefakeHop [4]		92.93%	93.88%	92.00%
	Geo-DefakeHop (Ours)	PixelHop A	97.92%	96.63%	99.26%
		PixelHop Β	97.66%	97.07%	98.27%
		PixelHop C	97.79%	96.84%	98.76%
		PixelHop A&B&C	97.92%	96.63%	99.26%

Table 9

Comparison of F1-scores of four detection methods under the weak supervision data setting, where X-Y-Z means that X% of training, Y% of validation and Z% of test data samples.

	40-10-50	10-10-80
Zhao et al.	87.82%	86.62%
Wang et al.	100%	99.64%
ResNet18	99.85%	98.87%
ResNet18-FFT	99.98%	99.88%
Geo-DefakeHop	99.93%	99.67%

Table 10

Comparion of F1-scores of four detection methods on fake images generated by CycleGAN, StyleGAN2, and Lightweight GAN, where all datasets are split with 10% training, 10% validation and 80% test data.

	CycleGAN	StyleGAN2	LightweightGAN
Zhao et al.	86.62%	69.50%	69.75%
Wang et al.	99.64%	37.89%	11.55%
ResNet18	98.87%	98.46%	98.89%
ResNet18-FFT	99.88%	96.33%	96.45%
Geo-DefakeHop	99.67%	99.47%	99.80%

Table 11

Model size computation of four Geo-DefakeHop designs for raw satellite input images.

System	No. of Selected Channels	No. of Filter Parameters	No. of c/w XGBoost Parameters	No. of ensemble XGBoost Parameters	Total Model Size
Pixelhop A	1	12	400	400	812
Pixelhop B	1	27	400	400	827
Pixelhop C	1	48	400	400	848
Pixelhop A&B&C	3	87	1,200	1,200	2,487

System	No. of Selected Channels	No. of Filter Parameters	No. of c/w XGBoost Parameters	No. of ensemble XGBoost Parameters	Total Model Size
Pixelhop A	1	12	400	400	812
Pixelhop B	1	27	400	400	827
Pixelhop C	1	48	400	400	848
Pixelhop A&B&C	3	87	1,200	1,200	2,487

Table 12

Summary of model sizes of four Geo-DefakeHop designs with different input images.

Experiments	PixelHop A	PixelHop B	PixelHop C	A&B&C
Raw Images	0.8K	0.8K	0.8K	2.5K
Resizing	9.7K	20K	37K	61.7K
Noise	8.1K	13K	33K	38.5K
Compression	7.3K	19K	33K	37.4K

[1]

Barni

Kallas

Nowroozi

, and

Tondi

, “

CNN Detection of GANgenerated Face Images based on Cross-Band Co-occurrences Analysis

”, in

2020 IEEE International Workshop on Information Forensics and Security (WIFS)

, IEEE,

2020

–

[2]

Brock

Donahue

, and

Simonyan

, “

Large Scale GAN Training for High Fidelity Natural Image Synthesis

”,

arXiv:1809.11096

2018

[3]

CartoDB

, https://carto.com,

2021

[4]

H.-S.

Chen

Rouhsedaghat

Ghani

You

, and

C.-C. J.

Kuo

, “

DefakeHop: A Light-Weight High-Performance Deepfake Detector

”, in

2021 IEEE International Conference on Multimedia and Expo (ICME)

, IEEE,

2021

–

[5]

Chen

and

Guestrin

, “

Xgboost: A Scalable Tree Boosting System

”, in

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

2016

785

–

[6]

Chen

and

C.-C. J.

Kuo

, “

Pixelhop: A Successive Subspace Learning (SSL) Method for Object Recognition

”,

Journal of Visual Communication and Image Representation

2020

102749

[7]

Choi

Kim

J.-W.

Kim

, and

Choo

, “

Star-GAN: Unified Generative Adversarial Networks for Multi-domain ImagetoImage Translation

”, in

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

2018

8789

–

[8]

Dolhansky

Bitton

Pflaum

Howes

Wang

, and

C. C.

Ferrer

, “

The Deepfake Detection Challenge (DFDC) Dataset

”,

arXiv preprint

arXiv:

2006.07397

2020

[9]

Frank

Eisenhofer

Schönherr

Fischer

Kolossa

, and

Holz

, “

Leveraging Frequency Analysis for Deep Fake Image Recognition

”, in

International Conference on Machine Learning

, PMLR,

2020

3247

–

[10]

Guarnera

Giudice

, and

Battiato

, “

Deepfake Detection by Analyzing Convolutional Traces

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

2020

666

–

[11]

C. F.

Hall

and

E. L.

Hall

, “

A Nonlinear Model for the Spatial Characteristics of the Human Visual System

”,

IEEE Transactions on Systems, Man, and Cybernetics

(

1977

161

–

[12]

Heusel

Ramsauer

Unterthiner

Nessler

, and

Hochreiter

, “

Gans Trained by a Two Time-scale Update Rule Converge to a Local Nash Equilibrium

”,

Advances in Neural Information Processing Systems

2017

[13]

Isola

J.-Y.

Zhu

Zhou

, and

A. A.

Efros

, “

Image-to-Image Translation with Conditional Adversarial Networks

”, in

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

2017

[14]

Janowicz

Gao

McKenzie

, and

Bhaduri

, “

GeoAI: Spatially Explicit Artificial Intelligence Techniques for Geographic Knowledge Discovery and Beyond

”, in,

Vol. 34, No. 4

Taylor & Francis

2020

625

–

[15]

Jiang

Qian

, and

C. C.

Loy

, “

Deeperforensics-1.0: A Large-scale Dataset for Real-world Face Forgery Detection

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

2889

–

[16]

Karras

Aila

Laine

, and

Lehtinen

, “

Progressive Growing of GANs for Improved Quality, Stability, and Variation

”,

arXiv preprint

arXiv:

1710.10196

2017

[17]

Karras

Aittala

Laine

Härkönen

Hellsten

Lehtinen

, and

Aila

, “

Alias-free Generative Adversarial Networks

”,

Advances in Neural Information Processing Systems

2021

[18]

Karras

Laine

, and

Aila

, “

A Style-based Generator Architecture for Generative Adversarial Networks

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2019

4401

–

[19]

Karras

Laine

Aittala

Hellsten

Lehtinen

, and

Aila

, “

Analyzing and Improving the Image Quality of Stylegan

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

8110

–

[20]

C.-C. J.

Kuo

and

A. M.

Madni

, “

Green Learning: Introduction, Examples and Outlook

”,

arXiv preprint

arXiv:

2210.00965

2022

[21]

C.-C. J.

Kuo

Zhang

Duan

, and

Chen

, “

Interpretable Convolutional Neural Networks via Feedforward Design

”,

Journal of Visual Communication and Image Representation

2019

346

–

[22]

Johansen

, and

M. F.

McCabe

, “

A Machine Learning Approach for Identifying and Delineating Agricultural Fields and Their Multitemporal Dynamics using Three Decades of Landsat Data

”,

ISPRS Journal of Photogrammetry and Remote Sensing

186

2022

–

101

[23]

Martinis

, and

Wieland

, “

Urban Flood Mapping with An Active Self-learning Convolutional Neural Network based on TerraSAR-X Intensity and Interferometric Coherence

”,

ISPRS Journal of Photogram-metry and Remote Sensing

152

2019

178

–

[24]

Yang

Sun

, and

Lyu

, “

Celeb-DF: A Largescale Challenging Dataset for Deepfake Forensics

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

3207

–

[25]

Liu

Zhu

Song

, and

Elgammal

, “

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

”, in

International Conference on Learning Representations

2020

[26]

Liu

Xing

Yang

C.-C. J.

Kuo

Babu

G. E.

Fakhri

Jenkins

, and

Woo

, “

VoxelHop: Successive Subspace Learning for ALS Disease Classification Using Structural MRI

”,

arXiv preprint

arXiv:

2101.05131

2021

[27]

Liu

Zhang

Yin

, and

B. A.

Johnson

, “

Deep Learning in Remote Sensing Applications: A Meta-analysis and Review

”,

ISPRS Journal of Photogrammetry and Remote Sensing

152

2019

166

–

[28]

Nataraj

T. M.

Mohammed

Manjunath

Chandrasekaran

Flenner

J. H.

Bappy

, and

A. K.

Roy-Chowdhury

, “

Detecting GAN Generated Fake Images using Co-occurrence Matrices

”,

Electronic Imaging

2019

(

2019

532

–

[29]

Park

M.-Y.

Liu

T.-C.

Wang

, and

J.-Y.

Zhu

, “

Semantic Image Synthesis with Spatially-adaptive Normalization

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2019

2337

–

[30]

Radford

Child

Luan

Amodei

Sutskever

, et al., “

Language Models are Unsupervised Multitask Learners

”,

OpenAI Blog

(

2019

[31]

Rössler

Cozzolino

Verdoliva

Riess

Thies

, and

Nießner

, “

Faceforensics: A Large-scale Video Dataset for Forgery Detection in Human Faces

”,

arXiv preprint

arXiv:

1803.09179

2018

[32]

S.-Y.

Wang

Zhang

Owens

, and

A. A.

Efros

, “

CNNgenerated Images are Surprisingly Easy to Spot… for Now

”, in

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

2020

8695

–

704

[33]

Zhang

Wang

Sohrab

Gabbouj

, and

C.-C. J.

Kuo

, “

AnomalyHop: An SSL-based Image Anomaly Localization Method

”,

arXiv preprint

arXiv:

2105.03797

2021

[34]

Zhang

You

Kadam

Liu

, and

C.-C. J.

Kuo

, “

PointHop: An Explainable Machine Learning Method for Point Cloud Classification

”,

IEEE Transactions on Multimedia

(

2020

1744

–

[35]

Zhang

Karaman

, and

S.-F.

Chang

, “

Detecting and Simulating Artifacts in GAN Fake Images

”, in

2019 IEEE International Workshop on Information Forensics and Security (WIFS)

, IEEE,

2019

–

[36]

Zhao

Zhang

Sun

, and

Deng

, “

Deep Fake Geography? When Geospatial Data Encounter Artificial Intelligence

”,

Cartography and Geographic Information Science

(

2021

338

–

[37]

Zheng

Zhang

, and

Zhong

, “

Deep multisensor learning for missing-modality all-weather mapping

”,

ISPRS Journal of Photogrammetry and Remote Sensing

174

2021

254

–