Table 3

Training and performance hyperparameters for benchmark CNN models

ParameterResNet-18Deep Baseline CNN
Input size2 × 500 × 5002 × 500 × 500
Parameter quantity12.5 m11m
Batch size6464
Learning rate1 × 10–31 × 10–3
OptimizerAdamAdam
Weight decay1 × 10–51 × 10–5
Data augmentationRandom flips, cropsRandom flips, crops
Epochs100110
Training time (wall-clock)6 h6.5 h
GPU memory peak8 GB8 GB
Inference latency per sample45 ms50 ms
List of channels in conv layers[ 64, 128, 256, 512 ][ 64, 128, 256, 512 ]
Sizes of FC hidden layers[ 512, 1,024, 256 ][ 512, 1,024, 256 ]
Dropout rate0.30.3

or Create an Account

Close Modal
Close Modal