Efficient Multi-stage Context Based Entropy Model for Learned Lossy Point Cloud Attribute Compression

[5] Bjontegaard, “Calculation of average PSNR differences between RD-curves”, ITU SG16 Doc. VCEG-M33, 2001.

[6] Choy, Gwak, and Savarese, “4d spatio-temporal convnets: Minkowski convolutional neural networks”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 3075–.

[7] R. A. Cohen, Tian, and Vetro, “Attribute compression for sparse point clouds using graph transforms”, in 2016 IEEE International Conference on Image Processing (ICIP), IEEE, 2016, pp. 1374–.

[8] d’Eon, Harrison, Myers, and P. A. Chou, “8i Voxelized Full Bodies - A Voxelized Point Cloud Dataset”, ISO/IEC JTC1/SC29 Joint WG11/WG1 (MPEG/JPEG) m38673/M72012, May 2016.

[9] Dai, A. X. Chang, Savva, Halber, Funkhouser, and Nießner, “ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2017.

[10] R. L. De Queiroz and P. A. Chou, “Compression of 3D point clouds using a region-adaptive hierarchical transform”, IEEE Transactions on Image Processing, 2016, pp. 3947–.

[11] R. L. De Queiroz and P. A. Chou, “Transform coding for point clouds using a Gaussian process model”, IEEE Transactions on Image Processing, 2017, pp. 3507–.

[12] Fang, Wang, and Guo, “3dac: Learning attribute compression for point clouds”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 14819–28.

[13] Song, Gao, and Liu, “Octattention: Octree-based large-scale contexts model for point cloud compression”, in Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36, 2022, pp. 625–.

[14] Zheng, Sun, Wang, and Qin, “Checkerboard context model for efficient learned image compression”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 14771–.

[15] Yang, Khalid, Xiao, Trigoni, and Markham, “Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.

[16] J.-H. Kim, Heo, and J.-S. Lee, “Joint global and local hierarchical priors for learned image compression”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5992–6001.

[17] T.-Y. Lin, Maire, Belongie, Hays, Perona, Ramanan, Dollar, and C. L. Zitnick, “Microsoft coco: Common objects in context”, in Computer Vision-ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13, Springer, 2014, pp. 740–.

[18] Minnen, Ballé, and G. D. Toderici, “Joint autoregressive and hierarchical priors for learned image compression”, Advances in Neural Information Processing Systems, 2018.

[19] D. T. Nguyen and Kaup, “Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model”, IEEE Transactions on Circuits and Systems for Video Technology, 2023.

[20] Pavez, Girault, Ortega, and P. A. Chou, “Region adaptive graph Fourier transform for 3D point clouds”, in 2020 IEEE International Conference on Image Processing (ICIP), IEEE, 2020, pp. 2726–.

[21] Perlin, “An image synthesizer”, ACM Siggraph Computer Graphics, 1985, pp. 287–.

[22] R. B. Pinheiro, J.-E. Marvie, Valenzise, and Dufaux, “NF-PCAC: Normalizing Flow based Point Cloud Attribute Compression”, in ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023.

[23] Qian, Tan, Sun, Lin, Sun, Hao, and Jin, “Learning Accurate Entropy Model with Global Reference for Image Compression”, in International Conference on Learning Representations, 2021.

[24] Quach, Valenzise, and Dufaux, “Folding-based compression of point cloud attributes”, in 2020 IEEE International Conference on Image Processing (ICIP), IEEE, 2020, pp. 3309–.

[25] Quach, Valenzise, and Dufaux, “Learning convolutional transforms for lossy point cloud geometry compression”, in 2019 IEEE International Conference on Image Processing (ICIP), IEEE, 2019, pp. 4320–4.

[26] Que et al., “Voxelcontext-net: An octree based framework for point cloud compression”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 6042–51.

[27] Rudolph, Riemenschneider, and Rizk, “Progressive Coding for Deep Learning based Point Cloud Attribute Compression”, in MMVE ’24, Bari, Italy: Association for Computing Machinery, 2024, ISBN: 9798400706189, DOI: https://doi.org/10.1145/3652212.3652217.

[28] Schwarz, Preda, Baroncini, Budagavi, Cesar, P. A. Chou, R. A. Cohen, Krivokuća, Lasserre, et al., “Emerging MPEG standards for point cloud compression”, IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2018, pp. 133–.

[29] Shao, Zhang, Fan, et al., “Attribute compression of 3D point clouds using Laplacian sparsity optimized graph transform”, in 2017 IEEE Visual Communications and Image Processing (VCIP), IEEE, 2017.

[30] Sheng, Liu, Xiong, et al., “Deep-pcac: An end-to-end deep lossy compression framework for point cloud attributes”, IEEE Transactions on Multimedia, 2021, pp. 2617–.

[31] Song, Liu, et al., “Efficient Hierarchical Entropy Model for Learned Point Cloud Compression”, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 14368–77.

[32] Wang, Ding, Feng, Cao, et al., “Sparse tensor-based multiscale representation for point cloud geometry compression”, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022.

[33] Wang, Ding, et al., “Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction”, in 2023 Data Compression Conference (DCC), IEEE, 2023, pp. 228–.

[34] Wang et al., “Sparse tensor-based point cloud attribute compression”, in 2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR), IEEE, 2022.

[35] Wang, Zhu, Liu, et al., “Lossy point cloud geometry compression via end-to-end learning”, IEEE Transactions on Circuits and Systems for Video Technology, 2021, pp. 4909–.

[36] Wang, Zhang, Wang, Guo, and Gao, “Predictive generalized graph Fourier transform for attribute compression of dynamic point clouds”, IEEE Transactions on Circuits and Systems for Video Technology, 2020, pp. 1968–.

[37] Yang, Shao, Liu, T. H., et al., “PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds”, in Proceedings of the 31st ACM International Conference on Multimedia, MM ’23, Ottawa ON, Canada: Association for Computing Machinery, 2023, pp. 9271–, ISBN: 9798400701085, DOI: https://doi.org/10.1145/3581783.3612422.

[38] Yao and Ziyu, “Owlii Dynamic human mesh sequence dataset”, ISO/IEC JTC1/SC29/WG11 (MPEG/JPEG) m41658, 2017.

[39] Zhang, Florencio, and Loop, “Point cloud attribute compression with graph transform”, in 2014 IEEE International Conference on Image Processing (ICIP), IEEE, 2014, pp. 2066–.

[40] Zhang, Chen, Ding, et al., “YOGA: Yet Another Geometry-based Point Cloud Compressor”, in Proceedings of the 31st ACM International Conference on Multimedia, 2023.

[41] Zhang, Chen, Ding, et al., “G-PCC++: Enhanced Geometry-based Point Cloud Compression”, in Proceedings of the 31st ACM International Conference on Multimedia, 2023, pp. 1352–.

2024

K. Wang, P. Zhang, S. Jiao, H. Yuan, S. Wang and X. Wang

Figure 1

Comparison of average compression performance and decoding latency among various point cloud attribute compression codecs on the longdress point cloud (10-bit geometry) from the 8iVFBv2 dataset [8]. Compared with the joint hyper-autoregressive entropy model [34], our proposed model decodes markedly faster, with decoding latency comparable to that of codecs using the hyperprior entropy model (Hyper) and the factorized entropy model (Factorized). Notably, Ours-Light shares the same backbone architecture as Hyper and Factorized; the three differ only in their entropy models.

Figure 2

Overview of the proposed method. The left side shows the attribute autoencoder, and the right side shows the entropy model. “SConv n³ × C” and “TSConv n³ × C” denote sparse convolution and transposed sparse convolution with C output channels and kernel size n³. “Residual Block” and “Self Attention Block” represent the residual network and the local self-attention network used for efficient latent feature aggregation. “s ↑” and “s ↓” represent upsampling and downsampling by a factor of s. “Q” represents the quantizer, “AE” the arithmetic encoder, and “AD” the arithmetic decoder. “G” represents the partition operation applied to the quantized latent representations. “0” symbolizes the context with the same shape as the input point cloud, where all attribute values are 0. “U” denotes the combination of point clouds of different shapes. Red arrows represent the encoding data flow, blue arrows the decoding data flow, and purple arrows the shared data flow.

Figure 3

A block diagram shows two parallel paths for X and X̃ in the g_a and g_s modules, followed by a Global Hyperprior Model and a Context Model.

An example of a multi-stage context modeling scheme with three groups, {1}, {3,6,8} and {2,4,5,7}, which improves the accuracy of probability estimation in the entropy model by exploiting spatial correlations among groups. “0” symbolizes the context with the same shape as the input point cloud, where all attribute values are 0. Y_th represents the hyperprior context output by decoder h_s, and ${\hat{Z}}_{g}$ denotes the global hyperprior. Entropy decoding decodes the bitstream into voxels using the entropy model parameters {μ, θ} generated by g_ep and g_cm.
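The grouping in this caption can be sketched as a decoding schedule. The snippet below is only an illustration under stated assumptions: the helper name `context_schedule` is hypothetical, and it assumes that each stage may condition on every group decoded before it, with the first group receiving the all-zero context. The caption does not say whether later stages see all earlier groups or only the preceding one.

```python
def context_schedule(groups):
    """Return, per stage, (voxels decoded now, voxels already available as context).

    Assumption: a stage conditions on all previously decoded groups.
    """
    decoded = []
    schedule = []
    for group in groups:
        schedule.append((sorted(group), sorted(decoded)))
        decoded.extend(group)
    return schedule


# The three groups named in the caption.
groups = [[1], [3, 6, 8], [2, 4, 5, 7]]

for stage, (targets, context) in enumerate(context_schedule(groups), start=1):
    # An empty context list stands for the all-zero context "0".
    label = context if context else "all-zero context"
    print(f"stage {stage}: decode {targets} given {label}")
```

All voxels inside one group are decoded in parallel, so the number of sequential steps equals the number of groups (three here) rather than the number of voxels, which is where the latency advantage over a fully autoregressive model comes from.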

Figure 4

A block diagram shows the Global Hyperprior Model, Context Model, and Joint Entropy Model connected sequentially.

The overview of the Global Hyperprior Model and the proposed parallel Context Model and Joint Entropy Model.

Figure 5

A graph of PSNR-Y (dB) versus bitrate (bpp) for six scenarios (8iVFBv2 longdress, 8iVFBv2 soldier, SensatUrban large-scale outdoor, Owlii dancer, Owlii model and ScanNet indoor), showing G-PCC (TMC13v23), G-PCC (TMC13v19), G-PCC (TMC13v6), Factorized, Hyper, Joint, Ours-Light and Ours curves.

Rate-distortion curves of various point cloud attribute compression approaches. Results are evaluated on the human-body datasets (8iVFBv2, Owlii), ScanNet and SensatUrban.

Figure 6

A table of images with five rows and four columns, each column containing two images. The rows are labeled G-PCC v6, G-PCC v23, Wang (Joint), Ours_L and Ours_H. The columns are labeled longdress q1, longdress q5, loot q1, loot q5, model q1, model q5, redandblack q1 and redandblack q5. Under each image are the bpp and PSNR_YUV values.

Visual quality comparison of the reconstructed point clouds on the 8iVFBv2 dataset.

Figure 7

A column of images on the left shows a full body image of a person in a patterned dress with three boxes around the face torso and lower body area. Four other columns show close up crops of the face torso and lower body corresponding to the areas highlighted in the first image for four different methods.

Visual quality detail comparison of the reconstructed point cloud on Longdress from the 8iVFBv2 dataset.

Figure 8

A graph of PSNR-Y (dB) versus bitrate (bpp) for four point clouds (longdress, loot, redandblack and soldier), showing curves for two G-PCC versions (TMC13v23 and TMC13v6), the Joint and Progressive learned codecs, Ours-Light and Ours.

Rate-distortion performance compared with other learning-based methods.

Figure 9

A graph of PSNR (dB) versus bitrate (bpp) for four 8iVFBv2 point clouds (longdress, loot, redandblack and soldier), showing L-Factorized (V1), L-Hyper (V2), L-Hyper-Autoregressive (V3), L-Hyper-MSC (V4), L-Hyper-MSC-Global (V5) and H-Hyper-MSC-Global (V6) curves.

Ablation studies of different entropy models and transform networks on the 8iVFBv2 dataset.

Table 1

Comparison of the proposed method with G-PCC and other learning-based methods in terms of BD-PSNR(Y) (dB) and BD-Rate (%).

Dataset	Point Cloud	Ours vs G-PCC (TMC13v23): BD-PSNR (dB) ↑	Ours vs G-PCC (TMC13v23): BD-Rate (%) ↓	Ours vs Hyper: BD-PSNR (dB) ↑	Ours vs Hyper: BD-Rate (%) ↓	Ours vs Joint: BD-PSNR (dB) ↑	Ours vs Joint: BD-Rate (%) ↓
8iVFBv2	longdress	+0.74	−17.58	+1.09	−26.81	+0.57	−16.19
8iVFBv2	loot	−1.09	+50.54	+0.60	−17.80	−0.16	+7.59
8iVFBv2	redandblack	−0.66	+24.20	+0.55	−15.43	−0.19	+8.42
8iVFBv2	soldier	+0.05	−1.29	+1.10	−28.66	+0.62	−18.98
8iVFBv2	Average	−0.24	+13.95	+0.84	−22.18	+0.21	−4.79
Owlii	basketball_player	−0.23	+9.73	+0.67	−21.14	−0.08	+4.03
Owlii	dancer	−0.42	+14.71	+0.60	−18.92	−0.10	+4.82
Owlii	exercise	−0.55	+28.79	+0.35	−15.16	−0.27	+17.44
Owlii	model	−0.56	+17.01	+0.62	−18.20	+0.10	−2.75
Owlii	Average	−0.44	+17.56	+0.56	−18.36	−0.09	+5.89
ScanNet	Average	−0.44	+14.63	+1.25	−31.61	+0.25	+18.30
SensatUrban	Average	+0.70	−17.33	+1.06	−30.27	+0.52	−6.77
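For readers unfamiliar with the BD-Rate metric reported in Table 1, the following is a minimal sketch of the classic Bjontegaard calculation [5], not the evaluation script used for the table. It assumes exactly four rate-distortion points per curve, fits a cubic to log10(bitrate) as a function of PSNR, and averages the gap between the two fits over the overlapping PSNR range. The function names and the sample points are hypothetical.

```python
import math


def lagrange_poly(xs, ys):
    """Polynomial through the given points (a cubic when four points are given)."""
    def p(x):
        total = 0.0
        for i, (xi, yi) in enumerate(zip(xs, ys)):
            term = yi
            for j, xj in enumerate(xs):
                if j != i:
                    term *= (x - xj) / (xi - xj)
            total += term
        return total
    return p


def simpson(f, a, b):
    """Simpson's rule on [a, b]; exact for polynomials up to degree three."""
    return (b - a) / 6.0 * (f(a) + 4.0 * f((a + b) / 2.0) + f(b))


def bd_rate(anchor, test):
    """BD-Rate in percent; each curve is a list of four (bitrate_bpp, psnr_db) pairs."""
    def fit(points):
        psnr = [q for _, q in points]
        log_rate = [math.log10(r) for r, _ in points]
        return lagrange_poly(psnr, log_rate), min(psnr), max(psnr)

    f_anchor, lo_a, hi_a = fit(anchor)
    f_test, lo_t, hi_t = fit(test)
    lo, hi = max(lo_a, lo_t), min(hi_a, hi_t)
    if hi <= lo:
        raise ValueError("RD curves do not overlap in PSNR")
    # Average difference of log10(rate) over the shared PSNR interval.
    avg_diff = (simpson(f_test, lo, hi) - simpson(f_anchor, lo, hi)) / (hi - lo)
    return (10.0 ** avg_diff - 1.0) * 100.0


# Hypothetical RD points: the test codec needs half the bitrate at every PSNR.
anchor = [(0.1, 30.0), (0.2, 33.0), (0.4, 36.0), (0.8, 39.0)]
test = [(0.05, 30.0), (0.1, 33.0), (0.2, 36.0), (0.4, 39.0)]
print(f"BD-Rate: {bd_rate(anchor, test):.2f}%")
```

A negative BD-Rate means the test codec needs less bitrate than the anchor at equal quality, which is why the column is marked with a down arrow. Swapping the roles of rate and PSNR in the fit gives BD-PSNR in the same way.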

Table 2

Complexity and compression performance comparison among different methods on the “longdress” point cloud (Anchor: G-PCC). “L” denotes a lightweight transform network comprising stacked convolutional layers, while “H” denotes a heavyweight network incorporating self-attention layers.

Methods	Transform	#Param. ↓	GPU Mem. (GB) ↓	Enc. Time (s) ↓	Dec. Time (s) ↓	BD-PSNR (dB) ↑	BD-Rate (%) ↓
G-PCC	RAHT	-	-	0.700	0.509	-	-
G-PCC (TMC13v19)	RAHT	-	-	4.281	3.889	+2.12	−45.49
G-PCC (TMC13v23)	RAHT	-	-	5.062	4.315	+2.38	−49.75
Factorized	L	3.554M	1.903	0.148	0.153	+1.68	−37.22
Hyper	L	8.758M	1.911	0.210	0.192	+2.10	−44.33
Joint (Hyper + Autoregressive)	L	9.872M	1.848	0.189	64.079	+2.37	−53.17
Ours-Light w/o Global	L	9.872M	1.913	0.313	0.222	+2.17	−45.36
Ours-Light	L	18.103M	1.919	0.442	0.362	+2.57	−50.48
Ours	H	31.127M	2.491	0.513	0.391	+3.16	−57.90
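The decoding-latency gap in Table 2 is easiest to read as a ratio. The short sketch below uses only the decoding times listed in the table; the helper name `speedup_over` is hypothetical.

```python
# Decoding times in seconds for "longdress", copied from Table 2.
dec_time_s = {
    "Hyper": 0.192,
    "Joint (Hyper + Autoregressive)": 64.079,
    "Ours-Light": 0.362,
    "Ours": 0.391,
}


def speedup_over(baseline, times):
    """Ratio of the baseline decoding time to every other method's decoding time."""
    base = times[baseline]
    return {name: base / t for name, t in times.items() if name != baseline}


ratios = speedup_over("Joint (Hyper + Autoregressive)", dec_time_s)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x faster decoding than Joint")
```

With these figures, Ours-Light and Ours decode roughly 177 and 164 times faster than the joint autoregressive model, while remaining within about a factor of two of the Hyper decoding time.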

Table 3

Complexity and compression performance comparison for the ablation studies (Anchor: L+Factorized).

Methods	BD-PSNR (dB) ↑	BD-Rate (%) ↓	Dec. Time (s) ↓
L+Factorized (V1)	-	-	0.153
L+Hyper (V2)	+1.22	−31.68	0.192
L+Hyper+Autoregressive (V3)	+1.52	−43.80	64.079
L+Hyper+MSC (V4)	+1.24	−32.99	0.222
L+Hyper+MSC+Global (V5)	+1.52	−37.32	0.362
H+Hyper+MSC+Global (V6)	+2.03	−47.07	0.391

[1] Alexiou, Tung, and Ebrahimi, “Towards neural network approaches for point cloud compression”, in Applications of Digital Image Processing XLIII, Vol. 11510, SPIE, 2020.

[2] Baert, Lagae, and Dutré, “Out-of-core construction of sparse voxel octrees”, in Proceedings of the 5th High-Performance Graphics Conference, 2013.

[3] Ballé, Minnen, Singh, S. J. Hwang, and Johnston, “Variational image compression with a scale hyperprior”, in International Conference on Learning Representations, 2018.

[4] Biswas, Liu, Wong, Wang, and Urtasun, “Muscle: Multi sweep compression of lidar using deep entropy models”, Advances in Neural Information Processing Systems, 2020, pp. 22170–.
