Table 8

Comparison of the standard three-factor disentanglement methods with the re-entry training framework on singing voices.

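| Methods | MSE ↓ | Phoneme estimation [%]: (Timbre ↓) | Phoneme estimation [%]: Variation ↑ | Phoneme estimation [%]: Pitch ↓ | Pitch estimation [%]: (Timbre ↓) | Pitch estimation [%]: Variation ↓ | Pitch estimation [%]: Pitch ↑ |
|---|---|---|---|---|---|---|---|
| Standard | 0.613 | (N/A) | 61.4 | 48.1 | (N/A) | 12.7 | 45.5 |
| Standard + RN | 1.130 | (N/A) | 60.8 | 58.5 | (N/A) | 11.9 | 53.2 |
| Standard + PD | 0.594 | (N/A) | 60.9 | 44.8 | (N/A) | 12.5 | 44.0 |
| Standard + RN + PD | 1.110 | (N/A) | 60.7 | 58.2 | (N/A) | 11.7 | 51.1 |
| Re-entry | 0.590 | (N/A) | 60.7 | 33.9 | (N/A) | 10.9 | 33.6 |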
