Student Competition in Audio-Visual Speech Synthesis in the Serbian Language
Creating digital avatars that speak naturally and convincingly in real time remains one of the most complex challenges in artificial intelligence. While Text-to-Speech systems have reached high levels of quality, realistic facial animation, especially accurate lip, jaw and facial expression synchronization is still an open research problem. The challenge becomes even greater in low-resource languages […]