New Prak model (AUC 2025)

Customising Czech phonetic alignment using HuBERT and manual segmentation

☀︎ Adléta Hanžlová, Václav Hanžl

Institute of Phonetics, Charles University in Prague

This paper presents Prak, a forced alignment tool developed for Czech, with a focus on transparent modular design and phonetic accuracy. In addition to a rule-based pronunciation module and exception handling, Prak introduces a novel application of non-deterministic, backward-processing FSTs to model complex regressive assimilation processes in Czech consonant clusters. We further describe the integration of a HuBERT-based transformer model and training including extensive manually time-aligned data to enhance phone classification accuracy while maintaining ease of installation and use. Evaluation against a manually aligned test corpus demonstrates that the enhanced model significantly outperforms both our earlier Prak-CV model and the long-established previous forced alignment baseline. The new model reduces major boundary errors and mismatches, bringing alignment accuracy closer to manual phonetic segmentation standards for Czech. We emphasize both methodological transparency and practical usability, aiming to support phoneticians working with Czech as well as developers interested in extending the tool for other languages.

Cite this paper:

APA: Hanžlová, A. & Hanžl, V. (2025) Customising Czech phonetic alignment using HuBERT and manual segmentation. Acta Universitatis Carolinae Philologica 3/2025, 43–60. https://doi.org/10.14712/24646830.2025.20

bibtex:
@article{hanzlova_prak_2025,
title = {Customising {C}zech phonetic alignment using {HuBERT} and manual segmentation},
journal = {Acta Universitatis Carolinae Philologica},
volume = {2025},
number = {3},
author = {Hanžlová, Adléta and Hanžl, Václav},
year = {2025},
pages = {43--60},
doi = {10.14712/24646830.2025.20}
}