Speechmatics releases Universal Time Alignment, our language-independent forced-alignment service. It matches the words in a text file to their counterparts in an audio file, accurately and automatically, to improve content discoverability in any language.
The R&D team at Speechmatics have used their deep learning expertise to create a highly accurate and automated system for aligning audio to text.
By synchronising audio to text, Universal Time Alignment can be used to create closed captions and subtitles, index archives and enrich human-generated transcripts with extra metadata, tasks that would usually be carried out laboriously by hand. In an industry where metadata and searchability are becoming increasingly crucial, time alignment offers a simple and very cost-effective way of making audio, video and text searchable in any language.
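To illustrate the captioning use case, here is a minimal sketch of how word-level timings could be grouped into SRT subtitle cues. It does not show the Universal Time Alignment API or its output format: the `AlignedWord` structure, the `words_to_srt` helper and the fixed words-per-cue rule are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class AlignedWord:
    """Hypothetical alignment result: one word with its start/end time in seconds."""
    text: str
    start: float
    end: float


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def words_to_srt(words: list[AlignedWord], max_words: int = 7) -> str:
    """Group aligned words into numbered SRT cues of at most max_words each."""
    cues = []
    for i in range(0, len(words), max_words):
        group = words[i:i + max_words]
        index = len(cues) + 1
        start = srt_timestamp(group[0].start)  # cue starts with its first word
        end = srt_timestamp(group[-1].end)     # and ends with its last word
        text = " ".join(w.text for w in group)
        cues.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)
```

A production captioning tool would normally break cues on punctuation, pauses and reading speed rather than on a fixed word count; the fixed count just keeps the sketch short.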
To create Universal Time Alignment, we extracted elements from our modular speech recognition technology, re-engineered them for the purpose and added alignment-specific technology based on our machine learning expertise and experience. As a result, we have created a system that is not only robust and accurate but, crucially, able to cope with any language in the world.