[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"header-page-type:en:/custom-transcription-system:k496kf42yqvn5j5j7mans3px":3,"mdc-u6c42r-key":4,"mdc--u14dgo-key":22,"mdc-1wr19u-key":71,"mdc-drtl40-key":80},"case-study",{"data":5,"body":6},{},{"type":7,"children":8},"root",[9,17],{"type":10,"tag":11,"props":12,"children":13},"element","p",{},[14],{"type":15,"value":16},"text","National Media and Infocommunications Authority (of Hungary)",{"type":10,"tag":11,"props":18,"children":19},{},[20],{"type":15,"value":21},"The National Media and Infocommunications Authority is the independent regulatory body of the Hungarian state responsible for overseeing the media and infocommunications market. Its tasks include ensuring interference-free frequency usage, filtering out non-compliant equipment, and supervising the quality of electronic media and infocommunications services. Its goal is to ensure fair competition, media pluralism, and the enforcement of user interests.",{"data":23,"body":24},{},{"type":7,"children":25},[26,31,56,61,66],{"type":10,"tag":11,"props":27,"children":28},{},[29],{"type":15,"value":30},"The Authority set out to design and implement a system capable of:",{"type":10,"tag":32,"props":33,"children":34},"ul",{},[35,41,46,51],{"type":10,"tag":36,"props":37,"children":38},"li",{},[39],{"type":15,"value":40},"producing textual transcripts of media assets",{"type":10,"tag":36,"props":42,"children":43},{},[44],{"type":15,"value":45},"identifying speaker changes (identifying speakers)",{"type":10,"tag":36,"props":47,"children":48},{},[49],{"type":15,"value":50},"recognizing name entities",{"type":10,"tag":36,"props":52,"children":53},{},[54],{"type":15,"value":55},"determining the start and end of spoken words with hundredth-of-a-second accuracy",{"type":10,"tag":11,"props":57,"children":58},{},[59],{"type":15,"value":60},"In addition, all system functions must be accessible via an easy-to-use web interface.",{"type":10,"tag":11,"props":62,"children":63},{},[64],{"type":15,"value":65},"The above requirements serve multiple purposes. On one hand, video and audio media assets become searchable in text form. Such use cases include content analysis of political news and magazine programs. On the other hand, checks such as monitoring advertisements and sponsorships, tracking product placements, verifying protection of minors, examining age ratings, and supervising accessibility for the hearing and visually impaired can be enabled or automated. Thirdly, in official procedures, it is expected that investigation reports contain verbatim transcripts of the examined program elements.",{"type":10,"tag":11,"props":67,"children":68},{},[69],{"type":15,"value":70},"Until now, transcripts were prepared exclusively by human resources, which was an extremely time- and labor-intensive process. Thus, the Authority did not have adequate IT tools for the efficient monitoring of these tasks and regulations.",{"data":72,"body":73},{},{"type":7,"children":74},[75],{"type":10,"tag":11,"props":76,"children":77},{},[78],{"type":15,"value":79},"A system capable of handling unique and automatic transcription processes, with completed transcripts viewable and exportable (in SRT, Word, and JSON formats) via the web user interface. The system includes an accuracy measurement module that evaluates the quality of machine-generated transcripts using objective indicators (word and character error rates, speaker changes, timestamp and named entity recognition accuracy). The system’s trainable models guarantee consistently accurate transcripts and long-term support for the Authority’s tasks.",{"data":81,"body":82},{},{"type":7,"children":83},[84],{"type":10,"tag":11,"props":85,"children":86},{},[87],{"type":15,"value":88},"During the pilot, the system proved capable of automatically processing television and radio recordings and producing reliable transcripts. With appropriate training data, it is trainable. Moreover, it is not only applicable to linear broadcasting, but also suitable for processing content appearing on social media."]