ARTIFICIAL INTELLIGENCE IN TV CONTENT AUDIO DESCRIPTION
Abstract
Abstract: Providers of audio-visual media services, especially media public services, are obliged to provide the accesibility of TV contents to the blind and visually impaired. Legal regulations differ from one country to another, but at the core of all legal acts, in case there is no obligatory quota, there is the tendency to make as many TV materials as possible accessible to the persons with impaired sight. At the age of expanded programme offer through linear broadcast of basic and specialised TV channels, along with video streaming platforms and non-linear services, particularly of “videos on demand“ (VoD), the development and implementation of artificial intelligence are recognised as the tool that could provide, with the implicit quality, a wider contents accessibility. Media experts, just like the users, recognise the advantages, but also the shortcomings in practical usage, the concept that will be presented in this paper through scientific interviews conducted with the experts for the access services from European public service media, such as the BBC and TV France, and with a blind person, a specialised consultant within the team of the Radio-Television of Serbia for creating scenarios for audio-description. The aim of this paper is to use a case study, scientific interviews and analysis of technical-technological possibilities, current initiatives in the development of services for availability and new programme activities, to observe the benefits, as well as to point at the drawbacks in the current degree of implementing artificial intelligence and in the advantages of the presence of human factor in the process of adjusting the programmes for the blind and visually-impaired.
References
American Council of the Blind [ACB]. (2025). Guidelines and Best Practices for the Use of Text-to-Speech (TTS) in Audio Description. American Council of the Blind. Posećeno: 24.8.2025.URL:2025 Proposed Resolutions, Bylaws and Standing Rules | American Council of the Blind
ARCOM. (2009). Décret n° 2009-796 du 23 juin 2009 fixant le cahier des charges de la société nationale de programme France Télévisions. (Dernière mise à jour des données de ce texte : 06 décembre 2024). Posećeno 27.8.2025. URL:https://www.legifrance.gouv.fr/loda/id/JORFTEXT000020788471/2025-03-30
ARCOM. (2018). Le guide de l'audiodescription. Les droits des personnes handicapées. Posećeno 27.8.2025. URL: https://www.arcom.fr/nous-connaitre-nos-missions/garantir-le-pluralisme-et-la-cohesion-sociale/les-droits-des-personnes-handicapees
Audio description coalition [ADC]. (2009). National Standards for Audio Description and Code of Professional Conduct for Describers. Audio description solutions.Posećeno: 18.8.2025. URL: https://audiodescriptionsolutions.com/wp-content/uploads/2020/04/adc_standards_090615.pdf
Autor. (2020).
Cheema, M., Elahimanesh, S., Martin, S., Seifi, H. & Pazli, P. (2025). DescribePro: Collaborative audio description with human-AI interaction. arXiv. Posećeno 26.8.2025. https://doi.org/10.48550/arXiv.2508.01092
CRTC. (2015). Broadcasting Regulatory Policy CRTC 2015-104. Canadian Radio-television and Telecommunications Commission. Posećeno: 24.8.2025. URL:https://crtc.gc.ca/eng/archive/2015/2015-514.pdf
Directive 2010/13/EU of the European Parliament and of the Council. (2010). Audiovisual Media Services Directive. EUR-lex. Posećeno: 24.8.2025. URL:https://eur-lex.europa.eu/eli/dir/2010/13/oj
Federal Communications Commission [FCC]. (2025). Audio Description. Posećeno: 24.8.2025. https://www.fcc.gov/consumers/guides/audio-description?
Federal register. (2000). Implementation of Video Description of Video Programming. Federal Communications Commission 47 CFR Part 7. Posećeno: 18.8.2025. URL:https://www.federalregister.gov/documents/2000/09/11/00-23154/implementation-of-video-description-of-video-programming
Gao,Y., Fischer,L., Lintner, A. & Ebling, S. (2025). Audio Description Generation in the Era of LLMs and VLMs: A Review of Transferable Generative AI Technologies. In Findings of the Association for Computational Linguistics: NAACL 2025 (pp. 471–490). Albuquerque, New Mexico.
International Agency for the Prevention of Blindness [IAMB]. (2020). Magnitude of sight loss. Vision Atlas. Posećeno: 17.8.2025.URL: https://visionatlas.iapb.org/topics/magnitude-of-sight-loss/
International Telecommunication Union [ITU]. (2022). Guidance on audio description. ITU-T Rec. T.701.21 Posećeno: 20.8.2025. URL: https://www.itu.int/ITU-T/recommendations/rec.aspx?id=14972
Kaneko, H. Takahashi, M., & Okuda, M. (2024). Latest Trends in Supporting Technology for Media Accessibility. NHK STRL. Posećeno, 27.8.2025. URL:https://www.nhk.or.jp/strl/english/publica/bt/98/2.html
Klatt, H. D. (1987). Sound files and descriptions from “Review of text-to-speech conversion for English.” Journal of the Acoustical Society of America, 82, 737-793. Posećeno 24.8.2025. URL:https://acousticstoday.org/klatts-speech-synthesis-d/
Kokotajlo, D. Scott, A. Larsen, T. Lifland, E & Dean, R. (2005). AI 2027. AI Futures project. Posećeno: 25.8.2025. URL: https://ai-2027.com
Kurihara, K., Imai, A., Seiyama, N., Shimizu, T., Sato, S., Yamada, I., Kumano, T., Tako, R., Miyazaki, T., Ichiki, M., Takagi, T., & Sumiyoshi, H. (2019). Automatic Generation of Audio Descriptions for Sports Programs. SMPTE Motion Imaging Journal (2019). https://doi.org/10.5594/JMI. 2018.2879261
Lee, S. H., Wang, J., Fan, D., Zhang, Z., Liu, L., Hao, X., Bhat, V., & Li, X. (2025). Now you see me: Context-aware automatic audio description [Conference paper]. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2025). arXiv. https://doi.org/10.48550/arXiv.2412.10002
Locke, K. (2025, June 1). AI is now used for audio description. But it should be accurate and actually useful for people with low vision. Modern Science. Posećeno: 18.8.2025. URL:https://modernsciences.org/ai-audio-description-accessibility-low-vision-accuracy-may-2025/
Masleša. P. (2020). AI - artificial intelligence - gift or reason for fear? Poslovni Inkubator. Posećeno 25.8.2025.URL: https://inkubator.biz/vi-vestacka-inteligencija/
NHK STRL. (n.d.) NHK Technical Catalog: Audio commentary production and distribution technology. NHK Science & Technology Laboratories. Posećeno: 26.8.2025. https://www.nhk-fdn.or.jp/es/transfer/catalog/barrierfree_05.html
Ofcom. (2024). Ofcom’s Code on Television Access Service. Ofcom. Posećeno: 24.8.2025. URL: https://www.ofcom.org.uk/siteassets/resources/documents/tv-radio-and-on-demand/broadcast-codes/code-on-television-access-services/ofcom-code-television-access-services.pdf?v=370035
Ofcom. (2025). Television channels required to provide access services in 2026. Ofcom. Posećeno: 25.8.2025. URL: https://www.ofcom.org.uk/siteassets/resources/documents/tv-radio-and-on-demand/broadcast-guidance/television-channels-required-to-provide-access-services-in-2026.pdf?v=399983
ORF (2025, February 18). ORF barrierefrei – Aktionsplan 2024-2027. Posećeno: 17.8.2025.URL:https://der.orf.at/unternehmen/humanitarian/barrierefreiheit/aktionsplan-barrierefreiheit104.html
Royal National Institute of Blind People [RNIB]. (2025). RNIB's Accessible media services overview 2024.. Posećeno: 24.8.2025. URL:https://www.rnib.org.uk/news/rnibs-accessible-media-services-overview-2024/?
Schwartz, A. H., Chalson, M., Beisiegel, N., Caddigan, D. J., Zimmerman, R. S., Antunes, C. S., & Moutis, N. R. (2024). Automated audio description system and method (U.S. Patent No. 12142047B1). United States Patent and Trademark Office. https://patents.google.com/patent/US12142047B1/
Snyder, J. (2009). In memoriam: Margaret R. Pfanstiehl, radio reading service & audio description Pioneer. American Council of the blind. Posećeno: 18.8.2025. URL: https://www.acb.org/memoriam-margaret-pfanstiehl
Takou, R. & Fujimori, R. (2025). Audio Description Generation Technology for Live Sport Broadcasting. NHK Science & Technology Laboratories. Posećeno: 26.8. 2025. URL:https://www.nhk.or.jp/strl/english/publica/giken_dayori/243/4.html
World Health Organisation [WHO]. (2023, August 10). Blindness and vision impairment. World Health Organisation. Posećeno, 17.8.2025. URL: https://www.who.int/news-room/fact-sheets/detail/blindness-and-visual-impairment?
World Wide Web Consortium [W3C]. (2025). Timed Text Markup Language 2 (TTML2) – DAPT: Dubbing and Audio description Profiles of TTML2 (W3C Working Draft).. Posećeno: 27.8. 2025. https://www.w3.org/TR/dapt/
Ye, X., Song, Y., Zhou, S., & Li, L. (2025). FocusedAD: Character-centric movie audio description. arXiv. https://doi.org/10.48550/arXiv.2504.12157
Zakon o elektronskim medijima [ZEM]. (2023). Posećeno: 26.8.2025. URL: https://pravno-informacioni-sistem.rs/eli/rep/sgrs/skupstina/zakon/2023/92/3/reg
Copyright
Authors retain copyright of the published papers and grant to the publisher the non-exclusive right to publish the article, to be cited as its original publisher in case of reuse, and to distribute it in all forms and media.
Licensing
The published articles will be distributed under the Creative Commons Attribution ShareAlike 4.0 International license (CC BY-SA). It is allowed to copy and redistribute the material in any medium or format, and remix, transform, and build upon it for any purpose, even commercially, as long as appropriate credit is given to the original author(s), a link to the license is provided, it is indicated if changes were made and the new work is distributed under the same license as the original.
Users are required to provide full bibliographic description of the original publication (authors, article title, journal title, volume, issue, pages), as well as its DOI code. In electronic publishing, users are also required to link the content with both the original article published in CM: Communication and Media and the licence used.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Self-archiving policy
Authors are permitted to deposit author’s publisher's version (PDF) of their work in an institutional repository, subject-based repository, author's personal website (including social networking sites, such as ResearchGate, Academia.edu, etc.), at any time after publication.
Full bibliographic information (authors, article title, journal title, volume, issue, pages) about the original publication must be provided and links must be made to the article's DOI and the license.
Disclaimer
The views expressed in the published works do not express the views of the Editors and the Editorial Staff. The authors take legal and moral responsibility for the ideas expressed in the articles. Publisher shall have no liability in the event of issuance of any claims for damages. The Publisher will not be held legally responsible should there be any claims for compensation.
