Proposal for the standardization of controlled vocabularies for television archives: case study at RTVE


Controlled vocabularies play a crucial role in indexing and retrieving content in audiovisual archives. Integrating SKOS and ontologies can enhance search processes and metadata generation. This work demonstrates how to integrate SKOS within ARCA, the system RTVE uses for audiovisual management. The proposal adapts ARCA's relational schema to unify its thesauri under the SKOS model. The process involves identifying concepts, labels, semantic relationships, and collections in order to build a single controlled vocabulary from the different thesauri, represented through a relational database schema. The unified thesauri are presented together with a mapping of vocabulary concepts to Wikidata items, which reinforces integration with Linked Data.
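As a rough illustration of the idea, the sketch below stores SKOS concepts, labels, semantic relations, collections, and mappings in relational tables. It is not ARCA's actual schema: every table name, column name, and helper function is hypothetical, the two sample concepts are invented for the example, and SQLite is used only because it runs without setup. The Wikidata URI points to the item for Spain.

```python
import sqlite3

# Hypothetical relational layout for a SKOS vocabulary (not ARCA's real schema).
SCHEMA = """
CREATE TABLE concept (
    id INTEGER PRIMARY KEY,
    source_thesaurus TEXT NOT NULL          -- thesaurus the concept came from
);
CREATE TABLE label (
    concept_id INTEGER NOT NULL REFERENCES concept(id),
    kind TEXT NOT NULL CHECK (kind IN ('prefLabel', 'altLabel', 'hiddenLabel')),
    lang TEXT NOT NULL,
    value TEXT NOT NULL
);
CREATE TABLE semantic_relation (
    subject_id INTEGER NOT NULL REFERENCES concept(id),
    kind TEXT NOT NULL CHECK (kind IN ('broader', 'related')),
    object_id INTEGER NOT NULL REFERENCES concept(id)
);
CREATE TABLE collection_member (
    collection TEXT NOT NULL,
    concept_id INTEGER NOT NULL REFERENCES concept(id)
);
CREATE TABLE mapping (
    concept_id INTEGER NOT NULL REFERENCES concept(id),
    kind TEXT NOT NULL CHECK (kind IN ('exactMatch', 'closeMatch')),
    target_uri TEXT NOT NULL
);
"""


def build_demo():
    """Create an in-memory database holding two invented sample concepts."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO concept VALUES (?, ?)",
        [(1, "geographic"), (2, "geographic")],
    )
    conn.executemany(
        "INSERT INTO label VALUES (?, ?, ?, ?)",
        [
            (1, "prefLabel", "es", "España"),
            (1, "altLabel", "es", "Reino de España"),
            (2, "prefLabel", "es", "Madrid"),
        ],
    )
    # Only skos:broader is stored; skos:narrower is its inverse and is derived.
    conn.execute("INSERT INTO semantic_relation VALUES (2, 'broader', 1)")
    conn.executemany(
        "INSERT INTO collection_member VALUES (?, ?)",
        [("Places", 1), ("Places", 2)],
    )
    conn.execute(
        "INSERT INTO mapping VALUES (1, 'exactMatch', "
        "'http://www.wikidata.org/entity/Q29')"
    )
    return conn


def narrower_labels(conn, concept_id, lang="es"):
    """Return preferred labels of the concepts whose broader concept is concept_id."""
    rows = conn.execute(
        """
        SELECT l.value
        FROM semantic_relation r
        JOIN label l ON l.concept_id = r.subject_id
        WHERE r.kind = 'broader' AND r.object_id = ?
          AND l.kind = 'prefLabel' AND l.lang = ?
        ORDER BY l.value
        """,
        (concept_id, lang),
    ).fetchall()
    return [row[0] for row in rows]


def exact_match(conn, concept_id):
    """Return the exactMatch target URI for a concept, or None if unmapped."""
    row = conn.execute(
        "SELECT target_uri FROM mapping WHERE concept_id = ? AND kind = 'exactMatch'",
        (concept_id,),
    ).fetchone()
    return row[0] if row else None
```

Keeping the source thesaurus as a column on each concept is one way to merge several thesauri into a single vocabulary while still recording where each concept originated. Storing only the broader direction avoids having to keep two inverse rows in sync.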


Keywords: SKOS; TV audiovisual archives; thesauri integration; controlled vocabularies




Copyright (c) 2023 Virginia Bazán-Gil, Juan-Antonio Pastor-Sánchez

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.



SCIRES-IT, e-ISSN 2239-4303

Journal founded by Virginia Valzano