Editing Video Streams Through Natural Language Embeddings: Keyframe Sequencing Using Transformers

  • Luís Arandas
  • Pedro Sarmento
  • Mick Grierson
  • Miguel Carvalhais
Keywords: automatic video editing, video sequencing, transformer models, cognitive architecture, video montage

Abstract

The automation of video editing with artificial intelligence (AI) is a flourishing field of research, whether the aim is to save labour or to prompt unexpected ideas. Informed by theories of human-like reasoning, problem-solving and learning, these technologies can produce video sequences with little human intervention. In this article we present the design and implementation of a system that sequences video datasets based on semantic correlations with natural language. Our system generates videos automatically using tractable principles for montage, and lets human editors step back from the selection of individual elements if and when they want.

Published
2025-02-26