JSAI2022

Presentation information


[2O4-GS-7] Vision, speech media processing

Wed. Jun 15, 2022 1:20 PM - 3:00 PM Room O (Room 510)

Chair: 岡部 浩司 (NEC) [remote]

1:40 PM - 2:00 PM

[2O4-GS-7-02] Automatic converter to portrait video for news videos

〇Nanoka Mizukado1, Takuto Yamauchi1, Kenji Tei1 (1. Waseda University)

Keywords: video processing

As watching video on smartphones becomes more common, vertical videos are becoming more popular on social networking services such as YouTube and TikTok. Because people generally hold smartphones vertically, vertical videos can be more appealing than horizontal videos, depending on the target market. In this paper, we focus on news videos, which must convey information quickly and reliably, and propose a tool that automatically converts news videos to a vertical (9:16) format. The converter works in three steps. First, it identifies the area of greatest interest in the video and crops to it. Next, it acquires textual information from the cropped content. Finally, it re-lays out the textual information in the vertical video. As a result, the video is cropped while minimizing the loss of semantically important content (textual information). We ran the proposed tool on 20 news videos that were actually broadcast and confirmed that it is sufficiently usable.
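The cropping step can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the area of interest is found or how the crop window is chosen, so the code assumes the region of interest is already given as a bounding box and simply centers the largest 9:16 window on it, clamped to the frame. The names `Box`, `portrait_crop`, and `needs_relayout` are hypothetical, and treating any text box that is not fully inside the crop as a candidate for re-layout is likewise an assumption.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle in pixels (hypothetical helper type)."""
    x: int
    y: int
    w: int
    h: int


def portrait_crop(frame_w: int, frame_h: int, roi: Box,
                  aspect_w: int = 9, aspect_h: int = 16) -> Box:
    """Largest aspect_w:aspect_h window centered on roi, clamped to the frame."""
    # Start from the full frame height and derive the matching width.
    crop_h = frame_h
    crop_w = crop_h * aspect_w // aspect_h
    # If that is wider than the frame, use the full width instead.
    if crop_w > frame_w:
        crop_w = frame_w
        crop_h = crop_w * aspect_h // aspect_w
    # Center the window on the region of interest.
    cx = roi.x + roi.w // 2
    cy = roi.y + roi.h // 2
    # Clamp so the window never leaves the frame.
    x = min(max(cx - crop_w // 2, 0), frame_w - crop_w)
    y = min(max(cy - crop_h // 2, 0), frame_h - crop_h)
    return Box(x, y, crop_w, crop_h)


def needs_relayout(text_box: Box, crop: Box) -> bool:
    """True if the text box is not fully contained in the crop window.

    Assumption: such text would be cut off by the crop and is therefore
    a candidate for being re-laid out in the vertical frame.
    """
    inside = (
        text_box.x >= crop.x
        and text_box.y >= crop.y
        and text_box.x + text_box.w <= crop.x + crop.w
        and text_box.y + text_box.h <= crop.y + crop.h
    )
    return not inside
```

For a 1920x1080 frame, `portrait_crop(1920, 1080, Box(800, 200, 320, 400))` yields a 607x1080 window, and a caption spanning the full frame width would be flagged by `needs_relayout`. A real converter would also need to keep the window stable across frames, which this sketch does not attempt.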
