Proceedings: GI 2019

A Tool for Navigating and Editing 360 Video of Social Conversations into Shareable Highlights

Anh Truong (Stanford University), Maneesh Agrawala (Stanford University)

Proceedings of Graphics Interface 2019: Kingston, Ontario, 28 - 31 May 2019

DOI 10.20380/GI2019.14

  • BibTex

    @inproceedings{Truong:2019:10.20380/GI2019.14,
    author = {Truong, Anh and Agrawala, Maneesh},
    title = {A Tool for Navigating and Editing 360 Video of Social Conversations into Shareable Highlights},
    booktitle = {Proceedings of Graphics Interface 2019},
    series = {GI 2019},
    year = {2019},
    issn = {0713-5424},
    isbn = {978-0-9947868-4-5},
    location = {Kingston, Ontario},
    numpages = {9},
    doi = {10.20380/GI2019.14},
    publisher = {Canadian Information Processing Society},
    keywords = {360 Video, Video Editing, Social Conversations, Video Highlights},
    }

Abstract

We present ConvCut, an interactive tool for efficiently navigating and editing 360 video of social conversations into shareable video highlights of the memorable moments. ConvCut starts by obtaining a high-quality transcript of the conversation and uses it to segment the video into one 360 video clip per line of speech. It then applies audio, video and text analysis to label the clips with information including the spatial location of faces, the current speaker, the topics of conversation, instances of laughter and extreme changes in volume, facial expression, or gestural motion. The resulting structure lets users navigate the video using keyword search over the transcript and labels to quickly find memorable moments. Users can mark the lines corresponding to these moments and ConvCut edits together the corresponding video clips, automatically choosing a regular field of view (RFOV) framing that emphasizes the speaker of each line. If desired, users can modify the automatic edit to include alternative framings or reactions from others in the group. We demonstrate that with ConvCut , first-time users can easily edit long social conversations (25-60 min) into short highlight videos (0.27-2 min) and share them with others. The resulting highlights include jokes, reactions to pranks, funny stories and interactions with children.