Annotation of screencasts: Distinguishing Between Relevant and Irrelevant Sections

Research output: ThesisMaster's Thesis

Standard

Harvard

APA

Ulm, T. (2022). Annotation of screencasts: Distinguishing Between Relevant and Irrelevant Sections. [Master's Thesis, Montanuniversitaet Leoben (000)].

Bibtex - Download

@mastersthesis{38ec17494954485eab86b233e36c0417,
title = "Annotation of screencasts: Distinguishing Between Relevant and Irrelevant Sections",
abstract = "This thesis proposes a method to annotate screencasts, in order to identify sections of significance. The proposed approach quantifies the relevance frame by frame over the duration of the recording, making it easier for an external observer to navigate to sections of interest. Within this work, we implemented an approach for annotating screencasts of programming activities. Given a recording of screencasts only, the proposed method measures the amount of written code between each pair of subsequent frames. The approach is divided into three steps: extracting the code editor of a development environment, separating individual characters within those regions, and finally analyzing changes of those characters between subsequent frames. The detection of code editors is performed using computer vision methods that detect features characteristic for those regions. Character segmentation algorithms are then applied to the detected regions, in order to decide whether it contains a monospaced font, as this is a distinct attribute for fonts used in code editors. Changes in those characters are then analyzed, taking into account possible disturbances. The results were evaluated using 56 screencasts. The recordings originated from three different programming exercises, completed by 20 different students, each student using one of two development environments. The evaluation of those recordings result in a median accuracy of 83.4% with a median F2 score of 81.5%.",
keywords = "Ereignisdetektion, Screencasts, Programmiert{\"a}tigkeiten, Computer Vision, Event Detection, Screencasts, Programming Activities, Computer Vision",
author = "Tabea Ulm",
note = "no embargo",
year = "2022",
language = "English",
school = "Montanuniversitaet Leoben (000)",

}

RIS (suitable for import to EndNote) - Download

TY - THES

T1 - Annotation of screencasts

T2 - Distinguishing Between Relevant and Irrelevant Sections

AU - Ulm, Tabea

N1 - no embargo

PY - 2022

Y1 - 2022

N2 - This thesis proposes a method to annotate screencasts, in order to identify sections of significance. The proposed approach quantifies the relevance frame by frame over the duration of the recording, making it easier for an external observer to navigate to sections of interest. Within this work, we implemented an approach for annotating screencasts of programming activities. Given a recording of screencasts only, the proposed method measures the amount of written code between each pair of subsequent frames. The approach is divided into three steps: extracting the code editor of a development environment, separating individual characters within those regions, and finally analyzing changes of those characters between subsequent frames. The detection of code editors is performed using computer vision methods that detect features characteristic for those regions. Character segmentation algorithms are then applied to the detected regions, in order to decide whether it contains a monospaced font, as this is a distinct attribute for fonts used in code editors. Changes in those characters are then analyzed, taking into account possible disturbances. The results were evaluated using 56 screencasts. The recordings originated from three different programming exercises, completed by 20 different students, each student using one of two development environments. The evaluation of those recordings result in a median accuracy of 83.4% with a median F2 score of 81.5%.

AB - This thesis proposes a method to annotate screencasts, in order to identify sections of significance. The proposed approach quantifies the relevance frame by frame over the duration of the recording, making it easier for an external observer to navigate to sections of interest. Within this work, we implemented an approach for annotating screencasts of programming activities. Given a recording of screencasts only, the proposed method measures the amount of written code between each pair of subsequent frames. The approach is divided into three steps: extracting the code editor of a development environment, separating individual characters within those regions, and finally analyzing changes of those characters between subsequent frames. The detection of code editors is performed using computer vision methods that detect features characteristic for those regions. Character segmentation algorithms are then applied to the detected regions, in order to decide whether it contains a monospaced font, as this is a distinct attribute for fonts used in code editors. Changes in those characters are then analyzed, taking into account possible disturbances. The results were evaluated using 56 screencasts. The recordings originated from three different programming exercises, completed by 20 different students, each student using one of two development environments. The evaluation of those recordings result in a median accuracy of 83.4% with a median F2 score of 81.5%.

KW - Ereignisdetektion

KW - Screencasts

KW - Programmiertätigkeiten

KW - Computer Vision

KW - Event Detection

KW - Screencasts

KW - Programming Activities

KW - Computer Vision

M3 - Master's Thesis

ER -