A Framework for Learning Visual and Tactile Correlation

Research output: ThesisMaster's Thesis

Standard

A Framework for Learning Visual and Tactile Correlation. / Schödinger, Benjamin.
2022.

Research output: ThesisMaster's Thesis

Vancouver

Bibtex - Download

@mastersthesis{fdb68d3b107141808c8f6bce39db9444,
title = "A Framework for Learning Visual and Tactile Correlation",
abstract = "Tactile data is an important source of information for applications in fields such as object manipulation or object recognition. However, the process of gathering the tactile data can be inconvenient and time consuming. For example a robotic manipulator would have to grasp the object to be moved every time to gather the tactile information and then again to finally pick it up. This thesis proposes a way to overcome this kind of issue by implementing a method to predict what the tactile feedback sensors would measure when touching an object at a given position, based on two dimensional visual data. Therefore, visual-tactile data pairs were gathered to train a Convolutional Neural Network that takes images of objects with the positions of interest marked as the input and the force vector as the output. To improve performances, the edges of the input images were extracted using the Canny algorithm, a new architecture was developed and the training process optimised with the Bayesian Optimisation algorithm. An evaluation strategy was developed and a test set built, to be able to effectively compare the different models to each other. The result is a framework that is capable of understanding the spacial relationship between tactile sensors and surfaces but lacks in accuracy, as a result of noisy data. The noise is caused by inaccurate sensors and a sub-optimal acquisition strategy.",
keywords = "Visuelle and tactile correlation, Neural network, Machine Learning, Artificial Intelligence, Computer vision, Visuelle und taktile Korrelation, Neuronales Netzwerk, Machine Learning, K{\"u}nstliche Intelligenz, Computer vision",
author = "Benjamin Sch{\"o}dinger",
note = "no embargo",
year = "2022",
doi = "10.34901/mul.pub.2023.100",
language = "English",
school = "Montanuniversitaet Leoben (000)",

}

RIS (suitable for import to EndNote) - Download

TY - THES

T1 - A Framework for Learning Visual and Tactile Correlation

AU - Schödinger, Benjamin

N1 - no embargo

PY - 2022

Y1 - 2022

N2 - Tactile data is an important source of information for applications in fields such as object manipulation or object recognition. However, the process of gathering the tactile data can be inconvenient and time consuming. For example a robotic manipulator would have to grasp the object to be moved every time to gather the tactile information and then again to finally pick it up. This thesis proposes a way to overcome this kind of issue by implementing a method to predict what the tactile feedback sensors would measure when touching an object at a given position, based on two dimensional visual data. Therefore, visual-tactile data pairs were gathered to train a Convolutional Neural Network that takes images of objects with the positions of interest marked as the input and the force vector as the output. To improve performances, the edges of the input images were extracted using the Canny algorithm, a new architecture was developed and the training process optimised with the Bayesian Optimisation algorithm. An evaluation strategy was developed and a test set built, to be able to effectively compare the different models to each other. The result is a framework that is capable of understanding the spacial relationship between tactile sensors and surfaces but lacks in accuracy, as a result of noisy data. The noise is caused by inaccurate sensors and a sub-optimal acquisition strategy.

AB - Tactile data is an important source of information for applications in fields such as object manipulation or object recognition. However, the process of gathering the tactile data can be inconvenient and time consuming. For example a robotic manipulator would have to grasp the object to be moved every time to gather the tactile information and then again to finally pick it up. This thesis proposes a way to overcome this kind of issue by implementing a method to predict what the tactile feedback sensors would measure when touching an object at a given position, based on two dimensional visual data. Therefore, visual-tactile data pairs were gathered to train a Convolutional Neural Network that takes images of objects with the positions of interest marked as the input and the force vector as the output. To improve performances, the edges of the input images were extracted using the Canny algorithm, a new architecture was developed and the training process optimised with the Bayesian Optimisation algorithm. An evaluation strategy was developed and a test set built, to be able to effectively compare the different models to each other. The result is a framework that is capable of understanding the spacial relationship between tactile sensors and surfaces but lacks in accuracy, as a result of noisy data. The noise is caused by inaccurate sensors and a sub-optimal acquisition strategy.

KW - Visuelle and tactile correlation

KW - Neural network

KW - Machine Learning

KW - Artificial Intelligence

KW - Computer vision

KW - Visuelle und taktile Korrelation

KW - Neuronales Netzwerk

KW - Machine Learning

KW - Künstliche Intelligenz

KW - Computer vision

U2 - 10.34901/mul.pub.2023.100

DO - 10.34901/mul.pub.2023.100

M3 - Master's Thesis

ER -