Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Udo Bankhofer is active.

Publication


Featured researches published by Udo Bankhofer.


machine learning and data mining in pattern recognition | 2012

Hot deck methods for imputing missing data: the effects of limiting donor usage

Dieter William Joenssen; Udo Bankhofer

Missing data methods, within the data mining context, are limited in computational complexity due to large data amounts. Amongst the computationally simple yet effective imputation methods are the hot deck procedures. Hot deck methods impute missing values within a data matrix by using available values from the same matrix. The object, from which these available values are taken for imputation within another, is called the donor. The replication of values leads to the problem, that a single donor might be selected to accommodate multiple recipients. The inherent risk posed by this is that too many, or even all, missing values may be imputed with the values from a single donor. To mitigate this risk, some hot deck variants limit the amount of times any one donor may be selected for donating its values. This inevitably leads to the question under which conditions such a limitation is sensible. This study aims to answer this question though an extensive simulation. The results show rather clear differences between imputations by hot deck methods in which the donor limit was varied. In addition to these differences, influencing factors are identified that determine whether or not a donor limit is sensible.


GfKl | 2014

On Limiting Donor Usage for Imputation of Missing Data via Hot Deck Methods

Udo Bankhofer; Dieter William Joenssen

Hot deck methods impute missing values within a data matrix by using available values from the same matrix. The object from which these available values are taken for imputation is called the donor. Selection of a suitable donor for the receiving object can be done within imputation classes. The risk inherent to this strategy is that any donor might be selected for multiple value recipients. In extreme cases one donor can be selected for too many or even all values. To mitigate this donor over usage risk, some hot deck procedures limit the amount of times one donor may be selected for value donation. This study answers if limiting donor usage is a superior strategy when considering imputation variance and bias in parameter estimates.


Archives of Data Science, Series A (Online First) | 2017

Decision Trees for the Imputation of Categorical Data

Tobias Rockel; Dieter William Joenssen; Udo Bankhofer

Resolving the problem of missing data via imputation can theoretically be done by any prediction model. In the field of machine learning, a well known type of prediction model is a decision tree. However, the literature on how suitable a decision tree is for imputation is still scant to date. Therefore, the aim of this paper is to analyze the imputation quality of decision trees. Furthermore, we present a way to conduct a stochastic imputation using decision trees. We ran a simulation study to compare the deterministic and stochastic imputation approach using decision trees among each other and with other imputation methods. For this study, real datasets and various missing data settings are used. In addition, three different quality criteria are considered. The results of the study indicate that the choice of imputation method should be based on the intended analysis.


Archive | 2012

Aufgaben zur Wahrscheinlichkeitsrechnung

Udo Bankhofer; Jürgen Vogel

Professor Muller halt am Donnerstag, dem 27. Mai 2010, wie gewohnt die Statistikvorlesung. Es ist ein besonderer Tag fur ihn, denn er hat seinen 60. Geburtstag. Im Horsaal sind n Studierende anwesend.


Archive | 2012

Aufgaben zur beschreibenden Statistik

Udo Bankhofer; Jürgen Vogel

In einem Unternehmen fallen fur die Versendung von funf ausgewahlten Erzeugnissen unterschiedliche Verpackungskosten pro Stuck an.


Archive | 2012

Lösungen zu den Klausuren

Udo Bankhofer; Jürgen Vogel

In der nachfolgenden Losung zur Klausur sind in den einzelnen Losungsteilen auch die jeweils erreichbaren Punkte angegeben.


Archive | 2012

Aufgaben zur Datenanalyse

Udo Bankhofer; Jürgen Vogel

In der folgenden Datenmatrix wurden fur funf Champagner die Merkmale Suse, Alkoholgehalt (in Vol. %) und Preis (in Euro) erhoben.


Archive | 2012

Lösungen zur Wahrscheinlichkeitsrechnung

Udo Bankhofer; Jürgen Vogel

G1 sei das zufallige Ereignis, dass mindestens einer der anwesenden Horer am 27. Mai Geburtstag hat. G0 sei das dazu komplementare Ereignis, dass an diesem Tag keiner Geburtstag hat.


Archive | 2012

Aufgaben zur schließenden Statistik

Udo Bankhofer; Jürgen Vogel

Es sei X1 ,X2 ,X3 ,X4 eine mathematische Stichprobe zu einem Merkmal X mit Erwartungswert μ und Varianz σ 2 .


Archive | 2012

Aufgaben zum Data Mining

Udo Bankhofer; Jürgen Vogel

Die folgende Tabelle enthalt die Daten von 15 Bestellungen, die von verschiedenen Gasten eines Cafes aufgegeben wurden. Dabei bezeichnet ein Wert von 1, dass der Artikel in der entsprechenden Bestellung enthalten war, wahrend ein Wert von 0 angibt, dass dieser Artikel in der Bestellung nicht enthalten war.

Collaboration


Dive into the Udo Bankhofer's collaboration.

Top Co-Authors

Avatar

Jürgen Vogel

Technische Universität Ilmenau

View shared research outputs
Top Co-Authors

Avatar

Dieter William Joenssen

Technische Universität Ilmenau

View shared research outputs
Researchain Logo
Decentralizing Knowledge