Franz Pernkopf | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Franz Pernkopf is active.

Explore More

Publication

Featured researches published by Franz Pernkopf.

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2005

Genetic-based EM algorithm for learning Gaussian mixture models

Franz Pernkopf; Djamel Bouchaffra

We propose a genetic-based expectation-maximization (GA-EM) algorithm for learning Gaussian mixture models from multivariate data. This algorithm is capable of selecting the number of components of the model using the minimum description length (MDL) criterion. Our approach benefits from the properties of genetic algorithms (GA) and the EM algorithm by combination of both into a single procedure. The population-based stochastic search of the GA explores the search space more thoroughly than the EM method. Therefore, our algorithm enables escaping from local optimal solutions since the algorithm becomes less sensitive to its initialization. The GA-EM algorithm is elitist which maintains the monotonic convergence property of the EM algorithm. The experiments on simulated and real data show that the GA-EM outperforms the EM method since: (1) we have obtained a better MDL score while using exactly the same termination condition for both algorithms; (2) our approach identifies the number of components which were used to generate the underlying data more often than the EM algorithm.

Ndt & E International | 2003

Image acquisition techniques for automatic visual inspection of metallic surfaces

Franz Pernkopf; Paul O'Leary

This paper provides an overview of three different image acquisition approaches for automatic visual inspection of metallic surfaces. The first method is concerned with gray-level intensity imaging, whereby the most commonly employed lighting techniques are surveyed. Subsequently, two range imaging techniques are introduced which may succeed in contrast to intensity imaging if the reflection property across the intact surface changes. However, range imaging for surface inspection is restricted to surface defects with three-dimensional characteristics, e.g. cavities. One range imaging approach is based on light sectioning in conjunction with fast imaging sensors. The second introduced range imaging technique is photometric stereo.

Neurocomputing | 2012

Sparse nonnegative matrix factorization with ℓ 0 -constraints

Robert Peharz; Franz Pernkopf

Although nonnegative matrix factorization (NMF) favors a sparse and part-based representation of nonnegative data, there is no guarantee for this behavior. Several authors proposed NMF methods which enforce sparseness by constraining or penalizing the ℓ1-norm of the factor matrices. On the other hand, little work has been done using a more natural sparseness measure, the ℓ0-pseudo-norm. In this paper, we propose a framework for approximate NMF which constrains the ℓ0-norm of the basis matrix, or the coefficient matrix, respectively. For this purpose, techniques for unconstrained NMF can be easily incorporated, such as multiplicative update rules, or the alternating nonnegative least-squares scheme. In experiments we demonstrate the benefits of our methods, which compare to, or outperform existing approaches.

Pattern Recognition | 2005

Bayesian network classifiers versus selective k-NN classifier

Franz Pernkopf

In this paper Bayesian network classifiers are compared to the k-nearest neighbor (k-NN) classifier, which is based on a subset of features. This subset is established by means of sequential feature selection methods. Experimental results on classifying data of a surface inspection task and data sets from the UCI repository show that Bayesian network classifiers are competitive with selective k-NN classifiers concerning classification accuracy. The k-NN classifier performs well in the case where the number of samples for learning the parameters of the Bayesian network is small. Bayesian network classifiers outperform selective k-NN methods in terms of memory requirements and computational demands. This paper demonstrates the strength of Bayesian networks for classification.

international conference on machine learning | 2005

Discriminative versus generative parameter and structure learning of Bayesian network classifiers

Franz Pernkopf; Jeff A. Bilmes

In this paper, we compare both discriminative and generative parameter learning on both discriminatively and generatively structured Bayesian network classifiers. We use either maximum likelihood (ML) or conditional maximum likelihood (CL) to optimize network parameters. For structure learning, we use either conditional mutual information (CMI), the explaining away residual (EAR), or the classification rate (CR) as objective functions. Experiments with the naive Bayes classifier (NB), the tree augmented naive Bayes classifier (TAN), and the Bayesian multinet have been performed on 25 data sets from the UCI repository (Merz et al., 1997) and from (Kohavi & John, 1997). Our empirical study suggests that discriminative structures learnt using CR produces the most accurate classifiers on almost half the data sets. This approach is feasible, however, only for rather small problems since it is computationally expensive. Discriminative parameter learning produces on average a better classifier than ML parameter learning.

IEEE Transactions on Audio, Speech, and Language Processing | 2011

A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

Michael Wohlmayr; Michael Stark; Franz Pernkopf

We present a simple and efficient feature modeling approach for tracking the pitch of two simultaneously active speakers. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. To obtain a probabilistic representation for the speech mixture spectrogram features of both speakers, we employ the mixture maximization model (MIXMAX) and, as an alternative, a linear interaction model. A factorial hidden Markov model is applied for tracking pitch over time. This statistical model can be used for applications beyond speech, whenever the interaction between individual sources can be represented as MIXMAX or linear model. For tracking, we use the loopy max-sum algorithm, and provide empirical comparisons to exact methods. Furthermore, we discuss a scheduling mechanism of loopy belief propagation for online tracking. We demonstrate experimental results using Mocha-TIMIT as well as data from the speech separation challenge provided by Cooke We show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features. Using speaker-dependent models, the proposed method improves the accuracy of correct speaker assignment, which is important for single-channel speech separation. In particular, we are able to reduce the overall tracking error by 51% relative for the speaker-dependent case. Moreover, we use the estimated pitch trajectories to perform single-channel source separation, and demonstrate the beneficial effect of correct speaker assignment on speech separation performance.

Pattern Analysis and Applications | 2004

Detection of surface defects on raw steel blocks using Bayesian network classifiers

Franz Pernkopf

This paper proposes an approach that detects surface defects with three-dimensional characteristics on scale-covered steel blocks. The surface reflection properties of the flawless surface changes strongly. Light sectioning is used to acquire the surface range data of the steel block. These sections are arbitrarily located within a range of a few millimeters due to vibrations of the steel block on the conveyor. After the recovery of the depth map, segments of the surface are classified according to a set of extracted features by means of Bayesian network classifiers. For establishing the structure of the Bayesian network, a floating search algorithm is applied, which achieves a good tradeoff between classification performance and computational efficiency for structure learning. This search algorithm enables conditional exclusions of previously added attributes and/or arcs from the network. The experiments show that the selective unrestricted Bayesian network classifier outperforms the naïve Bayes and the tree-augmented naïve Bayes decision rules concerning the classification rate. More than 98% of the surface segments have been classified correctly.

IEEE Transactions on Audio, Speech, and Language Processing | 2011

Source–Filter-Based Single-Channel Speech Separation Using Pitch Information

Michael Stark; Michael Wohlmayr; Franz Pernkopf

In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2012

Maximum Margin Bayesian Network Classifiers

Franz Pernkopf; Michael Wohlmayr; Sebastian Tschiatschek

We present a maximum margin parameter learning algorithm for Bayesian network classifiers using a conjugate gradient (CG) method for optimization. In contrast to previous approaches, we maintain the normalization constraints on the parameters of the Bayesian network during optimization, i.e., the probabilistic interpretation of the model is not lost. This enables us to handle missing features in discriminatively optimized Bayesian networks. In experiments, we compare the classification performance of maximum margin parameter learning to conditional likelihood and maximum likelihood learning approaches. Discriminative parameter learning significantly outperforms generative maximum likelihood estimation for naive Bayes and tree augmented naive Bayes structures on all considered data sets. Furthermore, maximizing the margin dominates the conditional likelihood approach in terms of classification performance in most cases. We provide results for a recently proposed maximum margin optimization approach based on convex relaxation [1]. While the classification results are highly similar, our CG-based optimization is computationally up to orders of magnitude faster. Margin-optimized Bayesian network classifiers achieve classification performance comparable to support vector machines (SVMs) using fewer parameters. Moreover, we show that unanticipated missing feature values during classification can be easily processed by discriminatively optimized Bayesian network classifiers, a case where discriminative classifiers usually require mechanisms to complete unknown feature values in the data first.

EURASIP Journal on Advances in Signal Processing | 2002

Visual inspection of machined metallic high-precision surfaces

Franz Pernkopf; Paul O'Leary

This paper presents a surface inspection prototype of an automatic system for precision ground metallic surfaces, in this case bearing rolls. The surface reflectance properties are modeled and verified with optical experiments. The aim being to determine the optical arrangement for illumination and observation, where the contrast between errors and intact surface is maximized. A new adaptive threshold selection algorithm for segmentation is presented. Additionally, is included an evaluation of a large number of published sequential search algorithms for selection of the best subset of features for the classification with a comparison of their computational requirements. Finally, the results of classification for 540 flaw images are presented.

Explore More