Jonathan K. Kummerfeld
University of California, Berkeley
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jonathan K. Kummerfeld.
empirical methods in natural language processing | 2015
Jonathan K. Kummerfeld; Taylor Berg-Kirkpatrick; Daniel Klein
Despite the convexity of structured maxmargin objectives (Taskar et al., 2004; Tsochantaridis et al., 2004), the many ways to optimize them are not equally effective in practice. We compare a range of online optimization methods over a variety of structured NLP tasks (coreference, summarization, parsing, etc) and find several broad trends. First, margin methods do tend to outperform both likelihood and the perceptron. Second, for max-margin objectives, primal optimization methods are often more robust and progress faster than dual methods. This advantage is most pronounced for tasks with dense or continuous-valued features. Overall, we argue for a particularly simple online primal subgradient descent method that, despite being rarely mentioned in the literature, is surprisingly effective in relation to its alternatives.
international world wide web conferences | 2017
Rebecca S. Portnoff; Sadia Afroz; Greg Durrett; Jonathan K. Kummerfeld; Taylor Berg-Kirkpatrick; Damon McCoy; Kirill Levchenko; Vern Paxson
Underground forums are widely used by criminals to buy and sell a host of stolen items, datasets, resources, and criminal services. These forums contain important resources for understanding cybercrime. However, the number of forums, their size, and the domain expertise required to understand the markets makes manual exploration of these forums unscalable. In this work, we propose an automated, top-down approach for analyzing underground forums. Our approach uses natural language processing and machine learning to automatically generate high-level information about underground forums, first identifying posts related to transactions, and then extracting products and prices. We also demonstrate, via a pair of case studies, how an analyst can use these automated approaches to investigate other categories of products and transactions. We use eight distinct forums to assess our tools: Antichat, Blackhat World, Carders, Darkode, Hack Forums, Hell, L33tCrew and Nulled. Our automated approach is fast and accurate, achieving over 80% accuracy in detecting post category, product, and prices.
meeting of the association for computational linguistics | 2017
Youxuan Jiang; Jonathan K. Kummerfeld; Walter S. Lasecki
Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an effective means of expanding natural language datasets, but there has been limited analysis of the trade-offs that arise when designing tasks. In this paper, we present the first systematic study of the key factors in crowdsourcing paraphrase collection. We consider variations in instructions, incentives, data domains, and workflows. We manually analyzed paraphrases for correctness, grammaticality, and linguistic diversity. Our observations provide new insight into the trade-offs between accuracy and diversity in crowd responses that arise as a result of task design, providing guidance for future paraphrase generation procedures.
empirical methods in natural language processing | 2012
Jonathan K. Kummerfeld; David Leo Wright Hall; James R. Curran; Daniel Klein
empirical methods in natural language processing | 2013
Jonathan K. Kummerfeld; Daniel Klein
meeting of the association for computational linguistics | 2010
Jonathan K. Kummerfeld; Jessika Roesner; Tim Dawborn; James Haggerty; James R. Curran; Stephen Clark
meeting of the association for computational linguistics | 2013
Jonathan K. Kummerfeld; Daniel Tse; James R. Curran; Daniel Klein
conference on computational natural language learning | 2011
Jonathan K. Kummerfeld; Mohit Bansal; David Burkett; Daniel Klein
Proceedings of the Australasian Language Technology Association Workshop 2008 | 2008
Jonathan K. Kummerfeld; James R. Curran
Astrophysical Journal Supplement Series | 2013
V. A. Moss; N. M. McClure-Griffiths; Tara Murphy; D. J. Pisano; Jonathan K. Kummerfeld; James R. Curran