Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Jonathan K. Kummerfeld is active.

Publication


Featured researches published by Jonathan K. Kummerfeld.


empirical methods in natural language processing | 2015

An Empirical Analysis of Optimization for Max-Margin NLP

Jonathan K. Kummerfeld; Taylor Berg-Kirkpatrick; Daniel Klein

Despite the convexity of structured maxmargin objectives (Taskar et al., 2004; Tsochantaridis et al., 2004), the many ways to optimize them are not equally effective in practice. We compare a range of online optimization methods over a variety of structured NLP tasks (coreference, summarization, parsing, etc) and find several broad trends. First, margin methods do tend to outperform both likelihood and the perceptron. Second, for max-margin objectives, primal optimization methods are often more robust and progress faster than dual methods. This advantage is most pronounced for tasks with dense or continuous-valued features. Overall, we argue for a particularly simple online primal subgradient descent method that, despite being rarely mentioned in the literature, is surprisingly effective in relation to its alternatives.


international world wide web conferences | 2017

Tools for Automated Analysis of Cybercriminal Markets

Rebecca S. Portnoff; Sadia Afroz; Greg Durrett; Jonathan K. Kummerfeld; Taylor Berg-Kirkpatrick; Damon McCoy; Kirill Levchenko; Vern Paxson

Underground forums are widely used by criminals to buy and sell a host of stolen items, datasets, resources, and criminal services. These forums contain important resources for understanding cybercrime. However, the number of forums, their size, and the domain expertise required to understand the markets makes manual exploration of these forums unscalable. In this work, we propose an automated, top-down approach for analyzing underground forums. Our approach uses natural language processing and machine learning to automatically generate high-level information about underground forums, first identifying posts related to transactions, and then extracting products and prices. We also demonstrate, via a pair of case studies, how an analyst can use these automated approaches to investigate other categories of products and transactions. We use eight distinct forums to assess our tools: Antichat, Blackhat World, Carders, Darkode, Hack Forums, Hell, L33tCrew and Nulled. Our automated approach is fast and accurate, achieving over 80% accuracy in detecting post category, product, and prices.


meeting of the association for computational linguistics | 2017

Understanding Task Design Trade-offs in Crowdsourced Paraphrase Collection.

Youxuan Jiang; Jonathan K. Kummerfeld; Walter S. Lasecki

Linguistically diverse datasets are critical for training and evaluating robust machine learning systems, but data collection is a costly process that often requires experts. Crowdsourcing the process of paraphrase generation is an effective means of expanding natural language datasets, but there has been limited analysis of the trade-offs that arise when designing tasks. In this paper, we present the first systematic study of the key factors in crowdsourcing paraphrase collection. We consider variations in instructions, incentives, data domains, and workflows. We manually analyzed paraphrases for correctness, grammaticality, and linguistic diversity. Our observations provide new insight into the trade-offs between accuracy and diversity in crowd responses that arise as a result of task design, providing guidance for future paraphrase generation procedures.


empirical methods in natural language processing | 2012

Parser Showdown at the Wall Street Corral: An Empirical Investigation of Error Types in Parser Output

Jonathan K. Kummerfeld; David Leo Wright Hall; James R. Curran; Daniel Klein


empirical methods in natural language processing | 2013

Error-Driven Analysis of Challenges in Coreference Resolution

Jonathan K. Kummerfeld; Daniel Klein


meeting of the association for computational linguistics | 2010

Faster Parsing by Supertagger Adaptation

Jonathan K. Kummerfeld; Jessika Roesner; Tim Dawborn; James Haggerty; James R. Curran; Stephen Clark


meeting of the association for computational linguistics | 2013

An Empirical Examination of Challenges in Chinese Parsing

Jonathan K. Kummerfeld; Daniel Tse; James R. Curran; Daniel Klein


conference on computational natural language learning | 2011

Mention Detection: Heuristics for the OntoNotes annotations

Jonathan K. Kummerfeld; Mohit Bansal; David Burkett; Daniel Klein


Proceedings of the Australasian Language Technology Association Workshop 2008 | 2008

Classification of Verb Particle Constructions with the Google Web1T Corpus

Jonathan K. Kummerfeld; James R. Curran


Astrophysical Journal Supplement Series | 2013

HIGH-VELOCITY CLOUDS IN THE GALACTIC ALL SKY SURVEY. I. CATALOG

V. A. Moss; N. M. McClure-Griffiths; Tara Murphy; D. J. Pisano; Jonathan K. Kummerfeld; James R. Curran

Collaboration


Dive into the Jonathan K. Kummerfeld's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Daniel Klein

University of California

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Greg Durrett

University of California

View shared research outputs
Top Co-Authors

Avatar

Jessika Roesner

University of Texas at Austin

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge