Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Gary Kacmarcik is active.

Publication


Featured researches published by Gary Kacmarcik.


empirical methods in natural language processing | 2005

MindNet: An Automatically-Created Lexical Resource

Lucy Vanderwende; Gary Kacmarcik; Hisami Suzuki; Arul Menezes

We will demonstrate MindNet, a lexical resource built automatically by processing text. We will present two forms of MindNet: as a static lexical resource, and, as a toolkit which allows MindNets to be built from arbitrary text. We will also introduce a web-based interface to MindNet lexicons (MNEX) that is intended to make the data contained within MindNets more accessible for exploration. Both English and Japanese MindNets will be shown and will be made available, through MNEX, for research purposes.


international conference on computational linguistics | 2000

Using a broad-coverage parser for word-breaking in Japanese

Hisami Suzuki; Chris Brockett; Gary Kacmarcik

We describe a method of word segmentation in Japanese in which a broad-coverage parser selects the best word sequence while producing a syntactic analysis. This technique is substantially different from traditional statistics- or heuristics-based models which attempt to select the best word sequence before handing it to the syntactic component. By breaking up the task of finding the best word sequence into the identification of words (in the word-breaking component) and the selection of the best sequence (a by-product of parsing), we have been able to simplify the task of each component and achieve high accuracy over a wide variety of data. Word-breaking accuracy of our system is currently around 97-98%.


international conference on computational linguistics | 2000

Robust segmentation of Japanese text into a lattice for parsing

Gary Kacmarcik; Chris Brockett; Hisami Suzuki

We describe a segmentation component that utilizes minimal syntactic knowledge to produce a lattice of word candidates for a broad coverage Japanese NL parser. The segmenter is a finite state morphological analyzer and text normalizer designed to handle the orthographic variations characteristic of written Japanese, including alternate spellings, script variation, vowel extensions and word-internal parenthetical material. This architecture differs from conventional Japanese wordbreakers in that it does not attempt to simultaneously attack the problems of identifying segmentation candidates and choosing the most probable analysis. To minimize duplication of effort between components and to give the segmenter greater freedom to address orthography issues, the task of choosing the best analysis is handled by the parser, which has access to a much richer set of linguistic information. By maximizing recall in the segmenter and allowing a precision of 34.7%, our parser currently achieves a breaking accuracy of ~97% over a wide variety of corpora.


NLPRS | 2001

Automatically Harvesting Katakana-English Term Pairs from Search Engine Query Logs.

Eric D. Brill; Gary Kacmarcik; Chris Brockett


Archive | 2006

Semantic annotations for virtual objects

Gary Kacmarcik


Archive | 2007

Web-based proofing and usage guidance

Chris Brockett; William B. Dolan; Michael Gamon; Jianfeng Gao; Lucy Vanderwende; Hsiao-Wen Hon; Ming Zhou; Gary Kacmarcik; Alexandre Klementiev


meeting of the association for computational linguistics | 2006

Obfuscating Document Stylometry to Preserve Author Anonymity

Gary Kacmarcik; Michael Gamon


Archive | 2006

Obfuscating document stylometry

Gary Kacmarcik; Michael Gamon


Archive | 2006

Dynamic interaction menus from natural language representations

Gary Kacmarcik


Archive | 2000

Method for segmenting non-segmented text using syntactic parse

Chris Brockett; Gary Kacmarcik; Hisami Suzuki

Collaboration


Dive into the Gary Kacmarcik's collaboration.

Researchain Logo
Decentralizing Knowledge