Researchain Logo Researchain
  • Decentralized Journals

    A

    Archives
  • Avatar
    Welcome to Researchain!
    Feedback Center
Decentralized Journals
A
Archives Updated
Archive Your Research
Computation and Language

A Bayesian hybrid method for context-sensitive spelling correction

Andrew R. Golding

Abstract
Two classes of methods have been shown to be useful for resolving lexical ambiguity. The first relies on the presence of particular words within some distance of the ambiguous target word; the second uses the pattern of words and part-of-speech tags around the target word. These methods have complementary coverage: the former captures the lexical ``atmosphere'' (discourse topic, tense, etc.), while the latter captures local syntax. Yarowsky has exploited this complementarity by combining the two methods using decision lists. The idea is to pool the evidence provided by the component methods, and to then solve a target problem by applying the single strongest piece of evidence, whatever type it happens to be. This paper takes Yarowsky's work as a starting point, applying decision lists to the problem of context-sensitive spelling correction. Decision lists are found, by and large, to outperform either component method. However, it is found that further improvements can be obtained by taking into account not just the single strongest piece of evidence, but ALL the available evidence. A new hybrid method, based on Bayesian classifiers, is presented for doing this, and its performance improvements are demonstrated.
Full PDF
Related Researches

Some Ontological Principles for Designing Upper Level Lexical Resources
by Nicola Guarino
A Comparison of WordNet and Roget's Taxonomy for Measuring Semantic Similarity
by Michael Mc Hale
Towards an implementable dependency grammar
by Timo Jarvinen
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
by Philip Resnik
Indexing with WordNet synsets can improve Text Retrieval
by Julio Gonzalo
An Empirical Evaluation of Probabilistic Lexicalized Tree Insertion Grammars
by Rebecca Hwa
A Variant of Earley Parsing
by Mark-Jan Nederhof
Segregatory Coordination and Ellipsis in Text Generation
by James Shaw
Spotting Prosodic Boundaries in Continuous Speech in French
by V. Pagel
Error-Driven Pruning of Treebank Grammars for Base Noun Phrase Identification
by Claire Cardie
Separating Surface Order and Syntactic Relations in a Dependency Grammar
by Norbert Broeker
Partial Evaluation for Efficient Access to Inheritance Lexicons
by Sven Hartrumpf
Primitive Part-of-Speech Tagging using Word Length and Sentential Structure
by Simon Cozens
Letter to Sound Rules for Accented Lexicon Compression
by V. Pagel
How to define a context-free backbone for DGs: Implementing a DG in the LFG formalism
by Norbert Broeker
Deriving the Predicate-Argument Structure for a Free Word Order Language
by Cem Bozsahin
Some Properties of Preposition and Subordinate Conjunction Attachments
by Alexander S. Yeh
Isometric Lineation in English Texts: An Empirical and Mathematical Examination of its Character and Consequences
by Hideaki Aoyama
Combining Expression and Content in Domains for Dialog Managers
by Bernd Ludwig
Word Length Frequency and Distribution in English: Observations, Theory, and Implications for the Construction of Verse Lines
by Hideaki Aoyama
Evaluating a Focus-Based Approach to Anaphora Resolution
by Saliha Azzam
Improving Data Driven Wordclass Tagging by System Combination
by Hans van Halteren
Character design for soccer commmentary
by Kim Binsted
Statistical Models for Unsupervised Prepositional Phrase Attachment
by Adwait Ratnaparkhi
Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text
by Davide Turcato

  • «
  • 1
  • 2
  • 3
  • 4
  • »
Submitted on 3 Jun 1996 Updated

arXiv.org Original Source
NASA ADS
Google Scholar
Semantic Scholar
How Researchain Works
Researchain Logo
Decentralizing Knowledge