Richard Rohwer | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Richard Rohwer is active.

Explore More

Publication

Featured researches published by Richard Rohwer.

conference on computational natural language learning | 2005

New Experiments in Distributional Representations of Synonymy

Dayne Freitag; Matthias Blume; John Byrnes; Edmond D. Chow; Sadik Kapadia; Richard Rohwer; Zhiqiang Wang

Recent work on the problem of detecting synonymy through corpus analysis has used the Test of English as a Foreign Language (TOEFL) as a benchmark. However, this test involves as few as 80 questions, prompting questions regarding the statistical significance of reported results. We overcome this limitation by generating a TOEFL-like test using WordNet, containing thousands of questions and composed only of words occurring with sufficient corpus frequency to support sound distributional comparisons. Experiments with this test lead us to a similarity measure which significantly outperforms the best proposed to date. Analysis suggests that a strength of this measure is its relative robustness against polysemy.

ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems | 2006

Dynamic decentralized any-time hierarchical clustering

H. Van Dyke Parunak; Richard Rohwer; Theodore C. Belding; Sven Brueckner

Hierarchical clustering is used widely to organize data and search for patterns. Previous algorithms assume that the body of data being clustered is fixed while the algorithm runs, and use centralized data representations that make it difficult to scale the process by distributing it across multiple processors. Self-Organizing Data and Search (SODAS) inspired by the decentralized algorithms that ants use to sort their nests, relaxes these constraints. SODAS can maintain a hierarchical structure over a continuously changing collection of leaves, requiring only local computations at the nodes of the hierarchy and thus permitting the system to scale arbitrarily by distributing nodes (and their processing) across multiple computers.

ieee aerospace conference | 2005

Text Modeling for Real-Time Document Categorization

John Byrnes; Richard Rohwer

We report on experiments in adapting document categorization techniques to provide for implementation in high-speed hardware. Because resources are scarce, it is important to have a small set of robust and maximally informative variables over which learning can occur. We generate variables using information-theoretic clustering. The resulting performance is on par with general-purpose computing implementations which are able to take advantage of large amounts of time and memory. We conclude that custom high-speed hardware for document categorization can be made very accurate. We also believe that some of the strengths of information-theoretic data analysis techniques are brought out

ieee aerospace conference | 2006

An architecture for streaming coclustering in high speed hardware

John Byrnes; Richard Rohwer

We seek to learn the semantics of a data stream at optical line speed. We focus on text data, but the techniques developed should apply to broad modalities of network data wherever appropriate features can be computed rapidly enough. We consider a custom hardware system designed to categorize documents based on feature clusters and document clusters that have been learned offline on standard general-purpose computers, and we present a technique for extending this system to permit online learning from arbitrarily large data sets

international conference on information technology: new generations | 2010

Simmered Greedy Optimization for Co-clustering

Sadik Kapadia; Richard Rohwer

We present a fast yet highly effective stochastic algorithm, Simmered Greedy Optimization (SG(N)) for solving the co-clustering problem: to simultaneously cluster two finite sets by maximizing the mutual information between the clusterings. (Clustering one set by this criterion is a special case.) This is a combinatorial optimization problem of great interest for deriving maximally predictive feature sets. Co-clustering has found applications in many areas, particularly statistical natural language processing and bioinformatics. We report results of tests on a suite of statistical natural language problems, comparing SG(N) with simulated annealing and a publicly available implementation of co-clustering. In all cases we obtain superior results with far less computation using SG(N).

Archive | 2005

Method and apparatus for automatic entity disambiguation

Matthias Blume; Richard Calmbach; Dayne Freitag; Richard Rohwer; Scott M. Zoldi

Archive | 2008

Soft Co-Clustering of Data

Frank Wall Elliott; Richard Rohwer; Stephen C. Jones; George J. Tucker; Christopher J. Kain; Craig N. Weidert

north american chapter of the association for computational linguistics | 2004

Towards full automation of lexicon construction

Richard Rohwer; Dayne Freitag

adaptive agents and multi agents systems | 2013

Characterizing and aggregating agent estimates

H. Van Dyke Parunak; Sven Brueckner; Lu Hong; Scott E. Page; Richard Rohwer

Emergent Information Technologies and Enabling Policies for Counter-Terrorism | 2005

Information Processing at Very High Speed Data Ingestion Rates

J. Brian Sharkey; Doyle Weishar; John W. Lockwood; Ron Loui; Richard Rohwer; John Byrnes; Krishna R. Pattipati; Stephen G. Eick; David Bruce Cousins; Michael John Nicoletti

Explore More