Richard Rohwer
FICO
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Richard Rohwer.
conference on computational natural language learning | 2005
Dayne Freitag; Matthias Blume; John Byrnes; Edmond D. Chow; Sadik Kapadia; Richard Rohwer; Zhiqiang Wang
Recent work on the problem of detecting synonymy through corpus analysis has used the Test of English as a Foreign Language (TOEFL) as a benchmark. However, this test involves as few as 80 questions, prompting questions regarding the statistical significance of reported results. We overcome this limitation by generating a TOEFL-like test using WordNet, containing thousands of questions and composed only of words occurring with sufficient corpus frequency to support sound distributional comparisons. Experiments with this test lead us to a similarity measure which significantly outperforms the best proposed to date. Analysis suggests that a strength of this measure is its relative robustness against polysemy.
ESOA'06 Proceedings of the 4th international conference on Engineering self-organising systems | 2006
H. Van Dyke Parunak; Richard Rohwer; Theodore C. Belding; Sven Brueckner
Hierarchical clustering is used widely to organize data and search for patterns. Previous algorithms assume that the body of data being clustered is fixed while the algorithm runs, and use centralized data representations that make it difficult to scale the process by distributing it across multiple processors. Self-Organizing Data and Search (SODAS) inspired by the decentralized algorithms that ants use to sort their nests, relaxes these constraints. SODAS can maintain a hierarchical structure over a continuously changing collection of leaves, requiring only local computations at the nodes of the hierarchy and thus permitting the system to scale arbitrarily by distributing nodes (and their processing) across multiple computers.
ieee aerospace conference | 2005
John Byrnes; Richard Rohwer
We report on experiments in adapting document categorization techniques to provide for implementation in high-speed hardware. Because resources are scarce, it is important to have a small set of robust and maximally informative variables over which learning can occur. We generate variables using information-theoretic clustering. The resulting performance is on par with general-purpose computing implementations which are able to take advantage of large amounts of time and memory. We conclude that custom high-speed hardware for document categorization can be made very accurate. We also believe that some of the strengths of information-theoretic data analysis techniques are brought out
ieee aerospace conference | 2006
John Byrnes; Richard Rohwer
We seek to learn the semantics of a data stream at optical line speed. We focus on text data, but the techniques developed should apply to broad modalities of network data wherever appropriate features can be computed rapidly enough. We consider a custom hardware system designed to categorize documents based on feature clusters and document clusters that have been learned offline on standard general-purpose computers, and we present a technique for extending this system to permit online learning from arbitrarily large data sets
international conference on information technology: new generations | 2010
Sadik Kapadia; Richard Rohwer
We present a fast yet highly effective stochastic algorithm, Simmered Greedy Optimization (SG(N)) for solving the co-clustering problem: to simultaneously cluster two finite sets by maximizing the mutual information between the clusterings. (Clustering one set by this criterion is a special case.) This is a combinatorial optimization problem of great interest for deriving maximally predictive feature sets. Co-clustering has found applications in many areas, particularly statistical natural language processing and bioinformatics. We report results of tests on a suite of statistical natural language problems, comparing SG(N) with simulated annealing and a publicly available implementation of co-clustering. In all cases we obtain superior results with far less computation using SG(N).
Archive | 2005
Matthias Blume; Richard Calmbach; Dayne Freitag; Richard Rohwer; Scott M. Zoldi
Archive | 2008
Frank Wall Elliott; Richard Rohwer; Stephen C. Jones; George J. Tucker; Christopher J. Kain; Craig N. Weidert
north american chapter of the association for computational linguistics | 2004
Richard Rohwer; Dayne Freitag
adaptive agents and multi agents systems | 2013
H. Van Dyke Parunak; Sven Brueckner; Lu Hong; Scott E. Page; Richard Rohwer
Emergent Information Technologies and Enabling Policies for Counter-Terrorism | 2005
J. Brian Sharkey; Doyle Weishar; John W. Lockwood; Ron Loui; Richard Rohwer; John Byrnes; Krishna R. Pattipati; Stephen G. Eick; David Bruce Cousins; Michael John Nicoletti