Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Wen-King Su is active.

Publication


Featured researches published by Wen-King Su.


international symposium on microarchitecture | 1995

Myrinet: a gigabit-per-second local area network

Nanette J. Boden; Danny Cohen; Robert E. Felderman; Alan E. Kulawik; Charles L. Seitz; Jakov Seizovic; Wen-King Su

The Myrinet local area network employs the same technology used for packet communication and switching within massively parallel processors. In realizing this distributed MPP network, we developed specialized communication channels, cut-through switches, host interfaces, and software. To our knowledge, Myrinet demonstrates the highest performance per unit cost of any current LAN. >


hypercube concurrent computers and applications | 1988

The architecture and programming of the Ametek series 2010 multicomputer

Charles L. Seitz; William C. Athas; Charles M. Flaig; Alain J. Martin; Jakov Seizovic; Craig S. Steele; Wen-King Su

During the period following the completion of the Cosmic Cube experiment [1], and while commercial descendants of this first-generation multicomputer (message-passing concurrent computer) were spreading through a community that includes many of the attendees of this conference, members of our research group were developing a set of ideas about the physical design and programming for the second generation of medium-grain multicomputers. Our principal goal was to improve by as much as two orders of magnitude the relationship between message-passing and computing performance, and also to make the topology of the message-passing network practically invisible. Decreasing the communication latency relative to instruction execution times extends the application span of multicomputers from easily partitioned and distributed problems (eg, matrix computations, PDE solvers, finite element analysis, finite difference methods, distant or local field many-body problems, FFTs, ray tracing, distributed simulation of systems composed of loosely coupled physical processes) to computing problems characterized by “high flux” [2] or relatively fine-grain concurrent formulations [3, 4] (eg, searching, sorting, concurrent data structures, graph problems, signal processing, image processing, and distributed simulation of systems composed of many tightly coupled physical processes). Such applications place heavy demands on the message-passing network for high bandwidth, low latency, and non-local communication. Decreased message latency also improves the efficiency of the class of applications that have been developed on first-generation systems, and the insensitivity of message latency to process placement simplifies the concurrent formulation of application programs. Our other goals included a streamlined and easily layered set of message primitives, a node operating system based on a reactive programming model, open interfaces for accelerators and peripheral devices, and node performance improvements that could be achieved economically by using the same technology employed in contemporary workstation computers. By the autumn of 1986, these ideas had become sufficiently developed, molded together, and tested through simulation to be regarded as a complete architectural design. We were fortunate that the Ametek Computer Research Division was ready and willing to work with us to develop this system as a commercial product. The Ametek Series 2010 multicomputer is the result of this joint effort.


field-programmable custom computing machines | 1998

Scalable network based FPGA accelerators for an automatic target recognition application

Ruth Sivilotti; Young H. Cho; Wen-King Su; Danny Cohen; Brian K. Bray

Image processing, specifically automatic target recognition (ATR) in synthetic aperture radar (SAR) imagery, is an application area that can require tremendous processing throughput. In this application, data comes from high bandwidth sensors, where the processing is time-critical. There is limited space and power for processing the data in the sensor platforms or in battlefield groundstations. DoDs strong push for using commercial-off-the-shelf (COTS) technology, the very high non-recurring engineering (NRE) costs for low volume ASICs, and evolving algorithms limit the feasibility of using custom special purpose hardware. In addition, a scalable system is required as the different sensor platforms have different image pixel rates and different mission requirements have different target recognition throughput needs per pixel. In this paper, we describe an ATR algorithm implementation using FPGA accelerators. We first describe the ATR algorithm that was implemented, the implementation on a single FPGA, how the FPGA nodes are connected to make a scalable system, and compare the performance to current scalable microprocessor-based implementations.


Proceedings of the 1993 symposium on Research on integrated systems | 1993

A family of routing and communication chips based on the Mosaic

Charles L. Seitz; Wen-King Su


Archive | 1988

Variants of the Chandy-Misra-Bryant Distributed Discrete-Event Simulation Algorithm

Wen-King Su; Charles L. Seitz


Proceedings of the 1993 symposium on Research on integrated systems | 1993

The design of the Caltech Mosaic C multicomputer

Charles L. Seitz; Nanette J. Boden; Jakov Seizovic; Wen-King Su


Archive | 1988

The C Programmer''s Abbreviated Guide to Multicomputer Programming

Charles L. Seitz; Jakov Seizovic; Wen-King Su


Archive | 1990

Reactive-process programming and distributed discrete-event simulation

Wen-King Su


Archive | 1995

Myrinet: A Gigabit-per-Second Local

Robert E. Felderman; Alan E. Kulawik; Charles L. Seitz; M. Seizovic; Wen-King Su


Archive | 1988

Message-passing concurrent computers: their architecture and programming

Charles L. Seitz; William C. Athas; William J. Dally; Reese Faucette; Alain J. Martin; Sven Mattisson; Craig S. Steele; Wen-King Su

Collaboration


Dive into the Wen-King Su's collaboration.

Top Co-Authors

Avatar

Charles L. Seitz

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Jakov Seizovic

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Alain J. Martin

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Craig S. Steele

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Danny Cohen

Information Sciences Institute

View shared research outputs
Top Co-Authors

Avatar

Reese Faucette

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

William C. Athas

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar

Brian K. Bray

Sandia National Laboratories

View shared research outputs
Top Co-Authors

Avatar

Charles M. Flaig

California Institute of Technology

View shared research outputs
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge