Xiao Limin
Chinese Academy of Sciences
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Xiao Limin.
Journal of Computer Science and Technology | 1999
Xiao Limin; Zhu Mingfa
On Dawning-1000, the two-dimension mesh interconnection network enables low-latency, high-bandwidth communication, however, these capabilities have not been realized because of the high processing overhead imposed by existing communication software. Active Messages provide an efficient communication mechanism with small overhead, which may expose the raw capabilities of the underlying hardware. In addition, one of the most promising techniques, user-level communication, is often used to improve the performance of the traditional protocols such as TCP and UDP, and is also adopted in implementing the novel abstractions like Active Messages. Thus a user-level Active Messages model is designed and implemented on Dawning-1000. Preliminary experiments show that the combination of Active Messages mechanism and user-level communication technique is quite efficient in reducing software overhead associated with sending and receiving messages, and in exploiting the capabilities of the interconnection network.
multimedia signal processing | 2011
Tao Yuan; Zhu Mingfa; Xiao Limin; Ruan Li; Dongyi Guan; Siming Chen; Ding Yi
The single precision in the computer is composed of two parts: the mantissa and the exponent. which are expressed by the limited binary bits. During adding on the single precision, the smaller one should shift to line up the decimal points, If the mantissa of the smaller one exceeds the range of registers, truncating or rounding off will be executed and cause losing precision. As far as the serials of GTX200 is concerned, the length of the register storing intermediate results is the same as the one for the final results. The accuracy problem is very prominent while the length of the mantissa exceeds the range of register after lining up the decimal points during the single precision adding. In this paper, we use the partial sum algorithm to improve the accuracy of single precision adding, and verify the correctness of the algorithm from the perspective of experiment by means of the matrix multiplication. Finally, we analyze the effect of partial sum algorithm on compute peak of the GPU and come to the conclusion that the partial sum algorithm has little influence on the compute peak of the GPU.
Archive | 2013
Xiao Limin; Mao Hong; Zhu Mingfa; Ruan Li; Hu Shengqiu
Archive | 2013
Zhu Mingfa; Wang Haiyan; Zhang Zhenzhong; Xiao Limin; Ruan Li
Archive | 2013
Zhu Mingfa; Zhang Wei; Xu Wei; Liu Jiajun; Xiao Limin; Ruan Li
Archive | 2013
Zhu Mingfa; Hu Shengqiu; Xiao Limin; Ruan Li; Mao Hong
Archive | 2014
Ruan Li; Xiao Limin; Xu Wei
Archive | 2015
Ruan Li; Xiao Limin; Zhu Mingfa
Archive | 2014
Ruan Li; Xiao Limin; Zhu Mingfa
Archive | 2013
Xiao Limin; Cheng Xianchu; Zhang Zhenzhong; Lin Bo; Qin Jingchao; Liu Yuhang