Publication


Featured research published by John R. Nickolls.


International Conference on Computer Graphics and Interactive Techniques | 2008

Scalable Parallel Programming with CUDA

John R. Nickolls; Ian Buck; Michael Garland; Kevin Skadron

Presents a collection of slides covering the following topics: CUDA parallel programming model; CUDA toolkit and libraries; performance optimization; and application development.


International Symposium on Microarchitecture | 2008

NVIDIA Tesla: A Unified Graphics and Computing Architecture

Erik Lindholm; John R. Nickolls; Stuart F. Oberman; John S. Montrym

To enable flexible, programmable graphics and high-performance computing, NVIDIA has developed the Tesla scalable unified graphics and parallel computing architecture. Its scalable parallel array of processors is massively multithreaded and programmable in C or via graphics APIs.


International Symposium on Microarchitecture | 2010

The GPU Computing Era

John R. Nickolls; William J. Dally

GPU computing is at a tipping point, becoming more widely used in demanding consumer applications and high-performance computing. This article describes the rapid evolution of GPU architectures, from graphics processors to massively parallel many-core multiprocessors; recent developments in GPU computing architectures; and how the enthusiastic adoption of CPU+GPU coprocessing is accelerating parallel applications.


International Symposium on Microarchitecture | 2008

Parallel Computing Experiences with CUDA

Michael Garland; S. Le Grand; John R. Nickolls; J. Anderson; J. Hardwick; S. Morton; E. Phillips; Yao Zhang; V. Volkov

The CUDA programming model provides a straightforward means of describing inherently parallel computations, and NVIDIA's Tesla GPU architecture delivers high computational throughput on massively parallel problems. This article surveys experiences gained in applying CUDA to a diverse set of problems, and the parallel speedups over sequential codes running on traditional CPU architectures that were attained by executing key computations on the GPU.


Archive | 2007

Virtual architecture and instruction set for parallel thread computing

John R. Nickolls; Henry Packard Moreton; Lars Nyland; Ian Buck; Richard Craig Johnson; Robert Steven Glanville; Jayant B. Kolhe


Archive | 2006

Atomic memory operators in a parallel processor

Ian Buck; John R. Nickolls; Michael C. Shebanow; Lars Nyland


Archive | 2006

Register based queuing for texture requests

John Erik Lindholm; John R. Nickolls; Simon Moy; Brett W. Coon


Archive | 2005

Synchronization of threads in a cooperative thread array

John R. Nickolls; Stephen D. Lew; Brett W. Coon; Peter C. Mills


Archive | 2005

Parallel data processing systems and methods using cooperative thread arrays and thread identifier values to determine processing behavior

John R. Nickolls; Stephen D. Lew


Archive | 2005

Parallel data processing systems and methods using cooperative thread arrays and SIMD instruction issue

Bryon S. Nordquist; John R. Nickolls; Luis I. Bacayo
