#

nccl

Here are 29 public repositories matching this topic...

MurrellGroup / Conflux.jl

Single-node data parallelism in Julia with CUDA

flux julia cuda data-parallelism nccl

Updated May 6, 2024
Julia

melanie-t27 / 2024-EUMaster4HPC-Student-Challenge

EUMaster4HPC student challenge group 7 - EuroHPC Summit 2024 Antwerp

optimization scalability openmp mpi cuda efficiency conjugate-gradient cg parallel-programming nccl meluxina

Updated Apr 14, 2024
Cuda

sub-mod / nccl-builds

nccl built on centos6

Updated Mar 11, 2020

GPU-Blood-Cell-Simulation / Simulation-Server

Blood Cell Simulation server

opengl simulation hpc metaprogramming parallel-computing cuda nccl

Updated Jan 29, 2024
C++

SquareFactory / ml-default

Default Docker image used to run experiments on csquare.run.

python machine-learning deep-learning mxnet tensorflow pytorch pyspark train cudnn nccl horovod torchvision isquare csquare isquare-train

Updated Mar 6, 2023
Dockerfile

dereklstinson / nccl

golang wrapper for nccl

go deep-learning parallel-computing cuda nccl

Updated Nov 18, 2019
Go

lcskrishna / nccl-rccl-parser

Tool to run rccl-tests/nccl-tests based on from an application and gather performance.

Updated Nov 21, 2020
Python

lancelee82 / necklace

Distributed deep learning framework based on pytorch/numba/nccl and zeromq.

deep-learning mxnet pytorch distributed numba zerorpc distributed-deep-learning distributed-training nccl

Updated Aug 10, 2023
Python

rohwid / auto-nvidia-cuda-driver

Installation script to install Nvidia driver and CUDA automatically in Ubuntu

cuda ubuntu1604 cudnn bash-scripting nvidia-driver nccl ubuntu1804

Updated Apr 24, 2022
Shell

muriloboratto / hands-on-supercomputing-with-parallel-computing

Hands-on Labs in Parallel Computing

openmp mpi cuda nccl cuda-aware-mpi

Updated Aug 11, 2023
Jupyter Notebook

rodhuega / tfgMatrixNccl

Librería de operaciones matemáticas con matrices multi-gpu utilizando Nvidia NCCL.

gpu cuda cpp11 nvidia gpu-computing nccl gpu-programming

Updated Sep 9, 2020
Cuda

radix-ai / scipy-notebook-gpu

jupyter/scipy-notebook with CUDA Toolkit, cuDNN, NCCL, and TensorRT

docker tensorflow cuda cudnn tensorrt nccl scipy-notebook

Updated Jul 15, 2019
Dockerfile

lancelee82 / pynccl

Nvidia NCCL2 Python bindings using ctypes and numba.

python ctypes numba nccl

Updated Jun 28, 2021
Python

asprenger / distributed-training-patterns

Experiments with low level communication patterns that are useful for distributed training.

tensorflow mpi mpi4py distributed-training nccl horovod

Updated Nov 14, 2018
Python

UCBerkeley-Spring2022-CS267-project / blinkplus

Blink+: Increase GPU group bandwidth by utilizing across tenant NVLink.

gpu nccl collective-communication

Updated Jun 22, 2022
Jupyter Notebook

YinLiu-91 / ncclOperationPlus

use ncclSend ncclRecv realize ncclSendrecv ncclGather ncclScatter ncclAlltoall

cplusplus cpp mpi cuda nccl ncclsendrecv ncclgather ncclscatter ncclalltoall

Updated Mar 1, 2022
Cuda

openhackathons-org / nways_multi_gpu

N-Ways to Multi-GPU Programming

hpc mpi cuda nccl nvshmem nsight-systems

Updated Apr 4, 2023
C

1duo / nccl-examples

NCCL Examples from Official NVIDIA NCCL Developer Guide.

distributed-systems deep-learning nvidia nccl

Updated May 29, 2018
CMake

BaguaSys / bagua-net

High performance NCCL plugin for Bagua.

distributed-computing nccl bagua

Updated Sep 15, 2021
Rust

muriloboratto / NCCL

Sample examples of how to call collective operation functions on multi-GPU environments. A simple example of using broadcast, reduce, allGather, reduceScatter and sendRecv operations.

mpi cuda nccl laplacian-matrix

Updated Aug 28, 2023

Improve this page

Add a description, image, and links to the nccl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nccl topic, visit your repo's landing page and select "manage topics."