Data sources used by the Big Data Innovation Team
Updated Jun 6, 2024 - Jupyter Notebook
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
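A minimal sketch of what "name-indexed" processing means, in plain Python rather than Miller's own CLI: records are addressed by field name instead of column position, so a cut-then-sort step survives column reordering. The sample data is invented for illustration.

```python
import csv
import io

# Hypothetical CSV input; Miller-style tools treat each row as a record
# keyed by header name, not by column index.
raw = "name,age\nalice,34\nbob,29\n"

rows = list(csv.DictReader(io.StringIO(raw)))

# "cut" down to one field and "sort" by it, referencing the field by name
ages = sorted(int(r["age"]) for r in rows)
print(ages)  # [29, 34]
```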
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
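A generic sketch of the extract-transform-load shape such a streaming framework automates; this uses plain Python generators and is not the library's API, and the event fields are made up for illustration.

```python
# Each stage consumes a stream and yields a stream, so records flow
# through one at a time instead of being materialized in full.
def extract(events):
    for e in events:                 # source stage: emit raw events
        yield e

def transform(stream):
    for e in stream:                 # drop invalid events, normalise the value
        if e.get("value") is not None:
            yield {"id": e["id"], "value": float(e["value"])}

def load(stream, sink):
    for e in stream:                 # terminal stage: write to the sink
        sink.append(e)

sink = []
events = [{"id": 1, "value": "3.5"}, {"id": 2, "value": None}]
load(transform(extract(events)), sink)
print(sink)  # [{'id': 1, 'value': 3.5}]
```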
SQL-like interface to tabular structured data
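To illustrate what a SQL-like interface over tabular data buys you, here is a minimal sketch using an in-memory SQLite table as a stand-in; the table name and rows are invented for illustration.

```python
import sqlite3

# An in-memory table standing in for arbitrary tabular data,
# queried declaratively through SQL instead of hand-written loops.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (sensor TEXT, value REAL)")
conn.executemany(
    "INSERT INTO readings VALUES (?, ?)",
    [("a", 1.5), ("b", 2.5), ("a", 3.0)],
)

totals = conn.execute(
    "SELECT sensor, SUM(value) FROM readings GROUP BY sensor ORDER BY sensor"
).fetchall()
print(totals)  # [('a', 4.5), ('b', 2.5)]
```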
An intuitive and flexible RDF pipeline solution designed to simplify and automate ETL processes for efficient data management.
CrateDB Toolkit.
Remote Sensing and GIS Software Library; Python modules and tools for processing spatial data.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
The MDSplus data management system
Advanced and Fast Data Transformation in R
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large datasets on high-performance computing infrastructures.
A collection of Python scripts to acquire and process SFDI data in order to measure the optical properties of tissue.
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
A public repository for all things RAG (Retrieval Augmented Generation)
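A toy sketch of the retrieval step in RAG: given a query, pick the stored passage most similar to it, here scored by bag-of-words overlap (real systems use dense embeddings and a vector index; the passages below are invented for illustration).

```python
# Hypothetical document store: a handful of short passages.
passages = [
    "the cat sat on the mat",
    "stream processing handles unbounded data",
]

def overlap(a: str, b: str) -> int:
    # Crude similarity: number of words the two strings share.
    return len(set(a.split()) & set(b.split()))

def retrieve(query: str, passages: list[str]) -> str:
    # Return the passage with the highest overlap with the query;
    # this passage would then be fed to the LLM as grounding context.
    return max(passages, key=lambda p: overlap(query, p))

best = retrieve("what is stream processing", passages)
print(best)  # stream processing handles unbounded data
```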
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programmatically create and apply a data policy to a processing platform like Databricks, Snowflake, or BigQuery (or plain ol' Postgres, even!) with definitions imported from Collibra, Datahub, ODD, and the like.
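The "policy in, dynamic view out" idea can be sketched as compiling a declarative policy into a row transform, analogous to generating a masked view on the target platform; the field names and actions below are invented for illustration, not PACE's actual schema.

```python
# Declarative policy: what to do with each field (assumed vocabulary:
# "allow" passes through, "mask" redacts, anything unlisted is dropped).
policy = {"email": "mask", "age": "allow"}

def apply_policy(row: dict, policy: dict) -> dict:
    out = {}
    for field, value in row.items():
        action = policy.get(field, "drop")
        if action == "allow":
            out[field] = value
        elif action == "mask":
            out[field] = "***"
        # "drop": omit the field entirely
    return out

row = {"email": "a@example.com", "age": 42, "ssn": "123-45-6789"}
view = apply_policy(row, policy)
print(view)  # {'email': '***', 'age': 42}
```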
Kubernetes-native platform to run massively parallel data/streaming jobs