hadoop
Here are 3,334 public repositories matching this topic...
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
May 24, 2024 - Java
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
-
Updated
May 24, 2024 - Java
-
Updated
May 23, 2024 - Java
Scalable data processing pipelines in JavaScript
-
Updated
May 24, 2024 - TypeScript
Management and automation platform for Stateful Distributed Systems
-
Updated
May 23, 2024 - Java
Apache Ignite
-
Updated
May 23, 2024 - Java
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
-
Updated
May 23, 2024 - Java
Scalable, redundant, and distributed object store for Apache Hadoop
-
Updated
May 23, 2024 - Java
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
-
Updated
May 23, 2024 - Shell
Smart Automation Tool for building modern Data Lakes and Data Pipelines
-
Updated
May 23, 2024 - Scala
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
-
Updated
May 24, 2024 - Scala
-
Updated
May 23, 2024 - R
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."