datawarehouse

Here are 393 public repositories matching this topic...

DataLinkDC / dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

sql olap flink datawarehouse datalake flinksql flinkcdc real-time-computing-platform

Updated May 23, 2024
Java

10Accademy-InsightStreamInc / Scalable_Datawarehouse_Amharic_Data_Ingestion_For_LLM_RAG

Star

The project aims to enhance NLP capabilities for Amharic Language by developing a data corpus for various NLP applications. The project involves collecting, cleaning, processing data, developing APIs, and automating the pipeline.

nlp scraper amharic datawarehouse rag llms

Updated May 23, 2024
Python

hydradatabase / hydra

Sponsor

Star

Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.

postgres postgresql data-warehouse datawarehouse postgresql-extension

Updated May 21, 2024
C

jitsucom / bulker

Star

Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)

pipeline etl data-engineering ingestion datawarehouse etl-pipeline

Updated May 21, 2024
Go

ali-bin-kashif / etl-pipeline-project-cola-next

Star

Developed a robust ETL pipeline for Next Cola Pvt. Ltd data which extracts data from many different OLTP sources, converts them into dimensions and facts and load into datawarehouse for analytical workload.

python aws airflow amazon-ec2 datawarehouse amazon-s3 dimensional-modeling

Updated May 21, 2024
Python

Rajsingh92 / MUST_HAVE_SKILLS

Star

This repo consists of all important concepts for data engineers.

python aws airflow tutorial sql big-data spark cassandra mongodb hadoop hbase nifi python-tutorial datawarehouse

Updated May 20, 2024
Java

Priyush02K / BE-CSE

Star

Computer Science and Engineering (CSE) is a multidisciplinary field that combines elements of computer science and computer engineering to design, develop, and maintain computer systems and software. It is a rapidly evolving field that plays a crucial role in shaping the modern world.

android mobile notes lab mobile-app data-structures manual internet-programming software-engineering datawarehouse software-testing cyber-security securiy tcs devsecops dlcs

Updated May 18, 2024
Java

Bernardbyy / MicrosoftNorthwindDatawarehouse

Star

A Data Warehouse project based on Microsoft Northwind Database.

etl rollup olap cube datawarehouse business-analytics

Updated May 17, 2024

Phelipe-Sempreboni / tutorials-informations-notes

Star

Repository for tutorials, information and notes on technology in general.

sql etl olap datawarehousing amazon-web-services sqlserver datawarehouse oracle-database datalake datamart pl-sql datahub rds-database oltp pl-sql-script modelagem-de-dados powerbi-desktop powerbi-service

Updated May 17, 2024
Python

Datavault-UK / automate-dv

Star

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

metadata sql etl snowflake datawarehousing dbt elt datawarehouse datalake dataengineering datavault datavault20 data-vault

Updated May 15, 2024

Ansuman21 / IA-Final-Group-5

Star

This project outlines the final project requirements for DAV6100 - Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.

visualization aws framework dashboard cloud-computing datawarehouse etl-pipeline glue-job dataarchitecture quicksight-dashboard awsservices informationarchitecture

Updated May 14, 2024
HTML

ErdemOzgen / Data-Engineering-Roadmap

Star

Roadmap for Data Engineering

devops data-science machine-learning development roadmap awesome cloud database deep-learning interview ci-cd awesome-list guidelines datawarehouse datapipeline dataengineering awesome-resources datapreprocessing mlops

Updated May 9, 2024
Java

data-solution-automation-engine / DIRECT

Star

DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, audit and control data integration / ETL processes.

etl datawarehouse etl-framework etl-pipeline etl-automation datawarehouseautomation

Updated May 8, 2024
TSQL

aventius-software / DataWarehouse

Star

An open source and free to use generic (basic) Microsoft SQL Server data warehouse

sql sqlserver datawarehouse

Updated May 7, 2024
TSQL

mchien15 / datascience

Star

Soccer Players Data Analyst and Similar Players Finder

datawarehouse object-storage datalake trino soccer-analytics streamlit fbref

Updated May 6, 2024
Jupyter Notebook

data-solution-automation-engine / data-warehouse-automation-metadata-schema

Star

Generic interface exchange format for Data Warehouse Automation and ETL generation.

datawarehouse metadata-management etl-automation datawarehouseautomation etlgeneration

Updated May 6, 2024
C#

data-solution-automation-engine / virtual-data-warehouse

Star

The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the 'engine' for data solution automation.

data etl virtual datawarehouse codegeneration datavault datavault20 etl-automation datawarehouseautomation virtual-data-warehouse

Updated May 10, 2024
Handlebars

xkyleann / Databases_SQL_Projects

Star

This repository contains a collection of Databases projects and code samples showcasing my skills and experience in SQL-PostgreSQL development. It serves as a portfolio to demonstrate my proficiency in various aspects of Database programming. Mostly, includes tasks about SQL, PostgreSQL and GIS.

sql neo4j nosql postgresql gis indexing datawarehouse

Updated May 5, 2024
PLpgSQL

glynnbird / couchwarehouse

Star

Data warehouse for CouchDB

nodejs mysql couchdb cli elasticsearch sql sqlite postgresql datawarehouse

Updated Apr 30, 2024
JavaScript

techsparksguru / data_ai_for_all

Star

Data Analysis, Analytics, Science, AI & ML, LLM etc.

python spark apache-spark jupyter pandas metabase datascience datawarehousing datawarehouse

Updated Apr 28, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the datawarehouse topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datawarehouse topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

datawarehouse

Here are 393 public repositories matching this topic...

DataLinkDC / dinky

10Accademy-InsightStreamInc / Scalable_Datawarehouse_Amharic_Data_Ingestion_For_LLM_RAG

hydradatabase / hydra

jitsucom / bulker

ali-bin-kashif / etl-pipeline-project-cola-next

Rajsingh92 / MUST_HAVE_SKILLS

Priyush02K / BE-CSE

Bernardbyy / MicrosoftNorthwindDatawarehouse

Phelipe-Sempreboni / tutorials-informations-notes

Datavault-UK / automate-dv

Ansuman21 / IA-Final-Group-5

ErdemOzgen / Data-Engineering-Roadmap

data-solution-automation-engine / DIRECT

aventius-software / DataWarehouse

mchien15 / datascience

data-solution-automation-engine / data-warehouse-automation-metadata-schema

data-solution-automation-engine / virtual-data-warehouse

xkyleann / Databases_SQL_Projects

glynnbird / couchwarehouse

techsparksguru / data_ai_for_all

Improve this page

Add this topic to your repo