Python function to generate a mask analysis
-
Updated
Jul 22, 2017 - Jupyter Notebook
Python function to generate a mask analysis
.Net library for researching and inferring links between personal data used in genealogy.
Simple Spark wrapper for validating data
Generates a match score of two person names from 0-100, where 100 is the highest, on how closely two individual full names match. The scoring is based on a series of tests, algorithms, AI, and an ever-growing body of Machine Learning-based generated knowledge
Project for the "Data and Information Quality" course at Politecnico di Milano - AY 2023/2024 - Data Issues: Duplication, Variable Types - ML Task: Classification
DsFeatFreqComp – Dataset Feature-Frequency Comparison R Package
Data Quality control framework for dataframes in R
Scripts I wrote at my job which could be helpful to others
The guidelines to help you to manage your antarctic biodiversity data
O Hub é a solução responsável por centralizar a consolidação dos dados no BigQuery, ferramenta escolhida para servir de data warehouse do raft-suite.
This is a tool developed in Python to assist with the data governance process, particularly during the migration project Mainframe>MDM>PIC. The team checks the integrity of the data and evaluate business rules are being fullfiled by synchronizing the data between the MDM platform and the current item information on Mainframe. This tool's purpose…
Building Data Pipelines for a data warehouse with Airflow and AWS
⚡ Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
Data quality monitoring library designed for time series data, made for modern data stack
📄 Assess information and data quality in various formats.
🚚 Agile Data Science Workflows made easy with Pyspark
DsProfiling – Dataset Profiling
Implementation of data typology for imbalanced datasets.
Add a description, image, and links to the data-quality topic page so that developers can more easily learn about it.
To associate your repository with the data-quality topic, visit your repo's landing page and select "manage topics."