-
Notifications
You must be signed in to change notification settings - Fork 871
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added 'data_dict' attribute (DataDictDataset) to AbstractVersionedDat… #3737
base: main
Are you sure you want to change the base?
Conversation
Hi @noamgoldberg, thanks for your PR ! Could you explain the rationale behind this? What problem does it solve? |
I use kedro a lot for personal projects, and it's helpful to have a data dictionary attached to large datasets. For example, I like to create data_dict.yml with the feature descriptions, ranges, and general source information, to be referenced in Jupyter notebooks and used dynamically in the code (i.e. visualizations, reports). The
This would create a dataset stocks_data with an attached data dictionary. |
@noamgoldberg so this data_dict basically contains metadata about the dataset? |
@merelcht yes :) I mainly use it for feature definitions and basic dataset information (i.e. author, source, location/date created) |
cd32932
to
f41a822
Compare
…aset Signed-off-by: Noam Goldberg <noamgoldberg2@gmail.com>
Hi @noamgoldberg, sorry it took us so long to get back to you. IIUC, the https://docs.kedro.org/projects/kedro-viz/en/latest/kedro-viz_visualisation.html#visualise-layers Please confirm if that would suit your needs. Arguably we could do a better job at documenting it, most likely here: https://docs.kedro.org/en/stable/data/data_catalog.html |
…aset
Description
Development notes
Developer Certificate of Origin
We need all contributions to comply with the Developer Certificate of Origin (DCO). All commits must be signed off by including a
Signed-off-by
line in the commit message. See our wiki for guidance.If your PR is blocked due to unsigned commits, then you must follow the instructions under "Rebase the branch" on the GitHub Checks page for your PR. This will retroactively add the sign-off to all unsigned commits and allow the DCO check to pass.
Checklist
RELEASE.md
file