vdk-postgres: schema inference for ingestion #3217

Open
DeltaMichael opened this issue Mar 18, 2024 · 0 comments

Overview

The vdk-postgres plugin does not support schema inference when ingesting data.

from vdk.api.job_input import IJobInput

def run(job_input: IJobInput):
    payload = {
        "id": 1,
        "name": "Tony",
        "title": "Don't talk to strangers"
    }
    job_input.send_object_for_ingestion(
        payload=payload
    )

The above code, run as a single step in a data job against a PostgreSQL or Redshift database, should have the following result:

  1. If the table does not exist, create it with columns corresponding to the payload keys
  2. If the table exists, diff the existing columns against the payload keys and add any new columns
  3. Ingest the data into the table
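
As a rough illustration of steps 1 and 2, the inference could be sketched as below. This is not the plugin's implementation: `infer_columns`, `schema_statements`, the type mapping, and the table name `people` are all hypothetical, and the mapping assumes PostgreSQL type names (Redshift may need a different one). The sketch only builds SQL strings and does not touch a database.

```python
from typing import Any, Dict, List, Optional, Set

# Hypothetical mapping from Python types to PostgreSQL column types.
# Anything not listed here falls back to TEXT.
_PG_TYPES = {bool: "BOOLEAN", int: "BIGINT", float: "DOUBLE PRECISION", str: "TEXT"}


def infer_columns(payload: Dict[str, Any]) -> Dict[str, str]:
    """Map each payload key to a column type based on the value's Python type."""
    return {key: _PG_TYPES.get(type(value), "TEXT") for key, value in payload.items()}


def schema_statements(
    table: str,
    payload: Dict[str, Any],
    existing_columns: Optional[Set[str]] = None,
) -> List[str]:
    """Return the DDL needed before ingesting the payload.

    existing_columns is None when the table does not exist yet;
    otherwise it holds the names of the columns already present.
    """
    inferred = infer_columns(payload)
    if existing_columns is None:
        # Step 1: the table is missing, so create it from the payload keys.
        cols = ", ".join(f'"{name}" {pg_type}' for name, pg_type in inferred.items())
        return [f'CREATE TABLE IF NOT EXISTS "{table}" ({cols})']
    # Step 2: the table exists, so add only the columns it lacks.
    return [
        f'ALTER TABLE "{table}" ADD COLUMN "{name}" {pg_type}'
        for name, pg_type in inferred.items()
        if name not in existing_columns
    ]


payload = {"id": 1, "name": "Tony", "title": "Don't talk to strangers"}
create_ddl = schema_statements("people", payload)
alter_ddl = schema_statements("people", payload, existing_columns={"id", "name"})
```

Here `create_ddl` holds a single `CREATE TABLE` statement for all three keys, while `alter_ddl` holds one `ALTER TABLE` statement for the `title` column only. A real implementation would also need to read the existing columns from the database and decide how to handle type conflicts, which this sketch leaves out.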

Acceptance criteria

  1. The above use case is supported