Post-function modification (opposite/equivalent to `pipe`) #701

elijahbenizzy · 2024-02-15T18:57:27Z

Is your feature request related to a problem? Please describe.
People often want data quality checks to modify the output, but that's really not what they're supposed to do.
@pipe applies a function, but it is run beforehand. So if you just want to change the output of a node, you have to do it in two steps.

Describe the solution you'd like
New decorator that functions like pipe, but runs afterwards:

@mutate(
    step(_keep_range, range=(0,100)),
    step(_dropna)
)   
def data() -> pd.Series:
    return ...

This would form the DAG: data.raw -> data.with_drop_between -> data

Describe alternatives you've considered
Making it more central to data quality or just integating with the node.

Additional context
Related to an OS question.

The text was updated successfully, but these errors were encountered:

zilto · 2024-02-16T18:51:54Z

I think there's a place for this along with @pipe. Also the data.raw -> data fits the @check_output semantic.

Although, I propose to rename it @pipe_out @pipe_output or @pipe_post. Maybe we'd rename @pipe to @pipe_in (input, or pre) too?

Although these decorators are not much different than writing additional nodes, I think they can greatly facilitate migration by allowing to plug in an existing codebase into new/smaller Hamilton initiatives

elijahbenizzy · 2024-02-16T19:00:08Z

I think there's a place for this along with @pipe. Also the data.raw -> data fits the @check_output semantic.

Although, I propose to rename it @pipe_out @pipe_output or @pipe_post. Maybe we'd rename @pipe to @pipe_in (input, or pre) too?

Although these decorators are not much different than writing additional nodes, I think they can greatly facilitate migration by allowing to plug in an existing codebase into new/smaller Hamilton initiatives

Feels like @modify_input and @modify_output might be better than pipe?

zilto · 2024-02-16T19:01:34Z

I like your @pipe documentation and the fact that is echoes the pandas .pipe() operation. Apparently, Polars also has a pipe operator.

To me, "pipe" better communicates than "modify" the idea that you can stack many sequential transforms

elijahbenizzy · 2024-03-09T05:17:47Z

From @skrawcz on #749 (duplicate):

Is your feature request related to a problem? Please describe.
@pipe is nice, but can be a little counter-intuitive to read. i.e. the function parameter declares the dependency, is then modified by @pipe and then passed into the function.

Describe the solution you'd like
@post_pipe would be the opposite of @pipe. The function would declare the dependency, the body of the function would run, and then the @post_pipe steps would run after the function.

@post_pipe(# these run after the function is run.
    step(_transform_1, v=1),
    step(_transform_2, v=2),
)
def A_processed(A: pd.DataFrame) -> pd.DataFrame:
    return A # original A -- could do modifications here.

Describe alternatives you've considered
N/A

Additional context

elijahbenizzy added enhancement New feature or request decorators labels Feb 15, 2024

elijahbenizzy mentioned this issue Mar 8, 2024

Add @post_pipe #749

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Post-function modification (opposite/equivalent to `pipe`) #701

Post-function modification (opposite/equivalent to `pipe`) #701

elijahbenizzy commented Feb 15, 2024 •

edited

zilto commented Feb 16, 2024

elijahbenizzy commented Feb 16, 2024

zilto commented Feb 16, 2024 •

edited

elijahbenizzy commented Mar 9, 2024

Post-function modification (opposite/equivalent to pipe) #701

Post-function modification (opposite/equivalent to pipe) #701

Comments

elijahbenizzy commented Feb 15, 2024 • edited

zilto commented Feb 16, 2024

elijahbenizzy commented Feb 16, 2024

zilto commented Feb 16, 2024 • edited

elijahbenizzy commented Mar 9, 2024

Post-function modification (opposite/equivalent to `pipe`) #701

Post-function modification (opposite/equivalent to `pipe`) #701

elijahbenizzy commented Feb 15, 2024 •

edited

zilto commented Feb 16, 2024 •

edited