New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Spark] Add more tests checking the preservedRowTracking tag #2953
base: master
Are you sure you want to change the base?
[Spark] Add more tests checking the preservedRowTracking tag #2953
Conversation
c3d1656
to
44c6dc3
Compare
44c6dc3
to
66230a9
Compare
66230a9
to
83b4719
Compare
f697735
to
1bf0b0f
Compare
val allTags = RowTracking.addPreservedRowTrackingTagIfNotSet( | ||
protocol, metadata, tags) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Blindly tagging all commits as preserving row tracking is pretty brittle: there may be a case (now or introduced in the future) that doesn't preserve row tracking and there's virtually no chance that we detect it and correctly mark row tracking information as not preserved.
We could instead add tags to the most relevant code paths to explicitly mark that row tracking is preserved, at least in inserts (WriteIntoDelta, InsertOnlyMergeExecutor) so that most data ingestion operations are covered
We should in any case have a test somewhere that checks that the preserved row tracking tag is set to true for inserts.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay, let's think more about it, but for now let's keep the Preserving tests in the commands that we preserved Row Tracking (that I forgot to add in the previous Delta PRs).
...est/scala/org/apache/spark/sql/delta/columnmapping/RemoveColumnMappingRowTrackingSuite.scala
Outdated
Show resolved
Hide resolved
...est/scala/org/apache/spark/sql/delta/columnmapping/RemoveColumnMappingRowTrackingSuite.scala
Outdated
Show resolved
Hide resolved
1bf0b0f
to
a6eac93
Compare
Which Delta project/connector is this regarding?
Description
Add more tests checking the behavior of the
preservedRowTracking
tag in specific commands like Merge/Update/Delete/Optimize/Remove Column Mapping, we introduced Preserving Row Tracking for these commands in previous PRs, which include adding thepreservedRowTracking
tag.How was this patch tested?
Added UTs.
Does this PR introduce any user-facing changes?
No.