[feature-request] Prune Invalid Docs During Segment Commit for Upsert Tables #14588

Open
ankitsultana opened this issue Dec 3, 2024 · 0 comments

ankitsultana commented Dec 3, 2024

While Upsert Compaction is great, for tables with very high ingestion throughput it would be ideal to prune invalid docs during segment commit itself, since compaction is costly and in many cases cannot keep up. In one of our use cases, I think this feature would reduce table size by a further 2-3x.

This should be relatively simple to do for Full Upsert tables, but I think it would be harder for Partial Upsert tables.
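
To make the Full Upsert case concrete, here is a rough sketch of the idea in Python. It is not Pinot code: `ingest_full_upsert`, `prune_invalid_docs`, and the `id` / `ts` column names are hypothetical, and the set of valid doc IDs stands in for whatever structure the server actually tracks. It also only considers docs invalidated within the committing segment itself.

```python
from typing import Any, Dict, List, Set, Tuple

Row = Dict[str, Any]


def ingest_full_upsert(rows: List[Row], pk: str = "id", cmp: str = "ts") -> Set[int]:
    """Simulate full-upsert ingestion and return the doc IDs that are still valid.

    Assumption: a later row replaces an earlier one with the same primary key
    when its comparison value is greater than or equal to the current one.
    """
    latest: Dict[Any, int] = {}  # primary key -> doc ID of the current valid row
    valid: Set[int] = set()
    for doc_id, row in enumerate(rows):
        prev = latest.get(row[pk])
        if prev is None or row[cmp] >= rows[prev][cmp]:
            if prev is not None:
                valid.discard(prev)  # the older doc becomes invalid
            latest[row[pk]] = doc_id
            valid.add(doc_id)
        # otherwise the incoming row is out of order and is never valid
    return valid


def prune_invalid_docs(
    rows: List[Row], valid_doc_ids: Set[int]
) -> Tuple[List[Row], Dict[int, int]]:
    """Keep only valid rows at commit time.

    Returns the pruned rows plus an old-doc-ID -> new-doc-ID mapping, since
    anything that refers to docs by ID would need to be updated after pruning.
    """
    kept: List[Row] = []
    remap: Dict[int, int] = {}
    for doc_id, row in enumerate(rows):
        if doc_id in valid_doc_ids:
            remap[doc_id] = len(kept)
            kept.append(row)
    return kept, remap


if __name__ == "__main__":
    consumed = [
        {"id": "a", "ts": 1, "v": 10},
        {"id": "b", "ts": 1, "v": 20},
        {"id": "a", "ts": 2, "v": 11},
        {"id": "a", "ts": 1, "v": 99},  # out-of-order update
    ]
    valid = ingest_full_upsert(consumed)
    committed, remap = prune_invalid_docs(consumed, valid)
    print(len(consumed), "docs consumed,", len(committed), "docs committed")
```

In this sketch the committed segment holds only the rows that are valid at commit time, so the doc IDs shift and any per-key location metadata would have to be remapped. Partial Upsert is not covered here.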

cc: @tibrewalpratik17

@ankitsultana ankitsultana changed the title [feature-request] Prune Invalid Docs During Segment Commit for Full Upsert Tables [feature-request] Prune Invalid Docs During Segment Commit for Upsert Tables Dec 4, 2024