Skip to content

Correctly handle blocks with "block byte size" fields in the Avro reader#22387

Open
vuule wants to merge 3 commits intorapidsai:mainfrom
vuule:bug-read_avro-support-block-byte-size
Open

Correctly handle blocks with "block byte size" fields in the Avro reader#22387
vuule wants to merge 3 commits intorapidsai:mainfrom
vuule:bug-read_avro-support-block-byte-size

Conversation

@vuule
Copy link
Copy Markdown
Contributor

@vuule vuule commented May 5, 2026

Description

When the number of elements in the Avro block is stored as a negative number, the block also includes its size in bytes. This PR allows the reader to correctly parse such files.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented May 5, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@vuule vuule added bug Something isn't working non-breaking Non-breaking change labels May 5, 2026
@github-actions github-actions Bot added libcudf Affects libcudf (C++/CUDA) code. Python Affects Python cuDF API. labels May 5, 2026
@GPUtester GPUtester moved this to In Progress in cuDF Python May 5, 2026
@vuule
Copy link
Copy Markdown
Contributor Author

vuule commented May 5, 2026

/ok to test e8ad97b

@vuule vuule changed the title Correctly handle blocks with the "block byte size" fields in the Avro reader Correctly handle blocks with "block byte size" fields in the Avro reader May 5, 2026
@vuule vuule marked this pull request as ready for review May 5, 2026 23:15
@vuule vuule requested review from a team as code owners May 5, 2026 23:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change Python Affects Python cuDF API.

Projects

Status: In Progress

Development

Successfully merging this pull request may close these issues.

2 participants