Commits


frazar authored and GitHub committed 68ba49d5018
GH-37242: [Python][Parquet] Parquet Support write and validate Page CRC (#38360)

### Rationale for this change

The C++ Parquet API already supports enabling CRC checksums for read and write operations. CRC checksums are optional and can detect data corruption caused by, for example, file storage issues or [cosmic rays](https://en.wikipedia.org/wiki/Soft_error). It would therefore be beneficial to expose this optional functionality in the Python API too.

This PR is based on a previous PR which became stale: #37439

### What changes are included in this PR?

The PyArrow interface is expanded to include a `page_checksum_enabled` flag.

### Are these changes tested?

[ ] NOT YET!

### Are there any user-facing changes?

The change is backward compatible. An additional, optional keyword argument is added to some interfaces.

Closes #37242
Supersedes #37439
* Closes: #37242

Lead-authored-by: Francesco Zardi <frazar0@hotmail.it>
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Co-authored-by: mwish <maplewish117@gmail.com>
Co-authored-by: Alenka Frim <AlenkaF@users.noreply.github.com>
Signed-off-by: AlenkaF <frim.alenka@gmail.com>
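
As a rough illustration of the rationale in this commit, the sketch below shows how a per-page CRC-32 can reveal a single flipped bit. It uses only Python's standard `zlib` module and is not the PyArrow or Parquet C++ implementation; the helper names `page_crc` and `verify_page` and the sample payload are hypothetical stand-ins chosen for this example.

```python
import zlib


def page_crc(page_bytes: bytes) -> int:
    """Return the CRC-32 of the given bytes as an unsigned 32-bit integer."""
    return zlib.crc32(page_bytes) & 0xFFFFFFFF


def verify_page(page_bytes: bytes, stored_crc: int) -> bool:
    """Recompute the checksum and compare it with the stored value."""
    return page_crc(page_bytes) == stored_crc


# Hypothetical stand-in for the serialized bytes of one data page.
page = b"example page payload" * 8

# Writer side: compute the checksum and keep it alongside the page.
stored = page_crc(page)

# Simulate corruption in storage by flipping one bit of one byte.
damaged = bytearray(page)
damaged[5] ^= 0x01
corrupted = bytes(damaged)

# Reader side: the intact page passes, the corrupted page does not.
print(verify_page(page, stored))
print(verify_page(corrupted, stored))
```

Because the checksum is stored per page, a reader that verifies it can reject a damaged page instead of silently returning wrong values, which is why the feature is optional: verification costs extra work on every read.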