Commits


Will Jones authored and Joris Van den Bossche committed e05f032c1e5
ARROW-7800 [Python] implement iter_batches() method for ParquetFile and ParquetReader Implements an `iter_batches()` method for `ParquetFile` and `ParquetReader`. This is an attempt to complete the work started in #6386 by @rollokb. I wasn't sure how to set the `batch_size` parameter in `ArrowReaderProperties` after the init, so I ended up expanding the C++ API so that `GetRecordBatchReader` took a `batch_size` parameter. I know nearly nothing about C++ and Cython, so if there's a better way I'd love to learn. But I saw [Wes' comment in the PR creating those methods suggesting you might want this](https://github.com/apache/arrow/pull/4304#pullrequestreview-253163586), so I thought there's a chance I'm not overstepping too much. Closes #6979 from ghost/ARROW-7800/expose-get-record-batch-reader Lead-authored-by: Will Jones <willjones127@gmail.com> Co-authored-by: Will Jones <will.jones@mscience.com> Co-authored-by: Will Jones <wjones127@users.noreply.github.com> Signed-off-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>