Commits


Romain Francois authored and Neal Richardson committed cf9df1498a2
ARROW-6532 [R] write_parquet() uses writer properties (general and arrow specific) This adds parameters to `write_parquet()` to control compression, whether to use dictionary, etc ... on top of the C++ classes `parquet::WriterProperties` and `parquet::ArrowWriterProperties` e.g. ```r write_parquet(tab, file, compression = "gzip", compression_level = 7) ``` Closes #5451 from romainfrancois/ARROW-6532/write_parquet_compression and squashes the following commits: 413dd411d <Romain Francois> test make_valid_version() 50555f8cf <Romain Francois> rename arguments to `x` and `sink` 9aff79b9a <Romain Francois> implement ==.Object that calls $Equals instead of implementing for each class. ecd92184a <Romain Francois> rework documentation for write_parquet() 56dac333e <Romain Francois> Move read_parquet() and write_parquet() to top of the file 45ec63ba6 <Romain François> Update r/R/parquet.R 66c51fd2a <Romain Francois> added all.equal.Object() that uses == c5549decf <Romain Francois> Test ==.Table 5ade52dd9 <Romain Francois> wrong length for use_dictionary and write_statistics 00cc214ca <Romain Francois> abstract various ParquetWriterPropertiesBuilder$set_*() methods 1fdcc0b9d <Romain Francois> suggestsions from @nealrichardson 9bee8de47 <Romain Francois> define and use internal make_valid_version() function 004cf905b <Romain Francois> M%ake compression_from_name() vectorized 86d9ff4bd <Romain Francois> Remove the _ from builder classes 6c4f00345 <Romain Francois> add test helper so that we actually can test parquet roundtrip d318a66a8 <Romain Francois> ==.Table 7f1c18437 <Romain Francois> align arguments following tidyverse style guide 72caaabd7 <Romain Francois> using assert_that() 738ea6e50 <Romain Francois> Remove $default() methods and use $create() wityh default arguments instead. 11662643b <Romain Francois> using make_valid_time_unit() 4055f67c4 <Romain Francois> More flexible arguments use_dictionary= and write_statistics= 2f2ae00a8 <Romain Francois> More flexible compression= and compression_level= 1e3b5b6bd <Romain Francois> document() 2dd2cb96f <Romain Francois> + compression_level= in write_parquet() b8337e119 <Romain Francois> lint fa8990b63 <Romain Francois> Expose options from ParquetWriterProperties and ParquetArrowWriterProperties to write_parquet() 09ea0ad29 <Romain Francois> + ParquetWriterProperties$create() and associated ParquetWriterProperties_Builder class skeleton 1b84ad493 <Romain Francois> Exposing classes parquet::arrow::ArrowWriterProperties and parquet::arrow::WriterProperties to R side 0e09ac8f3 <Romain Francois> lint aa340957a <Romain Francois> passing down the right stream 9ed32b606 <Romain Francois> Make write_parquet() generic, internal impl using streams rather than file path for more flexibility Lead-authored-by: Romain Francois <romain@rstudio.com> Co-authored-by: Romain François <romain@purrple.cat> Signed-off-by: Neal Richardson <neal.p.richardson@gmail.com>