This can be used in pipelines that pass data back and forth between Arrow and other processes (like DuckDB).

to_arrow(.data, as_arrow_query = TRUE)

Arguments

.data

the object to be converted

as_arrow_query

should the returned object be wrapped as an arrow_dplyr_query? (logical, default: TRUE)

Value

a RecordBatchReader object. When as_arrow_query = TRUE (the default), it is wrapped as an arrow_dplyr_query so that it can be used directly in dplyr pipelines.

Examples

library(arrow)
library(dplyr)

ds <- InMemoryDataset$create(mtcars)

ds %>%
  filter(mpg < 30) %>%
  to_duckdb() %>%
  group_by(cyl) %>%
  summarize(mean_mpg = mean(mpg, na.rm = TRUE)) %>%
  to_arrow() %>%
  collect()
#> # A tibble: 3 × 2
#>     cyl mean_mpg
#>   <dbl>    <dbl>
#> 1     6     19.7
#> 2     4     23.7
#> 3     8     15.1
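
The as_arrow_query argument can be illustrated with a minimal sketch. It assumes the duckdb and dbplyr packages are installed, and that as_arrow_query = FALSE returns the bare RecordBatchReader described under Value rather than a dplyr query. The names reader and tbl are illustrative only.

```r
library(arrow)
library(dplyr)

# Assumption: the duckdb and dbplyr packages are installed,
# since to_duckdb() relies on them.
ds <- InMemoryDataset$create(mtcars)

reader <- ds %>%
  filter(mpg < 30) %>%
  to_duckdb() %>%
  group_by(cyl) %>%
  summarize(mean_mpg = mean(mpg, na.rm = TRUE)) %>%
  # Skip the arrow_dplyr_query wrapper and keep the raw reader
  to_arrow(as_arrow_query = FALSE)

# Check that the result is an unwrapped RecordBatchReader
inherits(reader, "RecordBatchReader")

# Materialize the stream as an Arrow Table
tbl <- reader$read_table()
```

An unwrapped reader may be preferable when the result is handed to code that consumes record batches directly instead of continuing a dplyr pipeline.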