In R, I have switched to using vroom
due to its speed at reading in large delimited files, but I cannot find a simple way to pre-filter large datasets as I could do with say the sqldf
package or through using SQLite and dplyr as described here
The Vroom documentation suggests using awk
to pre-filter CSVs, but I am wondering if there is an easier way to do this that ideally lets you write in the dplyr language.