I want to copy some Parquet files into AWS Redshift, but the Redshift table has fewer columns than the Parquet files, because the omitted columns contain sensitive information. I therefore want to skip those columns during the copy. How should I proceed?
How to ignore some columns when copying Parquet files into AWS Redshift?
Asked by Rinze
The COPY command does not let you skip columns when loading Parquet files, as described in the documentation. If you can use the Glue Data Catalog, you can create an external schema in which your Parquet files are exposed as an external table. You can then SELECT from this external table and pick only the columns you are interested in.
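As a rough sketch of that approach, the snippet below builds the two SQL statements involved: one that creates the external schema and one that loads only the non-sensitive columns. Every name in it (`spectrum_staging`, `my_glue_database`, the IAM role ARN, the table and column names) is a placeholder I made up for illustration, and it assumes the Parquet files are already registered as a table in the Glue Data Catalog. The helper `build_insert_select` is not part of any library.

```python
from typing import Iterable, List

# Hypothetical schema, database, and role names; replace with your own.
# Assumes the Parquet files are already defined as a table in the Glue Data Catalog.
EXTERNAL_SCHEMA_DDL = """\
CREATE EXTERNAL SCHEMA spectrum_staging
FROM DATA CATALOG
DATABASE 'my_glue_database'
IAM_ROLE 'arn:aws:iam::123456789012:role/my-spectrum-role';"""


def build_insert_select(
    target_table: str,
    external_table: str,
    file_columns: Iterable[str],
    sensitive_columns: Iterable[str],
) -> str:
    """Build an INSERT ... SELECT that leaves out the sensitive columns.

    Identifiers are inserted as-is, so pass trusted names only.
    """
    sensitive = {c.lower() for c in sensitive_columns}
    # Keep the original column order, dropping anything marked sensitive.
    kept: List[str] = [c for c in file_columns if c.lower() not in sensitive]
    if not kept:
        raise ValueError("no columns left to load")
    column_list = ", ".join(kept)
    return (
        f"INSERT INTO {target_table} ({column_list})\n"
        f"SELECT {column_list}\n"
        f"FROM {external_table};"
    )


if __name__ == "__main__":
    print(EXTERNAL_SCHEMA_DDL)
    print()
    print(
        build_insert_select(
            target_table="public.customers",
            external_table="spectrum_staging.customers_parquet",
            file_columns=["id", "name", "ssn", "city"],
            sensitive_columns=["ssn"],
        )
    )
```

You would run the generated statements in Redshift with whatever SQL client you already use. Since the sensitive columns are never selected, they are not written to the target table.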