Filtering JDBC Ingestion with AWS Glue and PySpark

Asked by Gerasimos on 30 September 2020 at 15:25 (1k views)

I am using AWS Glue to ingest data from a MySQL database. I know that I can supply a custom query when reading over JDBC with PySpark. Does the same apply when ingesting from a table that was catalogued by a crawler? Right now I am using this:

datasource = glueContext.create_dynamic_frame.from_catalog(database="db_name", table_name="table_name")

Is there any way to ingest only part of the table instead of the whole thing? For example, something equivalent to select * from table where column_x > value.
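For reference, the PySpark JDBC approach mentioned above can be sketched as follows. This is only a sketch under assumptions, not a confirmed Glue-catalog solution: it bypasses from_catalog and reads through Spark's JDBC source, passing a parenthesised subquery as the dbtable option so that the WHERE clause runs inside MySQL. The helper names (jdbc_filter_options, read_filtered), the alias "filtered", and the URL and credentials in the usage note are all hypothetical. The value is restricted to numbers because it is interpolated directly into the SQL text.

```python
from numbers import Real


def jdbc_filter_options(url, user, password, table, column, min_value):
    """Build Spark JDBC reader options that push a WHERE clause down to MySQL.

    Hypothetical helper: table and column are trusted identifiers, and
    min_value must be numeric because it is placed directly into the SQL.
    """
    if isinstance(min_value, bool) or not isinstance(min_value, Real):
        raise TypeError("min_value must be numeric")
    # Spark's JDBC source accepts an aliased subquery wherever a table name
    # is expected, so only the matching rows leave the database.
    subquery = f"(SELECT * FROM {table} WHERE {column} > {min_value}) AS filtered"
    return {"url": url, "user": user, "password": password, "dbtable": subquery}


def read_filtered(glue_context, options):
    """Read the filtered rows and wrap them as a DynamicFrame.

    Only works inside a Glue job, where the awsglue package is available.
    """
    from awsglue.dynamicframe import DynamicFrame  # imported lazily on purpose

    spark = glue_context.spark_session
    df = spark.read.format("jdbc").options(**options).load()
    return DynamicFrame.fromDF(df, glue_context, "filtered_source")
```

Inside a Glue job this might be called as read_filtered(glueContext, jdbc_filter_options("jdbc:mysql://host:3306/db_name", "user", "password", "table_name", "column_x", 100)), with the placeholder host and credentials replaced by real ones. Depending on the environment, a "driver" entry may also need to be added to the options.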
