Data Pipeline Solution

195 views Asked by At

We have a use-case to build data pipeline solution in which we need following things:

  1. Ability to have multiple steps (outputs from one step should feed as input to next)
  2. Ability to have multiple algorithms (SQL Query or probably invoke REST endpoint) in each step.

Input to first step can be anything. We have DW tables, but we can pre-process and keep the relevant information in AWS S3 or other data store.

Something like this: Data Pipeline

Is there an existing solution that already provides functionalities similar to this or can be modified to support this?

Having something in AWS would be easier to integrate.

1

There are 1 answers

2
Rownum Highart On

How about AWS Glue? Sounds like a fit to your goals...