Summary of Don't Use Apache Airflow

The video discusses the limitations of Apache Airflow and recommends alternatives such as Apache NiFi and Pentaho Kettle.

  • 00:00:00 In this video, Brian Kafke explains the basics of Apache Airflow, a platform often presented as a way to make data engineering easier. Kafke points out that Airflow is not inherently a data engineering tool; it is a job scheduler that makes it easier to manage dependencies and schedule tasks. Kafke also notes that Apache Airflow can be used for various purposes, such as machine learning or data movement.
  • 00:05:00 Apache Airflow is a Python-based framework for managing jobs. The framework has many limitations, including no built-in ETL functionality and a steep learning curve. Apache NiFi, an open-source alternative, is also worth considering.
  • 00:10:00 Apache Airflow is a job scheduler with a lot of features, but it can be confusing to use. One problem is that its terminology differs from that of many other tools. For example, a workflow in Apache Airflow is called a "DAG" (directed acyclic graph), and the individual steps within it are called tasks.
  • 00:15:00 The presenter discusses limitations of Apache Airflow, and recommends alternatives such as Apache NiFi and Pentaho Kettle.
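
To make the "job scheduler that manages dependencies" idea from the summary concrete, here is a minimal sketch of dependency-ordered execution using only the Python standard library. It does not use Airflow's API; the task names and the `run_order` helper are hypothetical, chosen purely for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical tasks (not from the video): each task maps to the set of
# tasks that must finish before it can start. Together they form a DAG.
dependencies = {
    "extract": set(),
    "clean": {"extract"},
    "enrich": {"extract"},
    "load": {"clean", "enrich"},
}


def run_order(deps):
    """Return one valid execution order that respects every dependency."""
    return list(TopologicalSorter(deps).static_order())


order = run_order(dependencies)
print(order)
```

The relative order of "clean" and "enrich" is not fixed, because neither depends on the other. A scheduler is free to run them in either order or in parallel, as long as "extract" comes first and "load" comes last. If the dependencies contained a cycle, `TopologicalSorter` would raise `graphlib.CycleError`, which is why such a workflow has to be acyclic.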

