Search
Close this search box.

Share your passion for data!

Build your confidence, strenghten your network business connections and exchange best practice knowledge in the community by being a public speaker in your field!

 

We are looking for guest speakers at #DataEngineersLunch and #CassandraLunch!

 

If you are experienced in #ETL, #datawrangling, etc, and are interested in joining our events, please fill out this form and we will reach out to you ASAP!

Data Engineer’s Lunch

Speak at Data Engineer’s Lunch happening every Monday!

Apache Cassandra Lunch

Speak at Apache Cassandra Lunch happening every Thursday!

Become a guest speaker at Data Engineer’s Lunch!

Become a
guest speaker at
Data Engineer’s Lunch!

If you want to submit to speak at Cassandra Lunch, please check the following requirements:
Step 1: Please check the following requirements:
The session must be in English.
Session length will be 60 minutes.
Sessions will be online (via Zoom).
Step 2: Please fill out the form below:

Become a guest speaker at Cassandra Lunch!

Become a
guest speaker at
Cassandra Lunch!!

If you want to submit to speak at Cassandra Lunch, please check the following requirements:
Step 1: Please check the following requirements:
The session must be in English.
Session length will be 60 minutes.
Sessions will be online (via Zoom).
Step 2: Please fill out the form below:
Loading Events

« All Events

  • This event has passed.

Automating Data Operations for Apache Cassandra with Apache Airflow

December 7, 2022 @ 11:00 am - 12:00 pm

We’ll go over automating Data Operations/Spark Processes with Cassandra with Airflow and provide a hands-on demonstration on Gitpod.

Most Cassandra administrators have to import / export data as part of their Database Administrator role. Being Cassandra Admin, this means at least knowing Spark, DSBulk, etc. Wouldn’t it be cool to automate these processes and allow a self-service option? This talk will go over automating Data Operations / Spark Processes with Cassandra with Airflow and provide a hands-on demonstration on Gitpod with Astra so everyone can try it out.

Take Aways:

  • Learn how Apache Airflow, Apache Spark, and Apache Cassandra can be used together for DataOps
  • Learn how Airflow can wrap complex Import/Export/ETL Spark jobs in a GUI for users
  • Learn how to delete data in Cassandra with Apache Spark
  • Hands-on: Create Tables/Keyspaces in Cassandra/Astra
  • Hands-on: Extract / Load data from a CSV file into Cassandra table
  • Hands-on: Transform data from Cassandra table into another Cassandra table

Ref:

  • https://github.com/Anant/example-cassandra-etl-with-airflow-and-spark
  • https://github.com/Anant/example-cassandra-presto-airflow

Organizer

Anant