Data Engineer’s Lunch #41: PygramETL
In Data Engineer’s Lunch #41: Pygrametl, we discussed PygramETL, a python ETL tool. This is the end for now of […]
Data Engineer’s Lunch #41: PygramETL Read More »
In Data Engineer’s Lunch #41: Pygrametl, we discussed PygramETL, a python ETL tool. This is the end for now of […]
Data Engineer’s Lunch #41: PygramETL Read More »
In Apache Cassandra Lunch #59: Functions in Cassandra, we discussed the functions that can be used inside of the Cassandra
Apache Cassandra Lunch #59: Functions in Cassandra Read More »
In Data Engineer’s Lunch #28: Petl for Data Engineering, we discussed Petl as part of our ongoing series on python
Data Engineer’s Lunch #28: Petl for Data Engineering Read More »
In this blog post, we will discuss a number of ways of doing dependency management when running spark scripts. This
Spark Script Dependency Management Read More »
In Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management, we discussed using Airflow to schedule tasks on a
Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management Read More »
In Data Engineer’s Lunch #24: Pandas for Data Engineering, we discussed using Pandas for performing Data Engineering tasks in Python. This
Data Engineer’s Lunch #24: Pandas for Data Engineering Read More »