Is there a good guide for getting the Spark operators to work?

cdabel · February 5, 2021, 11:30pm

I am an Astronomer Enterprise customer and I am just starting to look at creating a DAG that will connect to Spark on an AWS EMR cluster and process some data. The DAG should support both SparkSQL and some basic pySpark code.

Is there an Astronomer guide for getting this type of a DAG set up? What are the basic steps I need to do get this working? I need to deploy some pySpark code in a Jupyter Notebook that my data scientist has developed within Sagemaker. Any tips? or is it already supported out of the box?

Thank you,
–Chris

Topic		Replies	Views
Issue installing the Databricks Operator Astronomer Nebula	4	4423	January 15, 2019
Making use of the GCP DataflowOperators Astronomer	0	1672	January 10, 2019
How do I connect Astronomer Cloud to Databricks? Astronomer Nebula	0	2596	August 6, 2019
Can I use an operator available in 1.10.2 if I'm running an earlier Airflow version? Astronomer	1	1878	April 8, 2019
ECS Operators - docs Updated? Airflow	1	1158	October 20, 2022

Is there a good guide for getting the Spark operators to work?

Related topics