How to integrate JupyterHub with the existing Cloudera cluster

Hello,

We have an existing Cloudera cluster, python 3.3, pyspark is running python 2.7.14.

We are able to set up Jupyterhub on the same host with both python 2 and 3

How can we set up the environment in JupyterHub so that we can create Spark Context (python 2/spark 1.6) or Spark Session (python 3/spark 2) and access the HDFS data on the Cloudera Cluster?

Example would be greatly appreciated.

Thank you in advance.

BTW: the authentication part is very nice.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to integrate JupyterHub with the existing Cloudera cluster #2116

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How to integrate JupyterHub with the existing Cloudera cluster #2116

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions