Getting Started with SANSA-Stack

This document summarizes all instructions to help first time users to get and use SANSA-Stack.


Set up SANSA

In order to get quickly started, SANSA provides project templates for the following build tools: Maven and SBT.

Maven

  • Use this Maven template to generate a SANSA project using Apache Spark.

    The subsequent steps depend on your IDE. Generally, just import this repository as a Maven project and start using SANSA / Spark.

  • Use this Maven template to generate a SANSA project using Apache Flink.

    The subsequent steps depend on your IDE. Generally, just import this repository as a Maven project and start using SANSA / Flink.

SBT

  • Use this SBT template to generate a SANSA project using Apache Spark.

    The subsequent steps depend on your IDE. Generally, just import this repository as a SBT project and start using SANSA / Spark.

  • Use this SBT template to generate a SANSA project using Apache Flink.

    The subsequent steps depend on your IDE. Generally, just import this repository as a SBT project and start using SANSA / Flink.

These templates help you to set up the project structure and to create the initial build files. Enjoy it! 🙂

IDE Setup


For developers using SANSA:

SANSA-Notebooks

Interactive Spark Notebooks can run SANSA-Examples and are easy to deploy with docker-compose. Deployment stack includes Hadoop for HDFS, Spark for running SANSA examples, Hue for navigation and copying file to HDFS. The notebooks are created and run using Apache Zeppelin.

Clone the SANSA-Notebooks git repository:

Get the SANSA Examples jar file (requires wget):

Start the cluster (this will lead to downloading BDE docker images, will take a while):

When start-up is done you will be able to access the following interfaces:

Go on and open Zeppelin, choose any available notebook and try to execute it.

For more information refer to SANSA-Notebooks Github repository. If you have questions or found bugs, feel free to open an issue on the Github.

Configuring the Computing Frameworks

   
   
   
SANSA VersionSpark VersionFlink VersionScala Version
0.8.03.0.x2.12
0.7.12.4.x2.11
0.6.02.4.x1.8.x2.11
0.5.02.4.x1.7.x2.11
0.4.02.3.x1.5.x2.11
0.3.02.2.x1.4.x2.11
0.2.02.1.x1.3.x2.11
0.1.02.0.x1.1.x2.11

Using SANSA in Maven Projects

If you want to import the full SANSA Stack for Apache Spark, please add the following Maven dependency to your project POM file:

If you want to use only a particular layer of the stack, the pattern is always “sansa-LAYER_NAME-spark_SCALA_VERSION” for the Maven artifact name, i.e. it looks in your POM file as follows:

For example, if you just want to use latest RDF layer version 0.8.0 with Scala 2.12 in your project, you have to add