Overview

Data Scientist Workbench aims to be your one-stop shop for data science tools. At this time, it is a technology preview limited to IPython/Jupyter notebooks enabled with Spark integration. You will use notebooks to develop and run Spark programs, as well visualize, document and present your analysis. Currently, Spark integration is only supported by the Scala runtime.

Requesting your own Data Scientist Workbench

Go to Data Scientist Workbench and click the big blue button. After completing the registration, you will receive an e-mail with instructions.

Hello World

To get a feel of how the Data Scientist Notebooks work with Scala and Spark, follow the steps below to load a "Hello World" notebook along with its sample data.

  1. Go to Data Scientist Workbench and click the My Notebooks menu.
  2. Use the search bar to import the following file and notebook.

    • Notebook: https://share.knowledgeanyhow.org/#/api/v1/workbench/10.114.214.68/shares/sHbSUP0luy1xE2q/Scala%20and%20Spark%20in%2015%20minutes.ipynb

    • Data: https://share.knowledgeanyhow.org/#/api/v1/workbench/10.114.214.68/shares/t9LwxWg0EkWKMYX/2015.csv

  3. Follow the notebook!

For more details on sharing data and notebooks, check this guide.