Native language Scala now enabled through transformations

2nd August 2016
Source: Dataiku
Posted By : Anna Flockett

Maker of the all-in-one predictive analytics software platform Dataiku Data Science Studio (DSS), have announced the release of Dataiku DSS 3.1. Dataiku have designed it to add additional external integrations, an improved UX interface, five visual machine learning engines, and now have the ability to enable transformations in Apache Spark’s native language, Scala.

The blending of visual code-free and free-form code-based transformations is one of the main strengths of Dataiku DSS for the prototyping and production of data applications. In addition to Python, R, SQL, Hive, Impala, and Pig, Dataiku DSS 3.1 now enables Apache Spark users to write transformations and interactive notebooks in Scala, bringing the power of Spark's native and most performant language to the data teams using Dataiku DSS.

Compared with Python, Scala is considered the ‘engineering language’ for developing data science applications. One of the main advantages of using Scala in an integrated data science production environment is its agility, which allows for easy testing and refactoring of code as a data application is being built. Dataiku DSS users who use Spark now have the ability to write transformations and interactive notebooks in Scala when developing data solutions.

Dataiku DSS 3.1 also introduces new visual machine learning engines that allow users to create incredibly powerful predictive applications within a code-free interface. Users of all skill levels can now leverage HPE Vertica machine learning, H2O Sparkling Water, MLlib, Scikit-Learn, and XGBoost directly from within the visual analysis section of Dataiku DSS 3.1 to apply powerful machine learning algorithms to their data science projects without having to write a single line of code.

“With Dataiku DSS 3.1, we continue to bridge the gap between day to day analytic needs and the latest cutting edge data science technologies,” said Florian Douetteau, CEO and co-founder of Dataiku.

By adding additional machine learning engines and enabling development in Scala, we are bringing even more tools to the table. This allows our users to build the best and most dynamic data science applications quickly, Douetteau added. “All of the new features in this release add to our goal of being a complete, end-to-end platform for the creation, development, and deployment of predictive analytics solutions for any organisation.”

Additional features of DSS 3.1 include:

  • New external databases - Integration with IBM Netezza, Hana, and Big Query.
  • New DSS project home page - Project dependencies are now visible in the user interface. Projects also now have a status (Active, Sandbox, and Archive) and can be tagged and filtered in various ways.
  • Fluid Navigation - A new, fluid way to navigate between items.
  • Better Integration with Tableau - Users can extend Dataiku DSS compatibility by creating custom export formats for datasets, including Tableau .tde files. This allows for better integration with Tableau and other tools.

Dataiku DSS 3.1 enables data teams of all skill levels to develop powerful data analytics solutions using the latest techniques in data science and technologies in Machine Learning.

You must be logged in to comment

Write a comment

No comments

Sign up to view our publications

Sign up

Sign up to view our downloads

Sign up

Girls in Tech | Catalyst | 2019
4th September 2019
United Kingdom The Brewery, London
DSEI 2019
10th September 2019
United Kingdom EXCEL, London
EMO Hannover 2019
16th September 2019
Germany Hannover
Women in Tech Festival 2019
17th September 2019
United Kingdom The Brewery, London
European Microwave Week 2019
29th September 2019
France Porte De Versailles Paris