Creation of the first Scikit-learn consortium

28th September 2018
Source: Dataiku
Posted By : Alex Lynn
Creation of the first Scikit-learn consortium

Launched in 2007 by members of the Python scientific community, the Scikit-learn project has been accelerated as part of Inria's research into functional imaging of the brain. Now, ten years later, Inria has announced the creation of a consortium of corporate sponsors to accelerate development.

The consortium includes enterprise AI development platform maker, Dataiku, alongside Microsoft, NVidia, Intel, AXA, Boston Consulting Group, and BNP Paribas Cardiff.

Mostly unknown to the general public, Scikit-learn is one of the flagship libraries in the field of advanced machine learning. The consortium hopes that, beyond financial support, the initial group of global innovators will help to promote the project to broader audience and create greater visibility among institutions.

Florian Douetteau, CEO of Dataiku, stated: "Today, more than 500,000 data scientists use Scikit-learn daily around the world. It's easy to imagine that the combined salary and the value created by these users of Scikit-learn is over $100bn a year. The benefits of this project are extraordinary. By becoming a sponsor of this consortium, we are making a fantastic investment for the future of data science in the world.

"In addition, at Dataiku, we have been integrating Scikit-Learn into our offering since 2013. Dataiku provides us with a clickable version of Scikit-learn to enable everyone to use it, from the business analyst to the most advanced data scientist. This corresponds to a trends of technology innovation, where big scientific advances are often accelerated in communities by open source, then made known to everyone else through the work of software companies."

A software library developed in Python, Scikit-learn is dedicated to machine learning. It’s simple and powerful predictive models make it possible to extract power insights from data using many different models, from the efficient linear model on texts to random forests, and well adapted to heterogeneous databases. Scikit-Learn's competitors are Tensorflow, supported by Google, and SparkML, supported by the DataBricks, which are also integrated with Dataiku. Compared to TensorFlow which focuses on Deep Learning, Scikit-Learn provides an unparallelled diversity of algorithms.

Today, Scikit-Learn is used by the biggest players in the technology: AirBnb for the detection of fraud, Uber for the prediction of the demand, or by Spotify for the recommendation of music.


You must be logged in to comment

Write a comment

No comments




Sign up to view our publications

Sign up

Sign up to view our downloads

Sign up

Girls in Tech | Catalyst | 2019
4th September 2019
United Kingdom The Brewery, London
DSEI 2019
10th September 2019
United Kingdom EXCEL, London
EMO Hannover 2019
16th September 2019
Germany Hannover
Women in Tech Festival 2019
17th September 2019
United Kingdom The Brewery, London
European Microwave Week 2019
29th September 2019
France Porte De Versailles Paris