Launched in 2007 by members of the Python scientific community, the Scikit-learn project has been accelerated as part of Inria's research into functional imaging of the brain. Now, ten years later, Inria has announced the creation of a consortium of corporate sponsors to accelerate development.
The consortium includes enterprise AI development platform maker, Dataiku, alongside Microsoft, NVidia, Intel, AXA, Boston Consulting Group, and BNP Paribas Cardiff.
Mostly unknown to the general public, Scikit-learn is one of the flagship libraries in the field of advanced machine learning. The consortium hopes that, beyond financial support, the initial group of global innovators will help to promote the project to broader audience and create greater visibility among institutions.
Florian Douetteau, CEO of Dataiku, stated: "Today, more than 500,000 data scientists use Scikit-learn daily around the world. It's easy to imagine that the combined salary and the value created by these users of Scikit-learn is over $100bn a year. The benefits of this project are extraordinary. By becoming a sponsor of this consortium, we are making a fantastic investment for the future of data science in the world.
"In addition, at Dataiku, we have been integrating Scikit-Learn into our offering since 2013. Dataiku provides us with a clickable version of Scikit-learn to enable everyone to use it, from the business analyst to the most advanced data scientist. This corresponds to a trends of technology innovation, where big scientific advances are often accelerated in communities by open source, then made known to everyone else through the work of software companies."
A software library developed in Python, Scikit-learn is dedicated to machine learning. It’s simple and powerful predictive models make it possible to extract power insights from data using many different models, from the efficient linear model on texts to random forests, and well adapted to heterogeneous databases. Scikit-Learn's competitors are Tensorflow, supported by Google, and SparkML, supported by the DataBricks, which are also integrated with Dataiku. Compared to TensorFlow which focuses on Deep Learning, Scikit-Learn provides an unparallelled diversity of algorithms.
Today, Scikit-Learn is used by the biggest players in the technology: AirBnb for the detection of fraud, Uber for the prediction of the demand, or by Spotify for the recommendation of music.