2018 Big Data 100: The 10 Coolest Data Science And Machine Learning Tools

Science Projects

Data scientists and machine learning engineers are in big demand in the big data world, consistently topping the LinkedIn lists of the most in-demand jobs. With a shortage of people with data science and machine learning expertise, businesses are increasingly looking for software tools that can help fill that skills gap.

The global market for data science platforms hit $19.76 billion in 2016 and will grow at a CAGR of more than 36 percent to reach $128.21 billion by 2022, according to Stratistics Market Research Consulting.

As part of the 2018 Big Data 100, we've put together a list of 10 data science and machine learning software companies that solution providers should be aware of.


Top Executive: CEO Gurjeet Singh

Ayasdi, which develops a machine intelligence platform that businesses and organizations use to run big data applications, likes to say that it sits at the juncture of AI and big data.

Ayasdi Envision, which debuted last year, is a framework for developing intelligent applications that run on the Ayasdi platform and take advantage of its artificial intelligence and machine learning capabilities.

The company is based in Palo Alto, Calif.


Top Executive: CEO Florian Douetteau

Dataiku offers a collaborative platform for advanced analytics, allowing organizations to apply data science and machine learning techniques to build and deploy their own big data software.

Dataiku raised $28 million in Series B funding in January. In February the New York-based company said it tripled its revenue in 2017 and doubled the size of its staff.


Top Executive: CEO Jeremy Achin

DataRobot develops an automated machine learning platform that captures the knowledge, experience and best practices of data scientists and uses that information to build and deploy predictive models much more quickly than previously possible. With those models, analysts can uncover hidden opportunities and predict outcomes from huge volumes of data.

Boston-based DataRobot snagged $54 million in Series C financing in March, 2017, bringing its total funding to more than $124 million, and the company has aggressively invested in its global partner ecosystem.


Top Executive: CEO Ian Swanson

The DataScience.com Platform provides capabilities that cater to data scientists, business users and IT teams. The system makes it possible for data science teams to collaborate on data-driven projects by helping them explore and visualize data, share analyses, deploy models into production and track their performance.

In March the Culver City, Calif.-based company partnered with GitHub in a bid to promote version control best practices for enterprise data science teams looking to scale up their data science projects.

Domino Data Lab

Top Executive: CEO Nick Elprin

Domino develops a data science platform for use by data scientists and data science teams, business executives and IT managers. The company's system for developing and deploying predictive models includes a workbench, a collaboration hub, and tools for publishing and deployment.

In January San Francisco-based Domino Data Lab reported triple-digit revenue growth in 2017.


Top Executive: CEO SriSatish Ambati

H2O offers open-source machine learning software, including its namesake flagship product that empowers businesses to quickly deploy machine learning and predictive analysis systems for critical applications and operational intelligence.

H20.ai, based in Mountain View, Calif., recently launched Driverless AI, software that automates machine learning and introduces visualization and interpretability features for data modeling.


Top Executive: CEO Michael Berthold

Knime develops an open-source business analytics platform that is used by data scientists for data mining and predictive modeling tasks. The Zurich, Switzerland-based company's software is used for a range of analytical applications including customer intelligence, financial credit scoring and retail market basket analysis.


Top Executive: CEO Peter Lee

RapidMiner's data science software provides an integrated environment for data preparation, machine learning, deep learning, text mining and predictive analysis tasks.

In February Boston-based RapidMiner launched RapidMiner 8.1 and RapidMiner Auto Model, the latter an addition to RapidMiner Studio that helps automate the development of machine learning models.


Top Executive: President and CEO Doug Merritt

Fast-growing Splunk develops software for collecting and analyzing machine data to provide operational intelligence in IT management, security and business processes.

In April Splunk, based in San Francisco, debuted its first product specifically developed for industrial Internet of Things applications. Also in April the company said that it has expanded the artificial intelligence capabilities across its entire product portfolio.

And in recent months Splunk has acquired several companies that develop machine learning technology including Rocana and SignalSense.

Sumo Logic

Top Executive: President and CEO Ramin Sayar

Sumo Logic develops a cloud-native machine data analytics platform that provides log data management and metrics monitoring for IT operations and security applications. The platform incorporates machine learning technology within its machine data analytics and anomaly detection functionality.

In January Sumo Logic, based in Redwood City, Calif., acquired FactorChain, a developer of security threat investigation software, that the company said accelerates the convergence of IT and security workflows.