Sunday, 25th August 2019

Databricks and RStudio introduce new version of MLflow with R Integration

Databricks and RStudio has introduced a new release of MLflow, an open source multi-cloud framework for the machine learning lifecycle, now with R integration. RStudio has partnered with Databricks to develop an R API for MLflow v0.7.0 which was showcased today at Spark + AI Summit Europe. This new integration adds to features that have already been released, making MLflow the most comprehensive open source machine learning platform, with support for multiple programming languages, integrations with popular machine learning libraries, and support for multiple clouds.

Previous to MLflow, the industry did not have a standard process or end-to-end infrastructure to develop and productionize machine learning applications in a simple and consistent way. With MLflow, organizations can package their code as reproducible runs, execute and compare hundreds of parallel experiments, leverage any hardware or software platform for training, tuning, hyperparameter search and more. Additionally, organizations can deploy and manage models in production on a variety of clouds and serving platforms. As a testament to MLflow’s design to be an open platform, RStudio’s contribution extends the MLflow platform to the large community of data scientists who use RStudio and R programming language.

"In many organizations machine learning workflows are far too ad-hoc, with no systematic tracking of experiments, inadequate protocols around reproducibility, and no consistent way to package and deploy models. MLflow helps address these issues in a uniform fashion across languages and frameworks," said JJ Allaire, chief executive officer at RStudio. “Integration of R with MLflow will significantly broaden the reach of the project by allowing a broader community to use and contribute to MLflow.”

Since launching MLflow only four months ago, community engagement and contributions have led to an impressive array of new features and integrations that have been released, including:

  • Support for Multiple Programming Languages: To give developers a choice, in addition to R, MLflow supports Python, Java and Scala; as well as a REST server interface which can be used from any language.
  • Integration with Popular Machine Learning Libraries and Frameworks: MLflow has built-in integrations with the most popular machine learning libraries such as scikit-learn, TensorFlow, Keras, PyTorch, H2O, and Apache Spark MLlib to help teams build, test, and deploy machine learning applications.
  • Cross-cloud Support: Organizations can use MLflow to quickly deploy machine learning models to multiple cloud services, including Databricks, Azure Machine Learning, and Amazon SageMaker based on their needs. MLflow leverages AWS S3, Google Cloud Storage, and Azure Data Lake Storage allowing teams to easily track and share artifacts from their code.

“With MLflow, data science teams can systematically package and reuse models across frameworks, track and share experiments locally or in the cloud, and deploy models virtually anywhere,” according to Matei Zaharia, chief technologist at Databricks, the original creator of Apache Spark, and Tech Lead of MLflow. “The flurry of interest and contributions we’ve seen from the data science community validates the need for an open source framework to streamline the machine learning lifecycle.”

Cloudera says that many of the leading global healthcare and life sciences companies, including 9 of...
Mott MacDonald will offer its customers Insight Search, powered by Netwrix, to deliver more visibili...
Shoppers still don’t trust brands to protect their private details.
This collaboration between Microsoft and Informatica provides customers an accelerated path for thei...
The solution, based on the technology acquired from Concept Searching in December, enables organisat...
MapR technology provides innovative file system for unified analytics from edge to cloud.
HVR’s real-time data replication software and WhereScape’s automation software combine best-in-class...
Revolut selects Exasol over alternative cloud vendors for freedom and performance.