Databricks will deliver a certified Apache Spark distribution offering for the SAP HANA platform, following the announcement of a new partnership with SAP.
The production-ready distribution offering, based on Apache Spark 1.0, is deployable in the cloud or on premise and available for immediate download from SAP at no cost at spr.ly/SAP_and_Spark.
The Databricks-certified distribution offering fro SAP HANA contains the Spark processing engine that works with any Hadoop distribution out of the box.
“We’re thrilled to be embarking on this journey with SAP to bring together two powerful technologies to better enable enterprises to derive value from their data,” said Ion Stoica, CEO of Databricks. “SAP HANA is both an incredibly powerful and fast analytics engine, as well as a repository for some of the most valuable enterprise data by virtue of the enterprise applications that it helps run. This integration will help enable the large and growing community of Hadoop and Spark developers and applications to harness these capabilities immediately via Spark as well as extend the reach of SAP HANA.”
SAP HANA integrated with Spark will help enable real-time applications and interactive analysis across corporate application data with content stored in Hadoop Distributed File System (HDFS).
SAP HANA’s end-to-end processing acceleration helps to simplify the integration of mission-critical applications with contextual data stored in Hadoop-like data stores. In-memory computation can therefore occur where data resides and can help minimise costly and time-consuming data movement.
“SAP has continually been at the forefront of innovation to simplify and better serve customers, and bringing together Spark and SAP HANA is simply the latest example of this,” said Steve Lucas, president, platform solutions, SAP.
“This can allow enterprises to build on SAP HANA’s value proposition by providing some of the best-of-breed capabilities across the full spectrum of data and processing needs without the need to painstakingly stitch together independent solutions.”
These new capabilities will enable the creation of a new class of applications with SAP HANA and Spark that span data domains – for example, to combine sensor data with billing systems to deliver personalised resource and cost-saving recommendations for utilities, or integrate inventory analysis with social media trends for retailers.

