SAP AG today announced advanced support for, and integration with, Hadoop environments, along with an expanded "big data" partner council and customer showcases.
These "big data" integration capabilities are planned in SAP® Data Services and SAP® Information Steward software with the release of service pack 4 for the SAP HANA® platform. SAP also showcased game-changing business results among customers, made possible by "big data" implementations based on the SAP real-time data platform. The announcement was made at SAPPHIRE® NOW, being held in Orlando, Florida, May 14-16, 2012.
The SAP real-time data platform, based on the flagship SAP HANA platform, includes data management capabilities from SAP Sybase IQ, SAP Sybase ESP, SAP Sybase ASE and SAP Enterprise Information Management. It unlocks business value from "big data" by providing real-time decision support within the window of opportunity, with extreme capabilities to ingest, store and process "big data" in real time.
"Groundbreaking innovations like SAP HANA help our customers access and deliver information at unprecedented speeds -- up to 100,000 times faster than before -- and empower them with fundamentally new ways to run their businesses and master 'big data,'" said Steve Lucas, global executive vice president and general manager, Database & Technology, SAP. "The SAP real-time data platform delivers an information value chain that uncovers and harnesses the right information at the right moment by moving data among SAP HANA, SAP Sybase IQ and Hadoop file systems."
Customer Showcases "Big Data" Results Powered by SAP HANA
MITSUI KNOWLEDGE INDUSTRY analyzes genomes for cancer research and treatment. By building an information value chain that uses Hadoop for preprocessing DNA sequences and SAP HANA with R, an open source programming language and software environment for statistical computing and graphics, for real-time analysis, the company shortened genome analysis time from several days to only 20 minutes.
"Going from a process measured in days to one measured in minutes is radically transforming our customer relationships," said Yukihisa Kato, CTO and director, MITSUI KNOWLEDGE INDUSTRY. "Using the SAP real-time data platform with SAP HANA at its core will be critical to our DNA going forward and to future business growth."
SAP Enterprise Information Management Solutions to Power Access to All "Big Data" Sources
SAP envisions SAP Data Services and SAP Information Steward as the industry's first and only unified solution for data quality, data integration, text data processing, data profiling and metadata management. The company aims to allow users to unlock the full business potential of "big data" by making it easier to integrate and synthesize information across an enterprise's SAP and non-SAP applications. SAP Data Services and SAP Information Steward are intended to provide both business users and IT with an intuitive and comprehensive information management solution with planned enhancements that include:
Hadoop integration: reading from and loading to Hive and Hadoop Distributed File System (HDFS), rapid batch updating and loading to SAP HANA, SAP® Sybase® IQ server and any other data store, resulting in faster performance
Text data processing: extending the data view to analyze data sources by performing linguistic analysis and extracting relevant content from files, Web logs and social media
Authoritative and trustworthy data: helping users understand, assess and improve the quality of data by integrating data quality scorecards directly within business intelligence (BI) and enterprise applications, so users can immediately determine the accuracy of the data and, if necessary, take corrective action
"Our goal is to help organizations access build and govern information value chains across all data sources," said Lucas. "With our enterprise information management solutions, customers will have the ability to easily understand and access any data source -- be it from an SAP, custom or partner application, enterprise database or new data sources such as Hadoop -- so they can now better manage information throughout the organization."
SAP Extends "Big Data" Partner Ecosystem
SAP has set up a "big data" partner council that will co-innovate to produce solutions and uncover use cases and architectures that leverage the SAP real-time data platform and Hadoop. The council will include a cross-section of companies, including startups, hardware vendors, software providers and technology services organizations, that will collaborate with SAP on select projects in the SAP® Co-Innovation Lab. The cornerstone of this strategy is building strong integration with the Hadoop ecosystem, starting with Cloudera, the leading provider of enterprise-grade Hadoop data management software, services and training. By co-innovating with Cloudera, the top contributor to the Hadoop development community, SAP plans to enable customers to go beyond business analytics and into the world of next-generation applications that innovate on data and open the door to myriad new possibilities.
"We are very excited to work with SAP to provide customers with real-time insights from their Hadoop environments using our complementary solutions," said Mike Olson, co-founder and CEO, Cloudera. "The SAP real-time data platform, combined with the Cloudera Hadoop Distribution, will deliver unmatched capabilities in next-generation 'big data' applications and analytics to the enterprise."