SAP Data Management Overview

SAP Data Management

Data management has become a key business area over the past decade. As companies acquire gigabytes and terabytes of data from various sources, they need a way to parse and store it. Those using SAP for their data management processes have multiple options available to help them meet these goals.

Table of Contents

  1. General Data Management Concepts
  2. SAP's Data Management Solutions
    1. Data Lifecycle Manager
    2. Data Warehousing Options
      1. SAP Business Warehouse
      2. SAP BW/4HANA
      3. SAP Datasphere
    3. SAP Agile Data Preparation
    4. SAP Data Intelligence (SAP Data Hub)
    5. SAP Data Services
    6. SAP HANA and SAP HANA Cloud
      1. SAP HANA Cloud Services
      2. SAP HANA Smart Data Access
      3. SAP HANA Smart Data Integration
      4. SAP HANA Smart Data Quality
    7. SAP Information Lifecycle Management
    8. SAP Information Steward
    9. SAP LT Replication Server
    10. SAP Master Data Governance
    11. SAP Vora
  3. Other Key SAP Data Management Terms
  4. Additional Resources
    1. Blog Posts
    2. Books by SAP PRESS
    3. Videos

General Data Management Concepts

When data is first collected or loaded into a system, it must be validated and cleaned prior to storage, a concept known as data provisioning. SAP solutions such as SAP HANA Smart Data Integration and Smart Data Quality, SAP Data Services, and SAP LT Replication Server all provide users with ways to get data ready for use, from collecting to standardizing to storing.
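To make the validate-and-clean step concrete, here is a minimal Python sketch of data provisioning in the generic sense described above. It does not use any SAP API; the `CustomerRecord` fields, the `provision` function, and the validation rules are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CustomerRecord:
    # Hypothetical target structure for cleaned records.
    customer_id: str
    email: str
    country: str


def provision(raw: dict) -> Optional[CustomerRecord]:
    """Standardize one raw record; return None if it fails validation."""
    # Standardize: trim whitespace and normalize letter case.
    customer_id = str(raw.get("customer_id") or "").strip()
    email = str(raw.get("email") or "").strip().lower()
    country = str(raw.get("country") or "").strip().upper()

    # Validate: reject records with no ID or an implausible email address.
    if not customer_id or "@" not in email:
        return None
    return CustomerRecord(customer_id, email, country)


raw_rows = [
    {"customer_id": " 1001 ", "email": "Ana@Example.com ", "country": "de"},
    {"customer_id": "", "email": "no-id@example.com", "country": "US"},
    {"customer_id": "1003", "email": "not-an-email", "country": "fr"},
]

# Only records that pass validation are kept for storage.
clean_rows = [rec for rec in map(provision, raw_rows) if rec is not None]
print(clean_rows)
```

In this sketch only the first row survives, because the second has no ID and the third has no usable email address. Real provisioning tools apply far richer rules, but the collect, standardize, and store sequence is the same.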

Many SAP solutions are equipped to handle big data acquired from various sources, such as the Internet of Things, and stored in repositories such as data lakes, data warehouses, and data marts. In instances where solutions only handle specific sets of data, SAP users can connect them to databases and other warehousing options.

(Back to ToC.)

SAP’s Data Management Solutions

SAP has released a robust collection of products that focus on data management. Here are the key solutions:

Data Lifecycle Manager

The data lifecycle manager is an SAP HANA Extended Application Services tool that moves data from SAP HANA to other storage sources based on which of three states the data is in. These are “hot storage” for data currently in use, which should remain in the memory layer; “warm storage” for disk-based extended storage; and “cold storage” for data that is not currently needed, which can be kept on an external drive until it is needed again.
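The three-tier idea can be sketched as a simple rule. The Python example below is not the data lifecycle manager's actual logic or API; it assumes, purely for illustration, that the tier is chosen from the time since a data set was last accessed, and the `WARM_AFTER` and `COLD_AFTER` thresholds are hypothetical.

```python
from datetime import date, timedelta

# Hypothetical thresholds, chosen only for illustration.
WARM_AFTER = timedelta(days=90)
COLD_AFTER = timedelta(days=365)


def storage_tier(last_accessed: date, today: date) -> str:
    """Return "hot", "warm", or "cold" based on time since last access."""
    age = today - last_accessed
    if age >= COLD_AFTER:
        return "cold"   # not needed for now: external storage
    if age >= WARM_AFTER:
        return "warm"   # used occasionally: disk-based extended storage
    return "hot"        # in active use: stays in the memory layer


today = date(2024, 1, 1)
tiers = {
    "open_orders": storage_tier(date(2023, 12, 15), today),
    "last_year_invoices": storage_tier(date(2023, 6, 1), today),
    "archived_2019": storage_tier(date(2019, 3, 1), today),
}
print(tiers)
```

In practice, the rules that decide when data moves between tiers are defined by the business and are usually more than a single age cutoff, but the sketch shows how each data set ends up in exactly one of the three states.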

(Back to ToC.)

Data Warehousing Options

SAP has created multiple ways for clients to store data for use. By keeping data structured and easily accessible, models and reports are easier to create.

SAP Business Warehouse

SAP Business Warehouse (SAP BW) is SAP’s legacy data warehouse offering. It can be run on any database, from third-party options to SAP HANA. Running it on SAP HANA gives clients the ability to run reports using SAP HANA’s memory layer rather than compressing data into SAP BW first. SAP BW will no longer receive mainstream maintenance after 2027, SAP’s target date for discontinuing its SAP ERP solutions. Extended maintenance options will be available through 2030.

SAP BW/4HANA

SAP BW/4HANA was first released in 2016 and serves as the successor to legacy SAP BW. This solution is an updated version of SAP BW that includes multiple new features. It runs on top of the SAP HANA database, and focuses on software simplicity, openness, a modern interface, and high performance.

SAP Datasphere

SAP Datasphere (formerly known as SAP Data Warehouse Cloud) is an integrated, fully managed, and persona-driven data-warehouse-as-a-service solution that is suitable for SAP and non-SAP customers. It offers reduced deployment complexity and flexible pricing, along with integration with SAP Intelligent Suite solutions, SAP Analytics Cloud, SAP Business Technology Platform services, partner solutions, and open-source technologies.

The architecture is designed to attach to existing on-premise data warehouse deployments (SAP SQL Data Warehousing, SAP Business Warehouse, and others) and extend them into the software-as-a-service world. This feature enables hybrid scenarios and avoids the need to move the entire data mart into the cloud.

(Back to ToC.)

SAP Agile Data Preparation

SAP Agile Data Preparation is a tool that allows connection to data sources such as spreadsheets or databases. It can help users transform and manipulate data as needed.

(Back to ToC.)

SAP Data Intelligence (SAP Data Hub)

SAP Data Intelligence is a cloud service that combines artificial intelligence and machine learning to make better use of siloed data and bring together the IT and data science departments of an organization. In Q2 2020, the functionality of SAP Data Hub was fully integrated under the SAP Data Intelligence name, adding data orchestration, discovery, refinement, and governance.

(Back to ToC.)

SAP Data Services

SAP Data Services is SAP’s flagship enterprise information management solution. It supports data quality, data migration, text analysis, and interconnectivity with both SAP and non-SAP systems.

(Back to ToC.)

SAP HANA and SAP HANA Cloud

As SAP’s main database offering, the in-memory SAP HANA provides users with the tools they need to quickly call and sort through data, often in real time. It is available both on-premise and in the cloud. As part of its architecture, SAP HANA has multiple data management solutions available for users.

SAP HANA Cloud Services

SAP HANA Cloud services is a suite of solutions that provides multiple data management functionalities via SAP HANA Cloud and SAP Datasphere.

SAP HANA Smart Data Access

SAP HANA smart data access enables virtual access to remote SAP and non-SAP data sources without copying data locally into SAP HANA. This keeps data bloat down, because the data does not need to be moved into SAP HANA to be viewed.

SAP HANA Smart Data Integration

SAP HANA Smart Data Integration (SDI) loads data into SAP HANA from a variety of source systems, in batch mode or in real time, leveraging built-in and custom adapters. The SDI feature comes with SAP HANA.

SAP HANA Smart Data Quality

SAP HANA smart data quality (SDQ) is a subset of select SDI transformations. These include data cleansing, address cleansing, and geospatial data enrichment.

(Back to ToC.)

SAP Information Lifecycle Management

SAP Information Lifecycle Management (SAP ILM) is a tool that allows the blocking and deletion of data from an SAP system. This is especially important in instances where data privacy laws such as the General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA) require businesses to delete customer data upon request.

(Back to ToC.)

SAP Information Steward

SAP Information Steward is a single-platform solution used to discover, assess, define, monitor, and improve data quality.

(Back to ToC.)

SAP LT Replication Server

SAP Landscape Transformation Replication Server (SLT) is a comprehensive data replication tool designed to facilitate the real-time transfer of data between different systems within an SAP landscape.

(Back to ToC.)

SAP Master Data Governance

SAP Master Data Governance (SAP MDG) is a data governance solution that consolidates and creates enterprise master data, defines data management workflows, and determines the quality of said data.

(Back to ToC.)

SAP Vora

SAP Vora is a distributed computing solution deployed on Apache Hadoop and Spark clusters. It provides a semantic layer for big data and can be used to run combined analytics across datasets.

(Back to ToC.)

Other Key SAP Data Management Terms

In addition to the information laid out above, there are a handful of important SAP data management terms you should also be familiar with. Here they are in list form:

    • Apache Hadoop: An external, open-source software library used for storing and processing big data with the goal of increasing data storage and decreasing processing times. Often used in tandem with SAP HANA.
    • Data aging: A way to systematically store data as it grows stale, while allowing access as needed.
    • Dynamic tiering: A way to relocate data from the memory layer of SAP HANA to disk.
    • Hortonworks Data Platform: An open-source data framework used for the distribution of large data sets. Acts as a connector for SAP HANA data to be sent to cold storage on Apache Hadoop.
    • SAP Adaptive Server Enterprise: A relational database known as Sybase ASE before SAP’s acquisition of Sybase in 2010.
    • SAP Advanced Data Migration by Syniti: An application used to manage data migrations so they run smoothly and finish to spec.
    • SAP Data Mapping and Protection by BigID: An application used to manage data risk, such as determining where data originated and whether it’s okay to access it.
    • SAP Extended Enterprise Content Management by OpenText: An application that connects business documents to core processes.
    • SAP HANA Spark: A tool that bridges SAP HANA and Hortonworks Data Platform; must be used prior to sending data to cold storage on Apache Hadoop.
    • SAP Information Steward Accelerator by Syniti: A passive data governance integration that allows data stewards to investigate and remove data errors.
    • SAP Intelligent RPA: A tool that integrates AI and machine learning, used for automating repetitive, rule-based tasks such as collecting large sets of data and saving them for use.
    • SAP SQL Anywhere: A solution used to manage data in real time without needing connectivity.

(Back to ToC.)

Additional Resources

Eager to learn more about SAP data management? These blog posts, books, and videos can help.

Blog Posts

Books by SAP PRESS

Videos

(Back to ToC.)

What Next?

Learn more about SAP from our official SAP PRESS Learning Center.

And to continue learning even more about SAP and data management, sign up for our weekly blog recap here: