SAP HANA Performance Tuning Guide for Developers

<p>SAP HANA is one of the fastest-growing products in <a href="/platforms/sap/sap/">SAP</a>’s 49-year-old history. It is an in-memory database engine that can analyze large datasets in real-time. Companies conduct various performance optimization techniques for SAP HANA to streamline and automate the data management process.</p> <p><a href="https://www.gspann.com/resources/case-studies/data-driven-assessment-of-customer-acquisition-and-retention-opportunities/">Read how team GSPANN helped a US-based semiconductor processing equipment manufacturer, with annual sales of approx. $10B, in automating the capture of customer acquisition and retention data by developing a custom solution within SAP HANA.</a></p> <p>This blog highlights various SAP HANA performance tuning techniques in the following segments:</p> <ul> <li>SAP HANA Table-level Optimization</li> <li>Optimization of SAP HANA Stored Procedures / Scripted Calculation Views / Table Functions</li> <li>Leveraging SAP HANA Using Linked Server</li> </ul> <h4 style="text-align: left;">SAP HANA Table-level Optimization</h4> <p>You can improve the performance of any SAP HANA system following the best practices mentioned below:</p> <ul> <li>Use the appropriate table type based on the usage. If a table is small and almost all columns of the table are used in most of the queries, then use the Row-type table. For all other cases, use a Column-type table.</li> <li>Avoid using indexes on non-unique columns. Since every index imposes an overhead in terms of memory and performance, it’s better to create as few indexes as possible. Also, use fewer columns in an index.</li> <li>Get better performance by assigning range, hash, and round-robin partitions to the large tables since the amount of data brought into working memory can be limited.</li> </ul> <h4 style="text-align: left;">Optimization Techniques for SAP HANA Stored Procedures / Scripted Calculation Views / Table Functions</h4> <p>The SAP HANA optimization is necessary to avoid the termination of tasks or processes when they breach the memory threshold (set for each process). It also improves resource utilization, such as memory and CPU time required.</p> <p>Based on our experience, you can tune your stored procedures by following the best practices mentioned below:</p> <ul> <li>Use input parameters to filter the required data at the source level.</li> <li>Replace local temporary tables with table variables. Unload the tables as soon as they are not required for further processing when using large tables.</li> <li>Reduce the overall memory consumption by partitioning large tables, which brings lesser data into working memory. The type of partitioning and columns used for partition should be determined based on the ways the table is used in a majority of scenarios.</li> <li>Instead of writing complex queries, it is recommended to split a complex query into multiple simple queries for better optimization.</li> <li>In case of delta processing on tables that don’t have a timestamp, create a table that stores the key, timestamp (of the record coming into the HANA system), and populate this table by using triggers. While processing, extract records from the main table about a period of time by joining with the delta table.</li> <li>If a process requires a large amount of data to be processed quickly (and you have exhausted all other options), then materialize the slow-changing data and store the partial results in a separate table. Schedule this materialization process to run at appropriate intervals to refresh its data. Once you have the data in a materialized table, then you can use this table and decrease the number of tables used in the main stored procedure.</li> <li>Using query pruning technique by using input parameters, i.e., when a user specifies the data/table that is required at that moment via input parameters, you can prune the logic that is not needed to fetch the requested data.</li> <li>Use table functions instead of scripted calculation views.</li> <li> In graphical calculation views, that bring a large volume of data, use query pruning to minimize data load based on users’ selection.<p><img style="padding: 0px 20px;" width="650px" height="auto" alt="SAP HANA Optimization Steps" src="https://images.prismic.io/gspann/2a331194-1534-43a1-b890-d566361357d5_Image+1_+SAP+HANA+Optimization+Steps.jpg?auto=compress,format" /></p> </li> <li> In a calculation view, minimize the amount of data that will be pulled into the working memory by using input parameters and applying filters at source projection in the data flow. If the calculation view is a union of multiple calculation views, then constant mapping in the union node will result in performance improvement by using query pruning, where the data is pulled only from the desired subview (See the picture below).<p><img style="padding: 0px 20px;" width="600px" height="auto" alt="Query Pruning with Constant Mapping" src="https://images.prismic.io/gspann/d46b3487-f72f-4a39-9f1b-77876d85cad7_Image+2_+Query+Pruning+with+Constant+Mapping.jpg?auto=compress,format" /></p> </li> </ul> <p>While the above-mentioned optimization techniques can help you with SAP HANA optimization, there are some things that <strong>you must avoid</strong> in the process:</p> <ul> <li>Using Smart Data Access (SDA) – SDAs use a high-level of memory consumption</li> <li>Using cursors</li> <li>Data conversions - since creating columns to be converted in every query for the desired data type is expensive</li> <li>Using calculated columns in ‘joins’</li> <li>Aggregation nodes</li> <li>Apply IS NULL check before filtering the data - filter the data set before applying the IS NULL check because they prove to be expensive on large data sets</li> </ul> <h4 style="text-align: left;">Leveraging SAP HANA using a Linked Server</h4> <p>For a large number of stored procedures that are already present in a different environment, like SQLServer or Oracle, instead of migrating the stored procedures to SAP HANA, leverage the power of HANA by creating a linked server connection between the existing system and SAP HANA. This will reduce the overall effort and cost of development.</p> <p>Benefits of using a linked server:</p> <ul> <li>Reduced cost and effort required to migrate logic to HANA</li> <li>Doesn’t require migration of non-SAP tables to HANA</li> <li>Leverage the power of SAP HANA by pushing most of the logic that involves SAP ERP Central Component (ECC) tables to HANA</li> </ul> <p>Here are a few examples of OpenQuery to fetch the data from HANA:</p> <div id="contentBlocker"></div> <div class="col-md-12"> <code> <pre style="padding: 0px 20px; margin: 0px; font-size: 15px; text-align: left; border:1px solid #B5B5B5; background-color: #F4F4F4;"> Select * From OpenQuery(ALEX_HANA, 'Select Distinct "MGMT_REGION", "MYCOMPANY", "CUSTOMER_NAME", "SHIP_TO","KPI_REGION" From "_SYS_BIC"."prd.stg2/CV_INSTALLED" Where "MYCOMPANY" <> '''' Order by MGMT_REGION, MYCOMPANY, SHIP_TO'). </pre> </code> </div><br> <p>Passing Dynamic values using an open query:</p> <div class="col-md-12"> <code> <pre style="padding: 0px 20px; margin: 0px; font-size: 15px; text-align: left; border:1px solid #B5B5B5; background-color: #F4F4F4;"> DECLARE @TableHANA VARCHAR(256) ,@TableSDA VARCHAR(256) ,@PrimaryKey VARCHAR(MAX) ,@GroupByPrimaryKey VARCHAR(MAX) ,@COLUMNS NVARCHAR(MAX) ,@COL_LEN INT; SET @COLUMNS = ',['; SELECT @COLUMNS = @COLUMNS + ISNULL((NAME + '],['),'') FROM SYS.columns WHERE object_id IN (SELECT OBJECT_ID FROM SYS.VIEWS WHERE NAME = 'X_BSD_25' ) --<====== CHANGE TABLE NAME HERE AND NAME <> 'iKEY_ID'; PRINT @COLUMNS; SET @COL_LEN = LEN(@COLUMNS); SET @COLUMNS = LEFT(@COLUMNS,@COL_LEN - 2 ) PRINT @COLUMNS; IF OBJECT_ID(N'tempdb..#TMPHANATABLE', N'U') IS NOT NULL DROP TABLE #TMPHANATABLE SELECT * INTO #TMPHANATABLE FROM OPENQUERY(ALEX_HANA,'SELECT * FROM "ALEX_CUSTOM"."ZT_X_BSD_25"'); SET @TableHANA = '#TMPHANATABLE' SET @TableSDA = '[dbo].[X_BSD_25]' SET @PrimaryKey = ' ,[iKEY_ID] AS PrimaryKey ' --Paste Field Names and create a single composite Key if more than one primary key fields exist in the field names below PRINT @PrimaryKey PRINT @Columns PRINT @GroupByPrimaryKey EXEC('SELECT PRIMARYKEY, COUNT(*) FROM (SELECT MIN(TableName) AS TableName, PrimaryKey' + @Columns + ' FROM ( SELECT ''' + @TableHANA + ''' AS TableName' + @PrimaryKey + @Columns + ' FROM ' + @TableHANA + ' A ' + 'UNION ALL SELECT ''' + @TableSDA + ''' AS TableName' + @PrimaryKey + @Columns + ' FROM ' + @TableSDA + ' B ' + ') N GROUP BY PrimaryKey' + @Columns + ' HAVING COUNT(*) = 1 ) XYZ GROUP BY PrimaryKey HAVING COUNT(*) > 1 ' ); </pre> </code> </div><br> <p>We hope you liked our first-hand account of SAP HANA optimization techniques in this blog. You can use these techniques to reduce the memory and CPU utilization of your business to get more work done without increasing the size of the system.</p>

Home /

SAP HANA is one of the fastest-growing products in SAP’s 49-year-old history. It is an in-memory database engine that can analyze large datasets in real-time. Companies conduct various performance optimization techniques for SAP HANA to streamline and automate the data management process.

Read how team GSPANN helped a US-based semiconductor processing equipment manufacturer, with annual sales of approx. $10B, in automating the capture of customer acquisition and retention data by developing a custom solution within SAP HANA.

This blog highlights various SAP HANA performance tuning techniques in the following segments:

SAP HANA Table-level Optimization
Optimization of SAP HANA Stored Procedures / Scripted Calculation Views / Table Functions
Leveraging SAP HANA Using Linked Server

SAP HANA Table-level Optimization

You can improve the performance of any SAP HANA system following the best practices mentioned below:

Use the appropriate table type based on the usage. If a table is small and almost all columns of the table are used in most of the queries, then use the Row-type table. For all other cases, use a Column-type table.
Avoid using indexes on non-unique columns. Since every index imposes an overhead in terms of memory and performance, it’s better to create as few indexes as possible. Also, use fewer columns in an index.
Get better performance by assigning range, hash, and round-robin partitions to the large tables since the amount of data brought into working memory can be limited.

Optimization Techniques for SAP HANA Stored Procedures / Scripted Calculation Views / Table Functions

The SAP HANA optimization is necessary to avoid the termination of tasks or processes when they breach the memory threshold (set for each process). It also improves resource utilization, such as memory and CPU time required.

Based on our experience, you can tune your stored procedures by following the best practices mentioned below:

Use input parameters to filter the required data at the source level.
Replace local temporary tables with table variables. Unload the tables as soon as they are not required for further processing when using large tables.
Reduce the overall memory consumption by partitioning large tables, which brings lesser data into working memory. The type of partitioning and columns used for partition should be determined based on the ways the table is used in a majority of scenarios.
Instead of writing complex queries, it is recommended to split a complex query into multiple simple queries for better optimization.
In case of delta processing on tables that don’t have a timestamp, create a table that stores the key, timestamp (of the record coming into the HANA system), and populate this table by using triggers. While processing, extract records from the main table about a period of time by joining with the delta table.
If a process requires a large amount of data to be processed quickly (and you have exhausted all other options), then materialize the slow-changing data and store the partial results in a separate table. Schedule this materialization process to run at appropriate intervals to refresh its data. Once you have the data in a materialized table, then you can use this table and decrease the number of tables used in the main stored procedure.
Using query pruning technique by using input parameters, i.e., when a user specifies the data/table that is required at that moment via input parameters, you can prune the logic that is not needed to fetch the requested data.
Use table functions instead of scripted calculation views.
In graphical calculation views, that bring a large volume of data, use query pruning to minimize data load based on users’ selection.
In a calculation view, minimize the amount of data that will be pulled into the working memory by using input parameters and applying filters at source projection in the data flow. If the calculation view is a union of multiple calculation views, then constant mapping in the union node will result in performance improvement by using query pruning, where the data is pulled only from the desired subview (See the picture below).

While the above-mentioned optimization techniques can help you with SAP HANA optimization, there are some things that you must avoid in the process:

Using Smart Data Access (SDA) – SDAs use a high-level of memory consumption
Using cursors
Data conversions - since creating columns to be converted in every query for the desired data type is expensive
Using calculated columns in ‘joins’
Aggregation nodes
Apply IS NULL check before filtering the data - filter the data set before applying the IS NULL check because they prove to be expensive on large data sets

Leveraging SAP HANA using a Linked Server

For a large number of stored procedures that are already present in a different environment, like SQLServer or Oracle, instead of migrating the stored procedures to SAP HANA, leverage the power of HANA by creating a linked server connection between the existing system and SAP HANA. This will reduce the overall effort and cost of development.

Benefits of using a linked server:

Reduced cost and effort required to migrate logic to HANA
Doesn’t require migration of non-SAP tables to HANA
Leverage the power of SAP HANA by pushing most of the logic that involves SAP ERP Central Component (ECC) tables to HANA

Here are a few examples of OpenQuery to fetch the data from HANA:

Dharanidhar Malluri

Sr. Technical Lead

Published Oct 13 2020

GSPANN for SAP

SAP HANA Performance Optimization Techniques

SAP HANA Table-level Optimization

Optimization Techniques for SAP HANA Stored Procedures / Scripted Calculation Views / Table Functions

Leveraging SAP HANA using a Linked Server

You May Also Like

Blog

Case Study

Case Study

Blog

Case Study

Case Study

Blog