Solved: integer vs varchar joins

rmuhuri · ‎06-27-2016

Hi

I have come across a project where they have used a lot of ETL and done the datawarehousing the old classical way i.e generating integer Surrogate Keys. I believe they have done for performance as integer joins performs better that varchar joins . The dev system has 57 million fact table rows and dim tables have anywhere from 100 K to 1 m rows . So prod tables would be even horrendously larger.

so leaving aside modelling optimization techniques like partition pruning/ query pruning / etc etc . what is the best join strategy for just large tables .

The join fields are 10 characters , one option could be converting the alpha numeric characters to ascii or unicode but 10 characters would be blown to a 20 byte integers , so a join on just a big field would be worse .

any idea's on join optimization .

PS : we will be using SDI , so we can do complex transformation on real time .

lbreddemann · ‎06-28-2016

You seem to assume that SAP HANA performs operations on the actual values of columns in a table.

That's not the case.

Instead, most of the very base operations like filter, join, project are performed with so called value-vectors. Only when the data needs to be materialized, e.g. to create a human readable output, the actual values are pulled from the so called dictionaries put in place.

The openHPI and openSAP courses about in memory databases cover that quite nicely.

So, to answer your question: don't bother "optimizing" your table design by making funky data type choices.

If you need a surrogate key, then create one.

If you don't need it,don't.

You can save storage by picking e.g. a numeric data type for numbers instead of a character data type - which is important in itself. But this won't have a tremendous impact on the join performance.

integer vs varchar joins

Accepted Solutions (1)

Accepted Solutions (1)

Answers (0)

Re: Table DDNTF is not active in the dictionary er...

CDS View Table functions Abap Class consume with p...

Re: Incorrect documentation regarding "Error Sanit...

Can CrestalReports.NET be developed even if the DB...

How-to-guide for SAP GUI Scripting