Solved: SAP HANA - How system understands the columnar dat...

Former Member · ‎01-26-2012

Hi All,

SAP HANA database stores the data in columnar format. So from row based table perspective, each column will be treated as an individual table with distinct values.

How HANA database maintains the relationship of each row so the user gets correct data? In other words, how HANA database internally process the column based records?

Regards,

Mandar

Former Member · ‎01-31-2012

Thank you Venkatesh and Lars for your reply.

Lars, first of all thank you!

I got the point that system stores distinct values of each column.. let me copy your statements

- per column you store a dictionary of every distinct value that occurs in this column

- for each distinct value you need to maintain a list or array or vector (however you call it) that tells you, for which row a specific value occurs.

My doubt is clear till this point.

now, when you say "when we want to put back together a full specific row, we need to go through every column and find the matching column value for each row id "

So how does system creates a row when there are distinct column values. As there are n permutations and combinations, how internal algorithm creates the right record?

Regards,

Mandar

rama_shankar3 · ‎01-31-2012

Thanks - good points !

Former Member · ‎01-31-2012

Thank you Lars for a quick reply... I understand that even if you knew you would not be allowed to tell, but I couldn't resist myself since it is a very unconventional way of storage. Thanks again for your valuable inputs, at the same I will check out for other databases as well....

Regards,

Mandar

lbreddemann · ‎01-31-2012

Hi Mandar,

I guess the question how to put back together rows from column store data is something that puzzles many who are used to work with classic row-store databases.

I haven't seen the column store coding, but usually something like this is done:

- per column you store a dictionary of every distinct value that occurs in this column

- for each distinct value you need to maintain a list or array or vector (however you call it) that tells you, for which row a specific value occurs.

This is exactly what a classic index does:


[distinct value] --> [rowid_1, rowid_2, ... , rowid_n]

Now, when we want to put back together a full specific row, we need to go through every column and find the matching column value for each rowid - so we need to perform a kind of inverse index function.

As you can assume, this is something pretty 'expensive' and in fact: rebuilding a complete row is the most expensive operation of a column store.

This also should explain why column store tables are that good for aggregation and data analysis but terribly bad when it's about working on specific rows.

I hope this is what you wanted to know.

regards,

Lars

Former Member · ‎01-31-2012

Hi

You can check the HANA Architecture to understand how system will differentiate Row and Column data storage.

http://www.erphowtos.com/sap-bw/33-sap-hana-overview-and-architecture.html

We have two different for data flows for row and column tables.

When it comes to data consistency, this will be achieved using MVCC(Multi Version Concurrency Control)

In this we have two types i) CID(commit ID) ii) TID(Transaction ID)

CID is used for row based storage where as TID is used for Column based storage.

Regards,

Venkatesh

SAP HANA - How system understands the columnar data relation?

Accepted Solutions (1)

Accepted Solutions (1)

Answers (4)

Answers (4)

Re: Mass Update a Zfield in Std Table with Split &...

Error in SmartForm after changes

Re: odata Metadata issue for standard service

Custom Fields for Goods Receipt Label

Re: Struggling with Filters on Select - Fiori App