INSERT INTO TABLE (SELECT ...) Poor Performance

Former Member · ‎02-27-2016

Hi folks,

I have another performance issue on HANA. I have a query Insert Into table (Select...) as follows:


insert into "SYSTEM"."SIMILARITYSCORES"
(select T."id", T."text_data", T.TOTAL_TERM_COUNT, T.SCORE
FROM TM_GET_RELATED_DOCUMENTS (
DOCUMENT 'some text here to find its related documents'
SEARCH "text_data" FROM "SYSTEM"."TABLE_TEST"
RETURN
TOP 100000
"id", "text_data"
) AS T);

This query takes 7-8 minutes to be completed. However, if I just execute select statement, it finishes instantly. Select part returns about 750 rows, so is it normal to take 7-8 minutes to insert them into another table? By the way, there is no primary key nor index on SIMILARITYSCORES table.

I tried to replaced insert into part with create table as follows, but nothing changed with performance


create table "SYSTEM"."SIMILARITYSCORES" AS
(select T."id" as "id", T."text_data" as "text", T.TOTAL_TERM_COUNT as "TOTAL_TERM_COUNT", T.SCORE as "SCORE"
FROM TM_GET_RELATED_DOCUMENTS (
 DOCUMENT 'some text here to find its relevant documents '
 SEARCH "text_data" FROM "SYSTEM"."TABLE_TEST"
 RETURN
 TOP 100000
 "id", "text_data"
) AS T)

I want to find a way to improve its performance, maybe you can help me with this.

Note: TM_GET_RELATED_DOCUMENTS is a function which is coming from text mining.

Thanks in advance,

Inanc

Message was edited by: Tom Flanagan

Former Member · ‎02-28-2016

Hi Inanc,

I see it no way a query optimization question. If we would talk about milliseconds then probably yes. But if insert of 750 records takes 7-8 minutes… then I’d rather think about HANA bug or something is completely wrong in your system. Which HANA revision you are using?

Try to narrow down the problem, e.g.:

Replace
create table "SYSTEM"."SIMILARITYSCORES"
with
create column table "SYSTEM"."SIMILARITYSCORES
Check if this is a general INSERT problem. E.g. check how much time the following statement will take:
CREATE COLUMN TABLE TEST_TABLE as (SELECT TOP 1000 * FROM PUBLIC.M_CS_TABLES).
In my HANA system it took 7.5s.
Limit number of records to be inserted by your statement. Check how long will it take to insert different number of records e.g. 10, 100, 200
Monitor thread(s) executing the statement in Performance\Thread in Studio. Check if there is any evidence of lock situation and anything else suspicious. Look at column Status if there anything like Semaphore Wait etc. Switch on Create Call Stack check box to see if the time is spent in particular piece of code. This may help to search SAP Note for already known problems or give and idea how to analyse the problem further.
During or after execution of the statement check index server trace file and check if there are any messages that may be relevant and indicate potential root cause of the problem.

Regards,

Dmitry

Bojan-lv-85 · ‎02-27-2016

Hi Inanc,

without knowning th exact execution path of the query I'm afraid it will be quite hard to tell where exactly the most time is spent, except anyone sees anything obvious in there. The runtime differences can also be due to the fact that the first one needs to insert the subquery result which is a costly operation.

Anyway, a good starting point to find optimization potential is the following note

2000002 - FAQ: SAP HANA SQL Optimization

Give it a try and let us know how that goes. In any other case, maybe you could generate an executed PlanViz trace which would show where exactly the most time is spent.

Thanks! BR, Bojan

INSERT INTO TABLE (SELECT ...) Poor Performance

Accepted Solutions (0)

Answers (2)

Answers (2)

Re: Questions about the Conversion & Upgrade new S...

Questions about the Conversion & Upgrade new Simpl...

Re: Mass Update a Zfield in Std Table with Split &...

Error in SmartForm after changes

Re: odata Metadata issue for standard service