Solved: How to avoid cursor and make more efficient code

Former Member · 03-07-2016

Hello HANA experts,

We have a problem with a preprocessing procedure. It prepares some data to be processed with R. A simpler version of what we need to do can be summarized as:

We have two tables: A and B. We need to do an aggregation from table B for each row in table A, using some table A columns to limit the range of table B rows to be aggregated.

As an example, table A holds information about some buildings (construction start, construction end and location), and table B holds weather information for the last several years for many locations. We want to aggregate how much it rained during the construction of each building, and add that information to table A.

Currently we've developed it as a SQLScript procedure with a cursor, but it takes too long to preprocess the whole table because cursors are not parallelized. The result is a sequential execution of as many aggregation queries as there are rows in table A.

Do you have any advice on how to make our preprocessing stage more efficient? Is there any way to run the aggregation queries in parallel, avoiding the use of the cursor?

Thanks in advance!

Juan

suresh_devarajan · 03-08-2016

I think this can be done using a query like the one below. The inner queries locate the station closest to each building. The outer query joins the building and weather tables using the closest station ID found.

SELECT bloc.id_building, SUM(w1.prec) prec
FROM (
    /* Select the station with the least distance */
    SELECT e.*
    FROM (
        /* Rank the distance */
        SELECT d.*, RANK() OVER (PARTITION BY d.id_building ORDER BY d.dist ASC) rnk
        FROM (
            /* Distance between the buildings & stations */
            SELECT b.*, w.id_estation, b.location.ST_DISTANCE(w.obs_location) dist
            FROM buildings b
            CROSS JOIN (SELECT DISTINCT id_estation, obs_location FROM weather) w
        ) d
    ) e
    WHERE e.rnk = 1
) bloc
LEFT OUTER JOIN weather w1
    ON bloc.id_estation = w1.id_estation
    AND w1.obs_date BETWEEN bloc.cons_start AND bloc.cons_end
GROUP BY bloc.id_building
ORDER BY bloc.id_building;

former_member185511 · 03-08-2016

I think this document may help you.

After finding the closest distance you can aggregate the PREC value. What I didn't understand is which threshold you use to define the closest distance. If a station is 1 km from building A and 500 m from building B, can we accept that it rained on both buildings?

lucas_oliveira · 03-08-2016

Hi Juan,

I trust it might be possible to do what you're doing with plain SQL or WITH, but we need to understand exactly what you want to do.

Please provide the CREATE TABLE statements, data samples and the desired results for those samples.

BRs,
Lucas de Oliveira
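
For illustration, the plain SQL / WITH approach mentioned above could look roughly like the sketch below. It is not a confirmed solution: the table and column names (buildings, weather, id_building, id_estation, location, obs_location, obs_date, prec, cons_start, cons_end) are taken from the query posted earlier in this thread and have not been confirmed by the original poster, and it assumes each station has a single obs_location.

```sql
-- Sketch only: all table and column names are assumptions borrowed from
-- the query posted earlier in this thread.
WITH stations AS (
    -- One row per weather station (assumes one location per station)
    SELECT DISTINCT id_estation, obs_location
    FROM weather
),
distances AS (
    -- Distance from every building to every station
    SELECT b.id_building, b.cons_start, b.cons_end, s.id_estation,
           b.location.ST_DISTANCE(s.obs_location) AS dist
    FROM buildings b
    CROSS JOIN stations s
),
ranked AS (
    -- Number the stations of each building from nearest to farthest;
    -- id_estation breaks ties so exactly one station gets rn = 1
    SELECT d.*,
           ROW_NUMBER() OVER (PARTITION BY d.id_building
                              ORDER BY d.dist ASC, d.id_estation ASC) AS rn
    FROM distances d
)
-- Sum the precipitation of the nearest station over the construction period
SELECT r.id_building, SUM(w.prec) AS prec
FROM ranked r
LEFT OUTER JOIN weather w
    ON  w.id_estation = r.id_estation
    AND w.obs_date BETWEEN r.cons_start AND r.cons_end
WHERE r.rn = 1
GROUP BY r.id_building
ORDER BY r.id_building;
```

Because this is a single set-based statement, the engine is free to plan the joins and the aggregation as a whole instead of running one query per building, as the cursor does. ROW_NUMBER is used here rather than RANK so that two equally distant stations cannot both be selected, which would count their rainfall twice for the same building.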
