
Mass delete and performance

Former Member

Hi,

we use a few tables to log access to our various e-commerce sites.

Some of these tables are very large (20-50 million rows), so we decided to remove old data after consolidating it.

However, deleting a large number of rows takes a long time (it took about 18 minutes to remove 2 million rows).

We're removing all rows with primary key less than x (key is an integer).

Is there a method to mass delete with better performance (maybe disabling logging or ...)?

If not, can I send a delete statement that will run in the background (i.e. after disconnecting the client), possibly at a lower priority?

Rows are always inserted with a primary key greater than x (it comes from a DB sequence), so deletes and inserts never conflict.

Thank you

Accepted Solutions (1)

markus_doehr2
Active Contributor

> However, deleting a large number of rows takes a long time (it took about 18 minutes to remove 2 million rows).

> We're removing all rows with primary key less than x (key is an integer).

> Is there a method to mass delete with better performance (maybe disabling logging or ...)?

Well - without an example of how you delete the data, it's very difficult to suggest a "better" method.

> If not, can I send a delete statement that will run in the background (i.e. after disconnecting the client), possibly at a lower priority?

You can put the statement in the background using sqlcli and an input (and output) file.
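A minimal sketch of that approach - the database name, user, file names, table name and key value below are placeholders, and the exact sqlcli options can differ between versions (check sqlcli -h):

-- contents of delete.sql (table name and key value are examples only)
DELETE FROM ACCESSLOG WHERE ID < 123456789

-- started detached from the client session, e.g. on a Unix-like host:
-- nohup sqlcli -d MYDB -u someuser,somepassword -i delete.sql -o delete.out &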

Markus

Former Member

We're doing

SELECT MAX(ID) FROM TABLE WHERE DATE < the_first_date_we_preserve 

then

DELETE FROM TABLE WHERE ID < x

where x is the result of the select above.

markus_doehr2
Active Contributor

Does that table fit completely in the data cache? (CACHE_SIZE)

Did you try to delete a smaller amount of data (e.g. one month at a time) and run these deletes in parallel?

Markus

Answers (2)

Former Member

The data cache is 512 MB. This DB is only used for storing website accesses, so this table is mostly "insert only", 24 hours a day. A few queries run late at night, after midnight, to consolidate the previous day into another table, so all query results are in the cache: they touch only the most recently inserted rows.

The data size is now 17 GB (we have 6 identical databases, one for each site; this is the biggest).

The "delete" operation is done every 2-3 years so the db isn't optimized to do that.

I'll try to delete in batches.

I'm thinking that we should change our "delete" policy and, every day after the consolidation, remove the rows with date < (today - 3 years). This way only a small amount of data is removed each day, and it should be faster.
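For illustration, a sketch of that daily step reusing the MAX(ID) pattern from above (ACCESSLOG, LOGDATE and ID are placeholder names, and the literal date stands for "today - 3 years" as computed by whatever job runs the consolidation):

-- find the highest key older than the retention window
SELECT MAX(ID) FROM ACCESSLOG WHERE LOGDATE < '2006-12-31'
-- then, with x = the value returned above
DELETE FROM ACCESSLOG WHERE ID < x
COMMIT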

What I was really asking is whether there's a way to execute a mass delete that is faster than a traditional delete, accepting some compromises (for example, that it can't be rolled back).

Thank you

lbreddemann
Active Contributor

Hi Andrea,

I know that this might come a bit too late for your scenario, but how about this?

For your logging table you could implement something like poor-man's-partitioning yourself.

Instead of one big table, you create a separate table for each period, say each quarter.

You also create a synonym for the currently used "write-into-this" partition, so that your logging trigger doesn't have to be changed when a new partition should be used.

On top of that you create a UNION view over all partitions.

With that, your requirement of logging everything should be fulfilled, and (since you don't have to care about updates) you can keep the active partition small and get rid of old data very quickly (DROP TABLE / TRUNCATE TABLE).

If you go for the "truncate table" (which is pretty much the same as a "drop table" for MaxDB) then you can also think about having a fixed set of tables (Q1, Q2,... Q7, Q8) that you write to in a round-robin fashion. This would then be a kind of moving-window into your logging history.

The only thing to take care of here is that you'll need to have a small time-slice in which you change the synonym every quarter.
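A rough sketch of that layout - all table, synonym and view names are made-up examples, and the exact DDL should be checked against the MaxDB documentation:

-- one physical table per quarter (example columns)
CREATE TABLE ACCESSLOG_Q1 (ID FIXED(18) PRIMARY KEY, LOGDATE DATE, URL VARCHAR(500))
CREATE TABLE ACCESSLOG_Q2 (ID FIXED(18) PRIMARY KEY, LOGDATE DATE, URL VARCHAR(500))

-- the logging trigger always writes to the synonym, never to a quarter table directly
CREATE SYNONYM ACCESSLOG_CURRENT FOR ACCESSLOG_Q1

-- reads go through a UNION view over all partitions
CREATE VIEW ACCESSLOG_ALL AS
SELECT ID, LOGDATE, URL FROM ACCESSLOG_Q1
UNION ALL
SELECT ID, LOGDATE, URL FROM ACCESSLOG_Q2

-- at the quarter change: empty the next slot, then switch the synonym
TRUNCATE TABLE ACCESSLOG_Q2
DROP SYNONYM ACCESSLOG_CURRENT
CREATE SYNONYM ACCESSLOG_CURRENT FOR ACCESSLOG_Q2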

Just my two pence on this...

regards,

Lars

Former Member

Thanks for all the replies. We're going to delete a number of rows each day and then implement the delete-every-day procedure described in my previous post.

Thank you

Former Member

Hi,

another idea is that the lock list may be causing the problem, since so many locked rows are handled in one transaction.

OK, WE know (you told us, therefore we know) that there will be no conflicts, as the old rows will not be used and new rows will receive different keys, but the system does not know that, so the normal locking is done.

Perhaps it would therefore help to use several transactions, meaning:

delete where ... and try to specify a smaller amount per statement, perhaps 1000 or 10000 rows, to name some number.

And after this amount, do a commit, then the next delete, then commit and so on.
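A sketch of that batching, stepping the key range down in fixed slices (ACCESSLOG and ID are the placeholder names used earlier, 10000 is just an example step, and x is the MAX(ID) cutoff determined before):

DELETE FROM ACCESSLOG WHERE ID >= x - 10000 AND ID < x
COMMIT
DELETE FROM ACCESSLOG WHERE ID >= x - 20000 AND ID < x - 10000
COMMIT
-- ...repeat, moving the window down by one slice each time, until the lowest existing ID is passed

Each transaction then only holds locks for one slice, which keeps the lock list small.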

Good luck,

Elke